TeragoogleDocumentInfo

AI Overview😉

  • This module appears to extract and store information about a document, such as its content, format, and structure, so that it can be queried and analyzed efficiently. That information could improve search results by giving a more detailed picture of the document's content and relevance.
  • The module could affect search results by helping Google's systems understand a document's content and structure and use that understanding when ranking. For example, the averageTermWeight attribute could indicate how much weight a document's terms carry, while the section and sectionType attributes could describe how the document is divided and how each section's tokens are formatted.
  • A website could make its documents easier for this module to process by keeping them well structured and giving them relevant, descriptive content. That could mean using clear and concise headings, organizing content into logical sections, and using keywords and phrases that accurately describe the document. Websites could also encode their documents in a way that is easy to analyze, such as UTF-8.

GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfo (google_api_content_warehouse v0.4.0)

The docshards are a FlatSSTable from (64-bit) urlfp to data containing the information found in this protocol buffer. (It's not actually one of these PBs for various performance reasons; instead, the DocumentFormat class handles serialization and deserialization in our own way.) Next available ID: 37

Attributes

  • attachment (type: list(GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfoAttachment.t), default: nil) -
  • averageTermWeight (type: integer(), default: nil) - The average weight of terms in the document. If not available, there will be no term weight averaging: font sizes will be taken literally from the document HTML.
  • doc (type: GoogleApi.ContentWarehouse.V1.Model.GDocumentBase.t, default: nil) - The document itself. If present, the docservers parse the contents to create a mustang repository.
  • extendedDocid (type: String.t, default: nil) - The serialized ExtendedDocId, needed to construct a proper docinfo response if the docinfo request is missing it and it's present.
  • globalDocid (type: String.t, default: nil) - The global docid. The docservers (Continuum mode) need it to construct a proper docinfo response when the docinfo request does not contain a valid global docid.
  • originalEncoding (type: integer(), default: nil) - If the original encoding isn't UTF8
  • section (type: list(GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfoSection.t), default: nil) -
  • sectionType (type: String.t, default: nil) - Indicates format of 'tokens' field in all Section entries.
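
As a rough illustration of the attribute list above, the sketch below defines a local stand-in struct with the same field names and reports which fields are populated. DocInfoSketch and populated_fields/1 are hypothetical names made up for this example and are not part of google_api_content_warehouse. Every field defaults to nil, matching the documented defaults.

```elixir
defmodule DocInfoSketch do
  # Hypothetical local stand-in for TeragoogleDocumentInfo. The field names
  # are copied from the attribute list; everything else is an assumption.
  defstruct attachment: nil,
            averageTermWeight: nil,
            doc: nil,
            extendedDocid: nil,
            globalDocid: nil,
            originalEncoding: nil,
            section: nil,
            sectionType: nil

  # Returns the sorted names of the fields that hold a non-nil value.
  def populated_fields(%__MODULE__{} = info) do
    info
    |> Map.from_struct()
    |> Enum.reject(fn {_key, value} -> is_nil(value) end)
    |> Enum.map(fn {key, _value} -> key end)
    |> Enum.sort()
  end
end

# Placeholder values, for illustration only.
info = struct(DocInfoSketch, globalDocid: "example-docid", sectionType: "example-type")
IO.inspect(DocInfoSketch.populated_fields(info))
```

Because every attribute may be nil, code that reads such a struct would likely need to check each field before using it, which is what the helper is meant to show.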

Summary

Types

t()

Functions

decode(value, options)

Unwrap a decoded JSON object into its complex fields.

Types

t()

@type t() :: %GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfo{
  attachment:
    [GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfoAttachment.t()]
    | nil,
  averageTermWeight: integer() | nil,
  doc: GoogleApi.ContentWarehouse.V1.Model.GDocumentBase.t() | nil,
  extendedDocid: String.t() | nil,
  globalDocid: String.t() | nil,
  originalEncoding: integer() | nil,
  section:
    [GoogleApi.ContentWarehouse.V1.Model.TeragoogleDocumentInfoSection.t()]
    | nil,
  sectionType: String.t() | nil
}

Functions

decode(value, options)

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.
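
One way to picture "unwrapping" complex fields is a function that takes a struct whose nested fields are still plain maps and turns them into their own structs. The sketch below assumes that reading. SectionSketch and DecodeSketch are hypothetical stand-ins invented for this example, and the tokens value is a placeholder. It is not the library's implementation, only an example with the same decode(struct, keyword) shape as the spec above.

```elixir
defmodule SectionSketch do
  # Hypothetical stand-in for TeragoogleDocumentInfoSection. The single
  # field is illustrative; the source only mentions a 'tokens' field.
  defstruct tokens: nil
end

defmodule DecodeSketch do
  # Hypothetical stand-in for the parent struct, reduced to two fields.
  defstruct section: nil, sectionType: nil

  # Same shape as the documented spec: takes a struct and a keyword list,
  # returns a struct. Plain maps under :section become SectionSketch structs.
  def decode(%__MODULE__{} = value, _options) do
    %{value | section: unwrap_sections(value.section)}
  end

  # A missing section list stays nil, matching the documented default.
  defp unwrap_sections(nil), do: nil

  defp unwrap_sections(sections) when is_list(sections) do
    Enum.map(sections, fn fields -> struct(SectionSketch, fields) end)
  end
end

# A value as it might look right after JSON parsing: nested data is a map.
raw = struct(DecodeSketch, section: [%{tokens: "abc"}], sectionType: "example-type")
decoded = DecodeSketch.decode(raw, [])
IO.inspect(decoded.section)
```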