RepositoryWebrefSimplifiedCompositeDoc

AI Overview😉

  • The potential purpose of this module is to process and store information about web pages, including their content, anchors, and metadata, in a compact and efficient way. This allows for faster and more efficient processing of web pages and their relationships.
  • This module could impact search results by influencing how web pages are ranked and displayed. For example, the "forwardingDups" attribute suggests that the module is involved in handling duplicate or canonical URLs, which could affect how search results are filtered and displayed. The "matchingMetadata" attribute implies that the module is also involved in determining why a particular web page is relevant to a search query, which could impact the ranking of search results.
  • To be more favorable to this module, a website could ensure that its content is well-structured and easily parseable, with clear and consistent metadata and anchor text. Additionally, using canonical URLs and avoiding duplicate content could help the module to better understand the website's structure and content, potentially leading to improved search engine rankings.

Interesting Module? Vote 👇

Voting helps other researchers find interesting modules.

Current Votes: 0

GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedCompositeDoc (google_api_content_warehouse v0.4.0)

Represents an information which is very close to composite doc, but compresses how the anchors are represented to save space. Next available tag: 18.

Attributes

  • anchors (type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedAnchors.t, default: nil) - The composite doc anchors trimmed and transformed in a smaller data structure and aggregated (if they have the exact same text).
  • cdocContainer (type: GoogleApi.ContentWarehouse.V1.Model.Proto2BridgeMessageSet.t, default: nil) - IMPORTANT: do not access this field directly, use the simplified-cdoc-access library functions to get the composite doc out of this proto.
  • documentMentionSpans (type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefRefconDocumentMentionSpans.t, default: nil) - The composite doc spans which were annotated with entities by Webref.
  • forwardingDups (type: list(GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedForwardingDup.t), default: nil) - Forwarding dups generated by merging existing forwarding dups within cdoc_container and URLs from ReferencePageCandidateList. Populated only before writing to online CDocs Spanner table and used for keeping an index on non canonical URLs in table. [!] Do not use unless fetching CDocs from Enricher's online Spanner CDocs database knowledge-enricher-cdocs-spanner.
  • matchingMetadata (type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefPreprocessingUrlMatchingMetadata.t, default: nil) - Metadata related to why this doc was matched to its owning entity.
  • pageContent (type: list(String.t), default: nil) -
  • refconDocumentMetadata (type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefRefconRefconDocumentMetadata.t, default: nil) - Additional document metadata needed by Refcon.
  • sourceSnapshotType (type: String.t, default: nil) -
  • url (type: String.t, default: nil) - The URL, populated independently of whether we have a CompositeDoc proto. If the cdoc exists, the url is the same as CompositeDoc.doc.url.
  • webrefOutlinkInfos (type: GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefOutlinkInfos.t, default: nil) -

Summary

Types

t()

Functions

decode(value, options)

Unwrap a decoded JSON object into its complex fields.

Types

Link to this type

t()

@type t() ::
  %GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedCompositeDoc{
    anchors:
      GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedAnchors.t()
      | nil,
    cdocContainer:
      GoogleApi.ContentWarehouse.V1.Model.Proto2BridgeMessageSet.t() | nil,
    documentMentionSpans:
      GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefRefconDocumentMentionSpans.t()
      | nil,
    forwardingDups:
      [
        GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefSimplifiedForwardingDup.t()
      ]
      | nil,
    matchingMetadata:
      GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefPreprocessingUrlMatchingMetadata.t()
      | nil,
    pageContent: [String.t()] | nil,
    refconDocumentMetadata:
      GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefRefconRefconDocumentMetadata.t()
      | nil,
    sourceSnapshotType: String.t() | nil,
    url: String.t() | nil,
    webrefOutlinkInfos:
      GoogleApi.ContentWarehouse.V1.Model.RepositoryWebrefWebrefOutlinkInfos.t()
      | nil
  }

Functions

Link to this function

decode(value, options)

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.