IndexingDocjoinerServingTimeClusterId

AI Overview😉

  • The potential purpose of this module is to group similar documents together, such as translated pages or paginated content, to prevent duplicates from appearing in search results. This ensures that only one representative document from each cluster is shown to the user.
  • This module could impact search results by filtering out duplicate or very similar documents, providing users with a more diverse set of results. It may also prioritize documents that are more likely to be relevant to the user's query, as the clustered documents are likely to have similar content.
  • A website may change things to be more favorable for this function by ensuring that their translated or paginated pages are properly linked together, using consistent URL structures and pagination patterns. This would allow the module to accurately identify and cluster similar documents, increasing the visibility of the website's content in search results.

Interesting Module? Vote 👇

Voting helps other researchers find interesting modules.

Current Votes: 0

GoogleApi.ContentWarehouse.V1.Model.IndexingDocjoinerServingTimeClusterId (google_api_content_warehouse v0.4.0)

The serving cluster id metadata. Why we cluster the documents. The default value is UNKNOWN. Serving-Side Clustering is used for pages which represent the same content but are not identical (e.g. translated pages, or paginated content). Those documents are not dup-clustered in Alexandria so that all documents (and their tokens) are available to search queries. However, those documents are assigned the same Serving-Time-Cluster-ID (on the same reasontype), which during serving guarantees that only one of them is shown to the user.

Attributes

  • clusterId (type: String.t, default: nil) - The unique id to distinguish members in cluster. It could be generated in different ways according to reason, e.g. LINK_REL_NEXT_PREVIOUS: it is the fingerprint of the URL of index 0; PAGE_PARAMETER_INFO: it is the fingerprint of pagination pattern (pagination_pattern_fp field in PageParamInfo message).
  • indexForDebugging (type: integer(), default: nil) - The member index of this document in cluster. Starts from "0". Note that indices of pages in a cluster may not be consistent with each other, because we may update them at different points in time. Currently only used for debugging.
  • reason (type: String.t, default: nil) - The reason why this document is clustered into this cluster.

Summary

Types

t()

Functions

decode(value, options)

Unwrap a decoded JSON object into its complex fields.

Types

Link to this type

t()

@type t() ::
  %GoogleApi.ContentWarehouse.V1.Model.IndexingDocjoinerServingTimeClusterId{
    clusterId: String.t() | nil,
    indexForDebugging: integer() | nil,
    reason: String.t() | nil
  }

Functions

Link to this function

decode(value, options)

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.