GoogleCloudDocumentaiV1DocumentPage

AI Overview😉

  • The potential purpose of this module is to analyze and understand the structure and content of a document, including its layout, text, images, and other visual elements. This module appears to be part of a larger document analysis system that can extract relevant information from documents.
  • This module could impact search results by allowing Google to better understand the content and structure of documents, which could lead to more accurate and relevant search results. For example, if a user searches for a specific phrase or keyword, this module could help Google identify documents that contain that phrase or keyword in a specific context or layout.
  • To be more favorable for this function, a website could optimize its documents by using clear and consistent layouts, using headings and subheadings to structure content, and ensuring that images and other visual elements are properly tagged and described. Additionally, using semantic HTML and providing alternative text for images could also help this module better understand the content and structure of a document.

Interesting Module? Vote 👇

Voting helps other researchers find interesting modules.

Current Votes: 0

GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPage (google_api_content_warehouse v0.4.0)

A page in a Document.

Attributes

  • blocks (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageBlock.t), default: nil) - A list of visually detected text blocks on the page. A block has a set of lines (collected into paragraphs) that have a common line-spacing and orientation.
  • detectedBarcodes (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDetectedBarcode.t), default: nil) - A list of detected barcodes.
  • detectedLanguages (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDetectedLanguage.t), default: nil) - A list of detected languages together with confidence.
  • dimension (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDimension.t, default: nil) - Physical dimension of the page.
  • formFields (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageFormField.t), default: nil) - A list of visually detected form fields on the page.
  • image (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageImage.t, default: nil) - Rendered image for this page. This image is preprocessed to remove any skew, rotation, and distortions such that the annotation bounding boxes can be upright and axis-aligned.
  • imageQualityScores (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageImageQualityScores.t, default: nil) - Image quality scores.
  • layout (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageLayout.t, default: nil) - Layout for the page.
  • lines (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageLine.t), default: nil) - A list of visually detected text lines on the page. A collection of tokens that a human would perceive as a line.
  • pageNumber (type: integer(), default: nil) - 1-based index for current Page in a parent Document. Useful when a page is taken out of a Document for individual processing.
  • paragraphs (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageParagraph.t), default: nil) - A list of visually detected text paragraphs on the page. A collection of lines that a human would perceive as a paragraph.
  • provenance (type: GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentProvenance.t, default: nil) - The history of this page.
  • symbols (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageSymbol.t), default: nil) - A list of visually detected symbols on the page.
  • tables (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageTable.t), default: nil) - A list of visually detected tables on the page.
  • tokens (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageToken.t), default: nil) - A list of visually detected tokens on the page.
  • transforms (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageMatrix.t), default: nil) - Transformation matrices that were applied to the original document image to produce Page.image.
  • visualElements (type: list(GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageVisualElement.t), default: nil) - A list of detected non-text visual elements e.g. checkbox, signature etc. on the page.

Summary

Types

t()

Functions

decode(value, options)

Unwrap a decoded JSON object into its complex fields.

Types

Link to this type

t()

@type t() :: %GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPage{
  blocks:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageBlock.t()
    ]
    | nil,
  detectedBarcodes:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDetectedBarcode.t()
    ]
    | nil,
  detectedLanguages:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDetectedLanguage.t()
    ]
    | nil,
  dimension:
    GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageDimension.t()
    | nil,
  formFields:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageFormField.t()
    ]
    | nil,
  image:
    GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageImage.t()
    | nil,
  imageQualityScores:
    GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageImageQualityScores.t()
    | nil,
  layout:
    GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageLayout.t()
    | nil,
  lines:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageLine.t()
    ]
    | nil,
  pageNumber: integer() | nil,
  paragraphs:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageParagraph.t()
    ]
    | nil,
  provenance:
    GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentProvenance.t()
    | nil,
  symbols:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageSymbol.t()
    ]
    | nil,
  tables:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageTable.t()
    ]
    | nil,
  tokens:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageToken.t()
    ]
    | nil,
  transforms:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageMatrix.t()
    ]
    | nil,
  visualElements:
    [
      GoogleApi.ContentWarehouse.V1.Model.GoogleCloudDocumentaiV1DocumentPageVisualElement.t()
    ]
    | nil
}

Functions

Link to this function

decode(value, options)

@spec decode(struct(), keyword()) :: struct()

Unwrap a decoded JSON object into its complex fields.