Voting helps other researchers find interesting modules.
The SnapshotDocument contains a list of TextNode's. Each node contains a string of text of the webpage, its bounding box in the agove snapshot image, and its font size (in number of pixels in the snapshot, which could be a fraction number since the snapshot image is typically shrinked). This list of text nodes are extracted from the output from the rendering service: htmlrender_webkit_headless_proto.Document The extraction is done by TrimDocument defined in ./shared/doctrimmer.cc
boundingBox
GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t
nil
fontSize
number()
inLink
integer()
maxSplit
text
String.t
Unwrap a decoded JSON object into its complex fields.
@type t() :: %GoogleApi.ContentWarehouse.V1.Model.SnapshotTextNode{ boundingBox: GoogleApi.ContentWarehouse.V1.Model.SnapshotBox.t() | nil, fontSize: number() | nil, inLink: integer() | nil, maxSplit: integer() | nil, text: String.t() | nil }
@spec decode(struct(), keyword()) :: struct()