UiPath Certified Professional Specialized AI Professional v1.0 UiPath-SAIv1 Question # 40 Topic 5 Discussion
UiPath-SAIv1 Exam Topic 5 Question 40 Discussion:
Question #: 40
Topic #: 5
What is the Document Object Model (DOM) in the context of Document Understanding?
A.
The DOM is a JSON object containing information such as name, content type, text length, number of pages, page rotation, detected language, content, and coordinates for the words identified in the file.
B.
The DOM is a built-in artificial intelligence system that automatically understands and interprets the content and the type of documents, eliminating the need for manual data extraction.
C.
The DOM is a feature that allows you to convert physical documents into virtual objects that can be manipulated using programming code.
D.
The DOM is a graphical user interface (GUI) tool in UiPath Document Understanding that provides visual representations of documents, making it easier for users to navigate and interact with the content.
The Document Object Model (DOM) is a data representation of the objects that comprise the structure and content of a document on the web1. In the context of Document Understanding, the DOM is a JSON object that is generated by the Digitize Document activity, which uses the UiPath Document OCR engine to extract the text and layout information from the input document2. The DOM contains the following properties for each document3:
name: The name of the document file.
contentType: The MIME type of the document file, such as application/pdf or image/jpeg.
textLength: The number of characters in the document text.
pages: An array of objects, each representing a page in the document. Each page object has the following properties:
pageNumber: The number of the page, starting from 1.
rotation: The angle of rotation of the page, in degrees. A positive value indicates clockwise rotation, and a negative value indicates counterclockwise rotation.
language: The language code of the page, such as en or fr.
content: An array of objects, each representing a word or a line in the page. Each content object has the following properties:
type: The type of the content, either word or line.
text: The text of the content.
boundingBox: An array of four numbers, representing the coordinates of the top-left and bottom-right corners of the content, in the format [x1, y1, x2, y2]. The coordinates are relative to the page, with the origin at the top-left corner, and the unit is pixel.
confidence: A number between 0 and 1, indicating the confidence level of the OCR engine in recognizing the content.
The DOM can be used as an input for other activities in the Document Understanding framework, such as Classify Document Scope, Data Extraction Scope, or Export Extraction Results. The DOM can also be manipulated using programming code, such as JavaScript or Python, to perform custom operations on the document data.
References:
1: Introduction to the DOM - Web APIs | MDN 2: Digitize Document 3: Document Object Model
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit