Output JSON format
The result of text recognition is saved by default in JSON format. You can easily parse the output JSON to extract the data you need.
The output represents one recognized document. The main keys are:
- definitions key provides the description of keys that are reused in the keys with document content
- properties key has the full contents of the document:
- languages key with the list of languages used to recognize the text
- layout key with the information about all document pages and blocks found on each page, including full data on block location, type, content, and recognition confidence
- content key with the logical structure of the document: the arrays of all paragraphs and all lists found in the document, each linked to the ID of the layout block or blocks where the paragraph or list is located
For all the details, see the JSON schema file that is provided with your distribution.
2/19/2024 10:23:36 AM