Output JSON format

The result of text recognition is saved by default in JSON format. You can easily parse the output JSON to extract the data you need.

The output represents one recognized document. The main keys are:

  • definitions key provides the description of keys that are reused in the keys with document content
  • properties key has the full contents of the document:
    • languages key with the list of languages used to recognize the text
    • layout key with the information about all document pages and blocks found on each page, including full data on block location, type, content, and recognition confidence
    • content key with the logical structure of the document: the arrays of all paragraphs and all lists found in the document, each linked to the ID of the layout block or blocks where the paragraph or list is located

For all the details, see the JSON schema file that is provided with your distribution.

19.02.2024 10:23:36

Usage of Cookies. In order to optimize the website functionality and improve your online experience ABBYY uses cookies. You agree to the usage of cookies when you continue using this site. Further details can be found in our Privacy Notice.