XML schema
The schema described in this article determines the structure of XML files containing document processing results for an OCR skill and Process skill, if an XML format enabled in Output activity.
All object properties, their data types, and their allowed values are listed in the XML schema, hence we recommend familiarizing yourself with it in order to make it easier to set up tools to analyze XML files. You can download the XML schema here.
The root object describes the whole recognized document. General information about the document is stored in the page and documentData elements.
The table below lists all root document element properties:
Property | Data type | Description |
---|---|---|
version * | string |
The version of the XML schema document. Set to the following by default: Vantage OCR.Skill XML output v1.0 |
producer * | string |
The source of the XML file. Set to the following by default: ABBYY Vantage OCR.Skill |
languages | string | A list of all languages detected in the document. |
12/22/2023 12:36:42 PM