XML schema

The schema described in this article determines the structure of XML files that contain document processing results for an OCR skill or a Process skill when the XML format is enabled in the Output activity.

The XML schema lists all object properties, their data types, and their allowed values, so we recommend reviewing it before you set up tools that analyze the XML files. You can download the XML schema here.

The root object describes the whole recognized document. General information about the document is stored in the page and documentData elements.

The table below lists all root document element properties:

Property | Data type | Description
--- | --- | ---
version * | string | The version of the XML schema document. Set to the following by default: Vantage OCR.Skill XML output v1.0
producer * | string | The source of the XML file. Set to the following by default: ABBYY Vantage OCR.Skill
languages | string | A list of all languages detected in the document.

* Indicates a required property.
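
As a rough illustration of how a tool might read these root properties, here is a minimal Python sketch. It rests on several assumptions that this article does not confirm: that the root element is named document, that the properties are stored as XML attributes, that the file uses no namespaces, and that languages holds a plain string such as "English". The sample XML and the read_root_properties helper are hypothetical. Check the downloaded schema for the actual names before relying on this.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample output. The root tag name, the use of attributes,
# and the "languages" value format are assumptions, not taken from the schema.
SAMPLE_XML = """<document version="Vantage OCR.Skill XML output v1.0"
          producer="ABBYY Vantage OCR.Skill"
          languages="English">
  <page/>
  <documentData/>
</document>"""

REQUIRED = ("version", "producer")  # marked with * in the table
OPTIONAL = ("languages",)


def read_root_properties(xml_text):
    """Return the root properties as a dict; raise if a required one is absent."""
    root = ET.fromstring(xml_text)
    missing = [name for name in REQUIRED if name not in root.attrib]
    if missing:
        raise ValueError("missing required properties: " + ", ".join(missing))
    # Optional properties that are absent are returned as None.
    return {name: root.attrib.get(name) for name in REQUIRED + OPTIONAL}


props = read_root_properties(SAMPLE_XML)
print(props)
```

If the actual schema stores these properties as child elements rather than attributes, the lookup would need to use root.find() instead of root.attrib.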

22.12.2023 12:36:42
