XML output
The following table describes various available output XML file types. The Skill type column lists the skill types that can generate a particular type of XML file.
File type | Export option | Description | Skill type | File name | XML schema |
---|---|---|---|---|---|
XML | Text only |
This mode is suitable for extracting all text from the input image, including small text areas of low quality. The document appearance and structure are ignored, pictures and tables are not detected. It is designed for the situations when you need to retrieve the data from the image for some further processing on your side, such as extracting data from bills, receipts or invoices. Selecting this mode makes export to DOCX and XLSX impossible. |
OCR | <Source_file_name>.xml ** | XML schema |
Process | <Applied_skill_name>.xml * | ||||
Preserve document structure | This mode is focused on retaining the original document structure and appearance, including font styles, pictures, background color, etc., and is more focused on documents like agreements, contracts, specifications. | OCR | <Source_file_name>.xml ** | ||
Process | <Applied_skill_name>.xml * |
*<Applied_skill_name> will be one of the following:
- The name of the skill if a transaction was created for a Document or Classification skill.
- If a transaction was created for a Process skill:
- the name of the last Document skill applied to the document
- the name of the last Classification skill applied to the document if no Document skills were applied
- "Unknown" if no Document and Classification skills were applied, while at least one of them exists in the Process skill flow.
**<Source_file_name> will be the name of the first file used to assemble the document, e.g. "IMG_12234".
22.12.2023 12:36:42