Dialog Box: Alto XML

This dialog box allows you to specify Alto XML format settings. This format is mainly used by electronic libraries.

Set the parameters for saving the recognized text into an Alto XML file:

Option Option description
ALTO version

Specifies the version of Alto XML to be used for the output file:

  • 2.0;
  • 3.0;
  • 3.1 (default);
  • 4.0;
  • 4.1;
  • 4.2.

Detection of text coordinates

Specifies how text should be divided: by Words or by Lines.
Character formatting

Select the desired font formatting mode:

  • Plain. Text formatting is not preserved (except subscript and superscript).
  • Restricted. Retains fonts, font sizes, and paragraphs, but does not retain the exact locations of the objects on the page or the spacing. The resulting text will be left-aligned.
  • Full. Produced document maintains the formatting of the original.
Measurement unit

Specifies the measurement unit used to describe size and coordinates of objects in the output XML file:

  • inch/1200,
  • mm/10,
  • pixel.

Write coordinates based on original image

(checkbox)

If this option is enabled, the coordinates of all objects in the output ALTO XML file will be relative to the source image. If it is disabled, the coordinates in the output file will be relative to the image that would be produced when exporting to an image format.

See also

Output Format Settings Dialog Box

3/26/2024 1:49:49 PM

Please leave your feedback about this article

Usage of Cookies. In order to optimize the website functionality and improve your online experience ABBYY uses cookies. You agree to the usage of cookies when you continue using this site. Further details can be found in our Privacy Notice.