Dialog Box: CSV

This dialog box allows you to specify CSV format settings.

Set the parameters for saving the recognized text into a CSV file:

Option Description
Text settings group
Ignore text outside tables

Select this option if you only want to save tables in the CSV file.

Insert page break character (#12) to separate pages

Select this option if you want the original division into pages to be retained in the CSV file.

Field separator

(drop-down list)

Specifies the character that will separate the fields in the CSV file.
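
To illustrate how the field separator and the page break character affect a program that reads the saved file, here is a minimal Python sketch. It assumes that a semicolon was chosen as the field separator and that character #12 (form feed, `\x0c`) appears between pages. The function name `read_pages` and the sample data are hypothetical and are not part of the product.

```python
import csv
import io

FORM_FEED = "\x0c"  # character #12, assumed here to separate pages


def read_pages(text, separator=","):
    """Split exported CSV text into pages, then parse each page into rows."""
    pages = []
    for page_text in text.split(FORM_FEED):
        reader = csv.reader(io.StringIO(page_text, newline=""), delimiter=separator)
        # Keep only non-empty rows; blank lines produce empty lists.
        pages.append([row for row in reader if row])
    return pages


# Hypothetical two-page export using ";" as the field separator.
sample = "name;qty\nbolt;4\n\x0cname;qty\nnut;7\n"
pages = read_pages(sample, separator=";")
for number, rows in enumerate(pages, start=1):
    print(f"page {number}: {rows}")
```

The sketch splits on the form feed before parsing, so it would not handle a quoted field that itself contains character #12.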
Character encoding group

Encoding type

(drop-down list)

Specifies the encoding type of the output file in CSV format:

  • Simple
    Simple encoding, one byte per symbol.
  • Unicode UTF-16
    Native Unicode format where most symbols are represented by a two-byte sequence; symbols above U+FFFF are represented by a four-byte sequence.
  • Unicode UTF-8
    Unicode UTF-8 format. UTF-8 is a variable-length encoding that represents each Unicode character as a sequence of one to four bytes: ASCII text (U+0000-U+007F) remains unchanged as a single byte, U+0080-U+07FF (including Latin, Greek, Cyrillic, Hebrew, and Arabic) is converted to a 2-byte sequence, U+0800-U+FFFF (Chinese, Japanese, Korean, and others) becomes a 3-byte sequence, and characters above U+FFFF become a 4-byte sequence.
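
The byte counts described for the Unicode encodings can be checked with a short Python sketch. The helper name `encoded_sizes` and the sample characters are chosen only for illustration.

```python
def encoded_sizes(ch):
    """Return (UTF-8 length, UTF-16 length) in bytes for one character."""
    # "utf-16-le" is used so that no byte order mark is counted.
    return len(ch.encode("utf-8")), len(ch.encode("utf-16-le"))


# One sample per range: ASCII, Latin, Cyrillic, Chinese, and a symbol above U+FFFF.
samples = ["A", "\u00e9", "\u042f", "\u4e2d", "\U0001F600"]
for ch in samples:
    utf8_len, utf16_len = encoded_sizes(ch)
    print(f"U+{ord(ch):04X}: UTF-8 = {utf8_len} byte(s), UTF-16 = {utf16_len} byte(s)")
```

For text that is mostly ASCII, UTF-8 therefore produces a smaller file than UTF-16, while text in Chinese, Japanese, or Korean is more compact in UTF-16.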

Code page

(drop-down list)

By default, the code page is detected automatically; this corresponds to the (Automatic) value. If necessary, you can select the code page manually by choosing the required value from the list.
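
The following Python sketch shows why the code page matters when a file uses a one-byte encoding. It assumes that the code page determines how each byte maps to a character, which this page does not state explicitly. The two code pages used, `cp1251` (Cyrillic) and `cp1252` (Western European), are examples only.

```python
# A Russian word written with escapes so the example does not depend on the source file encoding.
original = "\u041f\u0440\u0438\u0432\u0435\u0442"

# Save the text with one byte per symbol using the Cyrillic code page.
raw = original.encode("cp1251")

# Reading the bytes back with the same code page restores the text.
right = raw.decode("cp1251")

# Reading the same bytes with a different code page yields different characters.
wrong = raw.decode("cp1252")

print(len(raw), right == original, wrong == original)
```

The bytes are identical in both cases; only the code page used to interpret them differs, so a program that reads the CSV file needs to use the same code page that was selected when the file was saved.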
See also

Output Format Settings Dialog Box

