TextEncodingTypeEnum

TextEncodingTypeEnum enumeration constants are used to denote possible types of the output file encoding for export in HTML, TXT and CSV formats.

typedef enum {
 TET_Simple,
 TET_UTF8,
 TET_UTF16,
 TET_Auto
} TextEncodingTypeEnum;
    

Elements

Name Description
TET_Auto

Encoding is selected automatically. The selection depends on the values of other properties of the parameters object:

TET_Simple Simple encoding, one byte per symbol.
TET_UTF16 Native Unicode format where every symbol is represented by two-byte sequence.
TET_UTF8 Unicode UTF-8 format. UTF-8 is a code page that uses a string of bytes to represent a 16-bit Unicode string where ASCII text (<=U+007F) remains unchanged as a single byte, U+0080-07FF (including Latin, Greek, Cyrillic, Hebrew, and Arabic) is converted to a 2-byte sequence, and U+0800-FFFF (Chinese, Japanese, Korean, and others) becomes a 3-byte sequence.

Used in

ITextExportParams::EncodingType

IHTMLExportParams::EncodingType

IPlainText::SaveToTextFile

24.03.2023 8:51:52

Usage of Cookies. In order to optimize the website functionality and improve your online experience ABBYY uses cookies. You agree to the usage of cookies when you continue using this site. Further details can be found in our Privacy Notice.