TextEncodingTypeEnum

TextEncodingTypeEnum enumeration constants are used to specify encoding type of a Unicode file.

typedef enum {
 TET_UTF16LE,
 TET_UTF16BE,
 TET_UTF8,
 TET_UTF8_WoBOM
} TextEncodingTypeEnum;
    

Elements

Name Description
TET_UTF16LE Unicode UTF-16LE format. UTF-16BE is a 16-bit variable-width encoding, each 16-bit unit is represented in little-endian byte order.
TET_UTF16BE Unicode UTF-16BE format. UTF-16BE is a 16-bit variable-width encoding, each 16-bit unit is represented in big-endian byte order.
TET_UTF8 Unicode UTF-8 format. UTF-8 is a code page that uses a string of bytes to represent a 16-bit Unicode string where ASCII text (<=U+007F) remains unchanged as a single byte, U+0080-07FF (including Latin, Greek, Cyrillic, Hebrew, and Arabic) is converted to a 2-byte sequence, and U+0800-FFFF (Chinese, Japanese, Korean, and others) becomes a 3-byte sequence.
TET_UTF8_WoBOM Unicode UTF-8 format without the byte order mark (BOM) at the start of the text stream.

Used in

ITextExportParams::EncodingType

IXMLExportParams::EncodingType

15.08.2023 13:19:30

Usage of Cookies. In order to optimize the website functionality and improve your online experience ABBYY uses cookies. You agree to the usage of cookies when you continue using this site. Further details can be found in our Privacy Notice.