WordModelTypeEnum
The WordModelTypeEnum enumeration constants describe the type of the word model.
typedef enum {
WMT_MonolingualWord,
WMT_RegExpWord,
WMT_BilingualComposite,
WMT_Acronym,
WMT_Number,
WMT_NumberWithQualifier,
WMT_WordNumberComposite,
WMT_BilingualWordNumberComposite,
WMT_RomanNumber,
WMT_MonolingualWordWithExtras,
WMT_MixedFormDictionaryWord,
WMT_PhoneNumber,
WMT_Punctuation,
WMT_FileName,
WMT_UrlOrEmail,
WMT_NoSuitableModel,
WMT_MRZString
} WordModelTypeEnum;
Elements
Name | Description |
---|---|
WMT_Acronym | An acronym consisting of capital letters. The word can contain digits and separators, e.g., "B2B", "C.E.R.N.". |
WMT_BilingualComposite | A bilingual compound word with an explicit dividing point. |
WMT_BilingualWordNumberComposite | A compound word with explicit dividing points consisting of two words belonging to different languages and a number, e.g., "Windows-2000-kompatibel". |
WMT_FileName | A DOS/Windows or UNIX file name, e.g., "README.TXT", "C:\WINNT\system32", "/etc/motd.rc". |
WMT_MixedFormDictionaryWord | A word from the language dictionary that contains some characters (for example, digits) outside the language alphabet, e.g., "100BaseTX". |
WMT_MonolingualWord | A common word. Its grammar is determined by the language alphabet. The word can also contain separator characters, e.g., "/" or "-". |
WMT_MonolingualWordWithExtras | A word consisting of the language alphabet characters and special characters, digits, etc., e.g., "Alias|Wavefront". |
WMT_MRZString | A machine-readable zone (MRZ) string of a passport, e.g., "P<USAERIKSSON<<ANNA<MARIA<<<<<<<<<<<<<<<<<<<". |
WMT_NoSuitableModel | A word that does not meet any word model. Every word character is recognized separately, without context. The recognition result may be a meaningless character sequence. |
WMT_Number | A word consisting of digits and punctuators, e.g., "123", "4.56", "#789". |
WMT_NumberWithQualifier | A word with a prefix or suffix that serves as a qualifier or inflexion, e.g., "USD250", "1.2GHz", "2nd". |
WMT_PhoneNumber | A phone number. A prefix is allowed, e.g., "Ph.(495)123-45678". |
WMT_Punctuation | A set of punctuation marks separated from a word by one or more spaces. |
WMT_RegExpWord | A word from the language whose grammar is described by a regular expression. |
WMT_RomanNumber | A Roman numeral. |
WMT_UrlOrEmail | A URL or e-mail address, e.g., "http://www.abbyy.com", "engine_support@abbyy.com". |
WMT_WordNumberComposite | A compound word with an explicit dividing point consisting of a word and a digit, e.g., "2-meter". |
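The constants above are often easier to work with when they can be logged by name and when unreliable results can be singled out. The following is a minimal sketch in C under stated assumptions: the enumeration is copied from the declaration above so that the block compiles without the SDK headers, and the helper functions `WordModelTypeName` and `NeedsManualReview` are hypothetical names introduced only for illustration. They are not part of the documented API.

```c
#include <stddef.h>

/* Copied from the declaration above so this sketch is self-contained. */
typedef enum {
    WMT_MonolingualWord,
    WMT_RegExpWord,
    WMT_BilingualComposite,
    WMT_Acronym,
    WMT_Number,
    WMT_NumberWithQualifier,
    WMT_WordNumberComposite,
    WMT_BilingualWordNumberComposite,
    WMT_RomanNumber,
    WMT_MonolingualWordWithExtras,
    WMT_MixedFormDictionaryWord,
    WMT_PhoneNumber,
    WMT_Punctuation,
    WMT_FileName,
    WMT_UrlOrEmail,
    WMT_NoSuitableModel,
    WMT_MRZString
} WordModelTypeEnum;

/* Hypothetical helper: returns the constant's name, e.g. for logging. */
static const char *WordModelTypeName(WordModelTypeEnum type)
{
    switch (type) {
    case WMT_MonolingualWord:              return "WMT_MonolingualWord";
    case WMT_RegExpWord:                   return "WMT_RegExpWord";
    case WMT_BilingualComposite:           return "WMT_BilingualComposite";
    case WMT_Acronym:                      return "WMT_Acronym";
    case WMT_Number:                       return "WMT_Number";
    case WMT_NumberWithQualifier:          return "WMT_NumberWithQualifier";
    case WMT_WordNumberComposite:          return "WMT_WordNumberComposite";
    case WMT_BilingualWordNumberComposite: return "WMT_BilingualWordNumberComposite";
    case WMT_RomanNumber:                  return "WMT_RomanNumber";
    case WMT_MonolingualWordWithExtras:    return "WMT_MonolingualWordWithExtras";
    case WMT_MixedFormDictionaryWord:      return "WMT_MixedFormDictionaryWord";
    case WMT_PhoneNumber:                  return "WMT_PhoneNumber";
    case WMT_Punctuation:                  return "WMT_Punctuation";
    case WMT_FileName:                     return "WMT_FileName";
    case WMT_UrlOrEmail:                   return "WMT_UrlOrEmail";
    case WMT_NoSuitableModel:              return "WMT_NoSuitableModel";
    case WMT_MRZString:                    return "WMT_MRZString";
    }
    return "unknown"; /* value outside the documented set */
}

/* Hypothetical helper: returns 1 when the word matched no model.
 * Per the table above, such a word is recognized character by character
 * without context, so its text may be a meaningless sequence. */
static int NeedsManualReview(WordModelTypeEnum type)
{
    return type == WMT_NoSuitableModel;
}
```

A caller might, for example, pass each recognized word's model type to `NeedsManualReview` and route flagged words to a verification step. How the model type of a word is obtained depends on the API that exposes this enumeration and is not shown here.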
17.09.2024 15:14:41