FlexiLayouts and classifiers have a variety of user-defined settings, including pre-recognition settings such as recognition languages, text type, pre-recognition modes and areas. Selecting the right pre-recognition settings will help you create FlexiLayouts and Classifiers that are well-suited for processing your documents.
You can change pre-recognition settings in the Pre-recognition Properties dialog box. To open this dialog box:
- Click Properties... on the FlexiLayout or Classifier menu or on the shortcut menu of the FlexiLayout or Classifier.
- Click the Advanced Pre-recognition Properties... button on the General tab of the Properties of %Name% dialog box.
The Pre-recognition Properties dialog box will open. The options available in this dialog box are listed below.
The method that was used for printing the text on the documents:
Determine the type of text and evaluate its quality before selecting these options.
|Text languages||The languages used in the documents. You can select one or several languages from the drop-down list. For the full list of available languages, see OCR languages supported in ABBYY FlexiLayout™ Studio.|
|User dictionaries||This group of options lets you add user dictionaries. User dictionaries are used to improve recognition quality by supplementing built-in dictionaries with specialized vocabulary, abbreviations, company names, etc.|
This group contains two barcode processing options:
Contains options for processing CJK (Chinese, Japanese, and Korean) languages.
Extract named entities – Select this option to extract meaningful information from a field or field group using NLP methods.
Note. This option is only available for licenses that include an NLP module.
|Vertical text extraction||
Vertical text extraction parameters:
|Pre-recognition area||The area to be pre-recognized. You can specify position of pre-recognition area relative to page edges.|
|User pattern||This option allows you to add a user pattern in PTN or FBT format. We recommend using these user patterns if your documents contain non-standard fonts and characters.|