Glossary

A B C D F I L M O P R S T U

A

ABBYY Business Card Reader is a handy application that enables users to scan business cards, capture the contact data, and export the captured contacts into various electronic formats. The application can also handle scans and photos of business cards stored on users' computers.

ABBYY Hot Folder is a scheduling agent which allows users to select a folder with images and specify the time for processing the images in this folder. ABBYY FineReader will automatically process the images from the selected folder.

ABBYY Screenshot Reader is an application that enables users to create screenshots and recognize texts on them.

Abbreviation is a shortened form of a word or phrase (e.g. MS-DOS for Microsoft Disk Operating System, UN for United Nations, etc.).

Activation is the process of obtaining a special code from ABBYY which allows the user to use his/her copy of the product in full mode on a given computer.

Activation code is a code that is issued by ABBYY to each user of ABBYY FineReader 12 during activation. An activation code is used to activate ABBYY FineReader on the computer that generated the Product ID.

Active area is a currently selected area on the image that can be deleted, moved or modified. To make an area active, click it. The frame enclosing an active area is bold and has sizing handles that can be dragged to change its size.

ADRT (Adaptive Document Recognition Technology) is a technology that increases the quality of conversion of multi-page documents. For example, it can recognize such elements as headings, headers and footers, footnotes, page numbering, and signatures.

Area is a section of an image enclosed by a frame and containing a certain type of data. Before performing OCR, ABBYY FineReader detects text, picture, table, and barcode areas in order to determine which sections of the image should be recognized and in what order.

Area template is a template that contains information about the size and locations of the areas in similar-looking documents.

Automated task is a set of document processing steps. ABBYY FineReader 12 includes a number of preconfigured automated tasks, which are accessible from the Tasks window. Users can also create their own automated tasks.

Automatic Document Feeder (ADF) is a device that automatically feeds documents to a scanner. A scanner with an ADF can scan multiple pages without manual intervention. ABBYY FineReader can work with multi-page documents.

Back to Top

B

Background picture area is an image area that contains a picture with text printed over it.

Barcode area is an image area that contains a barcode.

Brightness is an image parameter that reflects the amount of white in an image. Increasing brightness makes the image "whiter". Specifying a suitable brightness value increases recognition quality. For details, see "Scanning Paper Documents."

C

Code page is a table that establishes correspondences between characters and their codes. Users can select the characters they need from those available in a code page.

Color mode determines whether document colors are to be retained. Black-and-white images produce smaller FineReader documents and are faster to process.

Compound word is a word made up of two or more existing words. In ABBYY FineReader, a compound word is a word which the program cannot find in its dictionary but which it can create from two or more dictionary words.

D

Document analysis is the process of identifying the logical structure of a document and areas that contain various types of data. Document analysis can be carried out automatically or manually.

Document Open password is a password which prevents users from opening a PDF document unless they type the password specified by the author.

Document options is the set of options that can be selected in the Options dialog box (Tools > Options). Document options also include user languages and patterns. Document options can be saved and then used in other FineReader documents.

Document type is a parameter that tells the program how the original text was printed (e.g. on a laser printer, on a typewriter, etc.). For laser-printed texts, select Auto; for typewritten texts, select Typewriter; for faxes, select Fax.

Dots per inch (dpi) is a measure of image resolution.

Driver is a software program that controls a computer peripheral (e.g., a scanner, a monitor, etc.).

Back to Top

F

Font effects is the appearance of a font (e.g. bold, italic, underlined, strikethrough, subscript, superscript, small caps).

FineReader document is an object created by ABBYY FineReader to process a paper document. It contains the images of the document pages, their recognized text (if any), and the program settings.

I

Ignored characters are any non-letter characters found in words (e.g. syllable characters or stress marks). These characters are ignored during the spell check.

Inverted image is an image with white characters printed against a dark background.

L

License Manager is a utility used for managing ABBYY FineReader licenses and activating ABBYY FineReader 12 Corporate.

Ligature is a combination of two or more characters which are "stuck" together (e.g. fi, fl, ffi). Such characters are difficult for ABBYY FineReader to separate. Treating them as one character improves OCR accuracy.

Low-confidence characters are characters that may have been recognized by the program incorrectly.

Low-confidence words are words that contain one or more low-confidence characters.

M

Monospaced font is a font (such as Courier New) in which all characters are equally spaced. For better OCR results on monospaced fonts, select Tools > Options..., click the Document tab, and select Typewriter under Document type.

O

Omnifont system is a recognition system that recognizes characters set in any font without prior training.

Optional hyphen is a hyphen (?) that indicates exactly where a word or word combination should be split if it occurs at the end of a line (e.g. "autoformat" should be split into "auto" and "format"). ABBYY FineReader replaces all hyphens found in dictionary words with optional hyphens.

Back to Top

P

Page layout is the arrangement of text, tables, pictures, paragraphs, and columns on a page. The fonts, font sizes, font colors, text background, and text orientation are also part of the page layout.

Paradigm is the set of all grammatical forms of a word.

Pattern is a set of associations between averaged character images and their respective names. Patterns are created when you train ABBYY FineReader on a specific text.

PDF security settings are restrictions that prevent a PDF document from being opened, edited, copied or printed. These settings include Document Open passwords, Permissions passwords, and encryption levels.

Permissions password is a password which prevents other users from printing and editing a PDF document unless they type the password specified by the author. If certain security settings are specified for a PDF document, other users will not be able to change these settings until they type the Permissions password.

Picture area is an image area that contains a picture. This type of area may enclose an actual picture or any other object (e.g. a text fragment) that should be displayed as a picture.

Primary form is the "dictionary" form of a word (headwords of dictionary entries are usually given in their primary forms).

Product ID is automatically generated on the basis of the hardware configuration when activating ABBYY FineReader on a given computer.

Prohibited characters are characters that you think will never occur in a text to be recognized. Specifying prohibited characters increases the speed and quality of OCR.

R

Recognition area is an image area that ABBYY FineReader should analyze automatically.

Resolution is a scanning parameter measured in dots per inch (dpi). Resolution of 300 dpi should be used for texts set in 10 pt fonts and larger, 400 to 600 dpi is preferable for texts of smaller font sizes (9 pt and less).

Back to Top

S

Scanner is a device for inputting images into a computer.

Scanning mode is a scanning parameter that determines whether an image must be scanned in black and white, grayscale, or color.

Separators are symbols that can separate words (e.g. /, \, dash) and that are separated by spaces from the words themselves.

Support ID is a unique identifier of a serial number that contains information about the license and the computer on which it is used. A Support ID provides additional protection and is checked by the technical support staff before providing technical support.

T

Table area is an image area that contains data in tabular form. When the program reads this type of area, it draws vertical and horizontal separators inside the area to form a table. This area is then rendered as a table in the output text.

Tagged PDF is a PDF document which contains information about the document structure, such as its logical parts, pictures, and tables. The structure of a document is encoded in PDF tags. A PDF file with such tags may be reflowed to fit different screen sizes, and will display well on handheld devices.

Task Manager is an ABBYY FineReader feature that allows users to run an automated task, create and modify automated tasks, and delete custom automated tasks.

Text area is an image area that contains text. Text areas should only contain single-column text.

Training is the process of establishing a correspondence between a character image and the character itself. (See "Recognition with Training" for details.)

U

Unicode is a standard developed by the Unicode Consortium (Unicode, Inc.) and is a 16-bit international encoding system for processing texts.

Back to Top

11/2/2018 5:31:18 PM


Please leave your feedback about this article