Document Scanning and Image Import
ABBYY FineReader Engine can receive images from three types of sources: a scanner, image files, or memory.
Document Scanning APIs
- TWAIN interface (including ADF support and manual input feeding)
- FineReader document scanning UI
With its document scanning tools, ABBYY FineReader Engine 12 enables flexible management of scanning parameters such as brightness, color mode, resolution, image size, duplex scanning, the pause between pages, and more. For OCR purposes, the best resolutions lie in the range of 200-400 dpi. The choice of resolution depends on the quality of the paper original, the font size, and other factors. For more details, please see the description of the Scanning scenario.
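The resolution guidance above can be turned into a quick pre-scan check. The sketch below is plain Python and does not call the FineReader Engine API; the function and constant names are hypothetical, and the only value taken from the text is the 200-400 dpi range.

```python
# Recommended OCR resolution range quoted in the text above.
OCR_DPI_MIN = 200
OCR_DPI_MAX = 400


def effective_dpi(pixels: int, inches: float) -> float:
    """Return the resolution of an image side given its pixel and physical size."""
    if inches <= 0:
        raise ValueError("inches must be positive")
    return pixels / inches


def in_recommended_range(dpi: float) -> bool:
    """Return True if dpi falls inside the 200-400 dpi range (inclusive)."""
    return OCR_DPI_MIN <= dpi <= OCR_DPI_MAX


if __name__ == "__main__":
    # A page 8.5 inches wide scanned to 2550 pixels across.
    dpi = effective_dpi(2550, 8.5)
    print(dpi, in_recommended_range(dpi))
```

A check like this only flags resolutions outside the general range; the right value within it still depends on paper quality and font size, as noted above.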
Image file formats
ABBYY FineReader Engine supports the majority of image formats, including multi-page TIFF and JPEG 2000 (Part 1), and works with black-and-white, grayscale, and color images. It also opens PDF files by converting them into images with PDFium technology.
See more in Supported Image Formats.
Memory image formats
Opening digital documents
Born-digital documents can also be loaded for processing with the same methods that open image files. The following formats are supported:
- text documents: DOC, DOCX, RTF, HTML, TXT, ODT
- spreadsheets: XLS, XLSX, ODS
- presentations: PPT, PPTX, ODP
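An application that accepts mixed input may want to check whether a file belongs to one of the three groups listed above before handing it to the engine. The following is a minimal sketch in plain Python, not part of the FineReader Engine API; the names `DIGITAL_FORMATS` and `classify_digital_document` are hypothetical, and the check uses the file extension only, not the file contents.

```python
from pathlib import Path
from typing import Optional

# Extensions copied from the format list above, grouped by document kind.
DIGITAL_FORMATS = {
    "text document": {"doc", "docx", "rtf", "html", "txt", "odt"},
    "spreadsheet": {"xls", "xlsx", "ods"},
    "presentation": {"ppt", "pptx", "odp"},
}


def classify_digital_document(filename: str) -> Optional[str]:
    """Return the document kind for a listed extension, or None otherwise."""
    ext = Path(filename).suffix.lower().lstrip(".")
    for category, extensions in DIGITAL_FORMATS.items():
        if ext in extensions:
            return category
    return None


if __name__ == "__main__":
    for name in ("report.DOCX", "budget.ods", "slides.pptx", "scan.tiff"):
        print(name, "->", classify_digital_document(name))
```

Files that return None here, such as image files and PDFs, are covered by the image import paths described in the other sections.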
Additional features for PDF files
- Extracting text layer from PDF
- Image-only PDF input
- Vectorized PDF
- Password-protected PDF