Training while processing documents

ABBYY FlexiCapture for Invoices lets you improve recognition quality while processing documents. If the program fails to detect the correct location of a field on a document image, an Operator can specify the correct location and the program will use it when recognizing other documents.

Training is only available if ABBYY FlexiCapture can reliably identify the vendor by finding the corresponding record in a vendor database. If you have no vendor databases but still want to use field training, you can accumulate company information by adding records to your data sets while capturing invoices. For more information, see Looking up vendors and business units in the database.

This article explains how to train ABBYY FlexiCapture for Invoices using the locally-installed version of the Verification Station, and covers some training-related issues that Operators need to know about.

  1. Collect a batch of invoices (e.g. the invoices processed within the past month) and start feeding them to the program. See How to capture invoices.
  2. Once the documents are fed to the program, they are automatically recognized (this happens only if the Recognize added images automatically option is enabled on the Document Processing tab of the Options dialog; to open this dialog, click Tools → Options...) and the data are checked by means of validation rules.
  3. If the status of a recognized invoice is other than Valid or if you have other reasons to believe that the program failed to detect some of the fields, open the document in the document editor.
  4. Review the document form. The Vendor group of fields must be filled out correctly.

    More...

  1. Training will only be successful if the regions of all the fields are marked up correctly, so make sure that the regions match the actual locations of their respective fields on the image. See Training line items for more information on how to mark up line items on an invoice.
    To achieve this, in the image window of the document editor, adjust the regions or draw regions for those fields which the program failed to detect.
    After that, the program will analyze the document. If the mark up of the field regions was modified and the training for this vendor is not prohibited, the document will be added to the batch.

How to change the region of a field

  1. All the fields of the invoice will be used for training purposes, not just those whose markup you have added or modified.
  2. Repeat steps 4-6 for the next document.
  3. When the third and subsequent invoices of the same vendor are added to the batch, the program starts the training process. The program will either train a special FlexiLayout (a FlexiLayout Variant) or suggest that a user gathers more examples (in this case move to the next document and go back to step 4).
    If the FlexiLayout for the alternative has been successfully trained, it will be used with the next vendor invoice that determines this invoice variant. After the recognition, field regions will be imposed on an invoice image based on the training results.
    If a new image is added to the batch, the program determines the quality of FlexiLayout application for the variant. If the added image deteriorates the quality of the application of field regions, it will not be used. Otherwise, it will be used for testing.
  4. Add a few more invoices from the vendor whose Document Variant has been trained and recognize them. Then open the newly added invoices one by one in the document editor to check if the regions are marked up correctly. If all the regions are located correctly, no additional training is required.
    If you are not satisfied with the results, continue training the program on invoices from the given vendor (repeat steps 4-6). Now each time, the training process will be started. If the training is successful, a new FlexiLayout Variant will be created.

Training While Setting Up ABBYY FlexiCapture for Invoices

01.12.2020 7:03:59


Please leave your feedback about this article