English (English) - Change language

Paragraph

Paragraph is an element that describes a paragraph. When searching for the paragraph, the program will consider all text objects wholly or partially contained within the search area.

A Paragraph element is marked with in the tree of elements.

This element is used to search for text paragraphs whose text is unknown. The program will consider the Recognized Words and Recognized Lines objects detected by pre-recognition and falling within the element's search area.

The properties of a Paragraph element

The properties of a Paragraph element are specified on the Paragraph tab of the element's Properties dialog box. To open the element's Properties dialog box, in the FlexiLayout window, right-click the element and select Properties... from the shortcut menu.

Show Properties dialog box, Character String tab

  • Min line count
    specifies the minimum number of lines in the paragraph (a fuzzy interval is used)
  • Max line count
    specifies the maximum number of lines in the paragraph (a fuzzy interval is used).
  • Alignment
    specifies the type of alignment.
  • Max disalignment
    specifies the maximum disalignment value in user-defined units of measurement.
  • Allow first line indent
    specifies whether to allow first line indent. The indent value is measured in user-defined units of measurement.
  • Allow "hanging" line
    specifies whether to include hanging lines in paragraphs.
  • Max space in line
    specifies, in user-defined units of measurement, the maximum allowed absolute length of the space inside the detected object; when searching for the text, characters are added to the string until the distance between the neighboring characters becomes greater than the value in Max space in line.
  • Max interline space
    specifies the maximum distance between the lines in user-defined units of measurement.
  • Max relative interline space
    specifies the maximum distance between the two lines relative to the height of the top line; if the distance between the lines is greater than this value, the second line does not make it into the hypothesis.
  • Min line height
    specifies the minimum line height (a fuzzy interval is used).
  • Max line height
    specifies the maximum line height (a fuzzy interval is used).
  • Min line width
    specifies the minimum line width (a fuzzy interval is used).
  • Max line width
    specifies the maximum line width (a fuzzy interval is used).
  • Whole words only
    specifies whether the hypothesis should include the parts of the words which are partially contained within the search area; enable this option if you need to exclude word fragments from the hypothesis
  • Remove stamp
    When recognizing payment documents, stamps placed over text may pose a problem. Typically, a stamp is placed in the bottom right corner of the text. The text fragment with a superimposed stamp is practically illegible and is recognized by the program as bits of phrases which make no sense. To avoid this, do not include the stamped text fragment into the block and key in the text manually from the image (and you won't have to remove the incorrectly recognized text).
    If Remove stamp is enabled, the program will analyze the bottom right corner of the search area and remove any bits of text that may have been caused by a superimposed stamp.
  • Ignore separators
    specifies whether separators should be ignored when searching for the paragraph; disable this option if the paragraphs are marked off by separators

See also:

Creating and deleting elements

An overview of element properties

Search area

Additional search constraints for Paragraph element

12/11/2019 6:06:02 AM


Please leave your feedback about this article