PDFExportSettings Object (COM-based API)
This object provides functionality for configuring the export of recognized text to PDF format. The IPDFExportSettings interface is a child object of the IOutputFormatSettings interface and inherits all its properties. It is an element of the OutputFormatSettingsCollection collection.
When you save texts that use a non-Latin codepage (say, Cyrillic, Greek, Czech, etc.), ABBYY FineReader Server Open API uses the fonts provided by ParaType company (http://www.paratype.com/store/).
Peculiarities of exporting hieroglyphic languages to PDF
You can export hieroglyphic languages to PDF in any mode other than PDF Image Only (PDFExportMode = PEM_ImageOnly). For correct depicting of hieroglyphic characters, files for East Asian languages must be installed on all computers where the Processing Station component is installed.
When exporting to PDF, the following fonts are used:
- for the ChineseTraditional recognition language – the PMingLiU font and the MingLiU monospace font;
- for the ChineseSimplified recognition language – the SimSun font;
- for the Japanese recognition language – the MS PMincho font and the MS Mincho monospace font;
- for the Korean recognition language – the Batang font and the BatangChe monospace font.
Note. The earliest version of the PDF file which matches the specified properties of the PDFEncryptionInfo object is selected as the version of the PDF file.
- The earliest file version available is version 1.3.
- If at least one of the AllowFillingFormFields, AllowExtractingTextAndGraphicsExt, AllowDocumentAssembling or AllowPrintingExt properties is TRUE, or the EncryptionLevel property is PEL_High, the PDF file version will be 1.4.
- If the EncryptionLevel property is PEL_HighAES, the version will be 1.6.
Properties
Name | Type | Description |
Author | String | Stores the name of the author of the PDF document*. You may set this property to the name of the user. The default value is an empty string. |
ColorControlType | PictureColorControlTypeEnum |
Sets a color mode for output PDF file: Auto, ColorToBwBinary, ColorToGray or SaveColoration. Note. These settings will be ignored if UseImprovedCompression is TRUE. |
CreateOutlines | Boolean |
If this property is set to TRUE, a multi-evel table of contents will be created based on the headings in the recognized document. Note. If no headings are detected or if there is no recognized text, no table of context will be created. |
DocumentInfo | DocumentInfoItems, read-only | Stores the metadata from the PDF file*. |
EnableFastWebView | Boolean | If this property is set to TRUE, a linearized PDF will be created. |
EncryptionInfo | PDFEncryptionInfo, read-only | Specifies encryption parameters of the PDF file. The property returns a reference to the PDFEncryptionInfo object. |
EmbedFonts | Boolean | |
IsEncryptionRequested | Boolean | Specifies whether the PDF file must be encrypted. The default value is FALSE. |
KeepOriginalHeadersFooters | Boolean | If this property is set to TRUE, original headers and footers will be preserved in the output file. The default value is TRUE. |
KeepPictures | Boolean | Specifies whether to retain pictures. |
Keywords | String | Stores the keywords of the PDF document*. The default value is an empty string. |
OverwriteMetadata | Boolean |
Specifies if document metadata (the key-value pairs of the DocumentInfo property and the values of the Title, Author, Subject, Producer, Keywords properties*) must be used instead of the properties of the source document. The default value is FALSE. Note. If at least one of the Title, Author, Subject, Producer, Keywords, and DocumentInfo properties is changed, this property is automatically set to TRUE. |
PageStamps | PageStampCollectionSettings | A collection of the stamps to be applied to the output document (headers, footers, watermarks, page numbering, etc.). |
PaperHeight | Long | Stores paper height in twips (1/1440 of inch). The default value is the height of A4 format page. See the table "Paper size in different units of measurement". The value of this property is ignored, if the UseOriginalPaperSize property is set to TRUE. |
PaperWidth | Long | Stores paper width in twips (1/1440 of inch). The default value is the width of A4 format page. See the table "Paper size in different units of measurement". The value of this property is ignored, if the UseOriginalPaperSize property is set to TRUE. |
PDFExportMode | PDFExportModeEnum | Specifies the mode of export of recognized text into PDF format. This may be: text and pictures only, text over the page image, text under the page image, page image only. The default value is PEM_ImageOnText. |
PDFFontMode | BatchPDFFontModeEnum | Allows specifying fonts for exported text. |
PDFVersion | PDFVersionEnum | Specifies a PDF version. |
PictureResolution | Long | Stores the value of picture resolution in dpi, which is used for exporting pictures into PDF format. This property may be set to -1, which means that the original resolution must be preserved. The default value is 96 dpi. |
Producer | String | Stores the name of the producer of the PDF document. The default value is "ABBYY FineReader Server". |
Quality | Long | Stores the value (in percent) of the JPEG quality for color pictures saved in PDF format. This value is ignored for black-and-white pictures. The default value is 50%. |
ReplaceTextLayerOnlyInPdfs | Boolean | If this property is set to TRUE, only the text layer will be replaced in input PDF files. All other data will remain unchanged. |
ReplaceUncertainWordsWithImages | Boolean | Specifies if uncertainly recognized words will be replaced with their images when exporting into PDF format. You may use this property when the ExportMode property is set to PEM_TextOnly or PEM_TextOnImage, otherwise its value is ignored. The default value is FALSE. |
Scenario | String | Specifies the export profile. This may be: BestQuality, MinSize, MaxSpeed or Balanced. The default value is Balanced. |
SignatureSettings | IPDFSignatureSettings | Specifies the signature parameters for the PDF document. |
Subject | String | Stores the subject of the PDF document*. The default value is an empty string. |
Title | String | Stores the title of the PDF document*. The default value is an empty string. |
UseImprovedCompression | Boolean |
If this property is TRUE, the images in the PDF file will be compressed using Mixed Raster Content (MRC) technology. MRC will provide better compression for some images than the commonly used JPEG method. For example, MRC will better compress good quality images with high-contrast text against a uniform background. Use MRC to reduce the size of the resulting PDF file. The default value is FALSE. Note. When using MRC on low contrast images with a parti-coloured background artifacts are possible. |
UseOriginalPaperSize | Boolean | If this property is TRUE, original paper size is retained during export to PDF format. If the value of this property is FALSE, the paper size specified in the PaperWidth and PaperHeight properties is used. The default value is TRUE. |
WriteAnnotations (Obsolete) | Boolean | Specifies whether to write annotations when creating a PDF file. The default value is TRUE. |
WriteTaggedPdf | Boolean | Specifies if the recognized text should be exported to tagged PDF. Tagged PDF is a particular use of structured PDF that allows page content to be extracted and used for various purposes such as reflow of text and graphics, conversion to file formats such as HTML and XML, and accessibility to the visually impaired. The default value is FALSE. |
* – Note. If you change the values of the Author, Keywords, Subject or Title properties the values of the corresponding metadata keys of the DocumentInfo property will be changed or the corresponding key-value pairs will be added to the collection. Vice versa, if the values of the Author, Keywords, Subject or Title keys of the DocumentInfo property are set to some value, the values of the corresponding properties of the PDFExportSettings object will be set to the same value.
See also
26.03.2024 13:49:49