Version 11.3.0 Changelog (February 19th, 2025)

New In This Release

Version 11.3 is primarily a bugfix release.


New Features

  • Added a new Generic Key-Value engine to the Data Extraction Module. (DataExtractionModule.e_GenericKeyValue)
  • Added a function to update the custom HTTP request headers used when viewing a linearized PDF file from a URL. (PDFViewCtrl.UpdateCustomHeaders() and PDFViewWPF.UpdateCustomHeaders())
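
The header update above can be pictured with a minimal sketch. It assumes that a viewer streaming a linearized PDF fetches the file in byte ranges and attaches the caller's custom headers to each request. The class and method names below are hypothetical and are not the Apryse API; the sketch only shows why replacing the headers mid-session (for example, after an access token is refreshed) affects later range requests.

```python
from typing import Dict


class RangeRequestSketch:
    """Hypothetical stand-in for a viewer that streams a linearized PDF over HTTP."""

    def __init__(self, url: str, custom_headers: Dict[str, str]) -> None:
        self.url = url
        self._custom_headers = dict(custom_headers)

    def update_custom_headers(self, custom_headers: Dict[str, str]) -> None:
        # Replace the stored headers; requests built afterwards use the new values.
        self._custom_headers = dict(custom_headers)

    def headers_for_range(self, start: int, end: int) -> Dict[str, str]:
        # Each partial download carries the custom headers plus a Range header.
        headers = dict(self._custom_headers)
        headers["Range"] = f"bytes={start}-{end}"
        return headers


viewer = RangeRequestSketch(
    "https://example.com/large.pdf",  # placeholder URL, assumption for illustration
    {"Authorization": "Bearer old-token"},
)
first = viewer.headers_for_range(0, 1023)

# Simulate a token refresh while the document is still being streamed.
viewer.update_custom_headers({"Authorization": "Bearer new-token"})
second = viewer.headers_for_range(1024, 2047)
```

In this sketch the first request carries the old token and the second carries the new one, while the Range header is rebuilt for every request.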

New Options

  • Added Inclusion and Exclusion zones to the Data Extraction Module. These zones allow users to specify regions of interest in a document when using specific Data Extraction Module engines. (DataExtractionOptions.AddInclusionZonesForPage() and DataExtractionOptions.AddExclusionZonesForPage())
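
As a rough illustration of the region-of-interest idea, the sketch below filters detected boxes against inclusion and exclusion rectangles. It does not call the Apryse SDK: the `Rect` type, the `in_region_of_interest` function, and the rule that exclusion takes precedence over inclusion are all assumptions made for this example, and the engines' actual behavior may differ.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle in page coordinates (hypothetical helper type)."""
    x1: float
    y1: float
    x2: float
    y2: float

    def intersects(self, other: "Rect") -> bool:
        # Two rectangles overlap only if they overlap on both axes.
        return (self.x1 < other.x2 and other.x1 < self.x2
                and self.y1 < other.y2 and other.y1 < self.y2)


def in_region_of_interest(box: Rect, inclusion: List[Rect], exclusion: List[Rect]) -> bool:
    """Assumed rule: exclusion zones win; with no inclusion zones, the whole page counts."""
    if any(box.intersects(zone) for zone in exclusion):
        return False
    if inclusion:
        return any(box.intersects(zone) for zone in inclusion)
    return True


# Zones for a single 612 x 792 point page (illustrative values).
inclusion = [Rect(0, 400, 612, 792)]  # upper part of the page
exclusion = [Rect(0, 742, 612, 792)]  # header band inside the inclusion zone

boxes = {
    "title": Rect(72, 700, 300, 720),   # inside inclusion, outside exclusion
    "header": Rect(72, 750, 300, 770),  # overlaps the exclusion zone
    "footer": Rect(72, 20, 300, 40),    # outside every inclusion zone
}
kept = [name for name, box in boxes.items()
        if in_region_of_interest(box, inclusion, exclusion)]
```

With these values only the "title" box survives: the header is removed by the exclusion zone and the footer falls outside the inclusion zone.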

Improvements:

  • [pdf] Reduced file size when inserting a CCITT-compressed TIFF image into a page. The PDF image stream now uses CCITT compression unless the user has specified a particular compression.
  • [pdf] Improved how overlay text is aligned inside a redaction rectangle for cases where the PDF pages have rotation.
  • [pdfua] Improved placement of link annotations within the structure tree when auto-tagging.
  • [pdf] Improved rendering performance for corrupt PDFs that contain vector graphics paths with enormously long segments. Previously, rendering such PDFs could take an extremely long time.
  • [pdf] Added getter and setter functions for the radio button groups of optional content groups (OCGs) to the OCG Config class.
  • [pdf] Improved the Apryse SDK's repair capabilities when opening corrupted PDF files. Previously, the repair could be incomplete, causing inaccurate rendering or content extraction.
  • [c++] Added an overload of PDFDoc.SaveViewerOptimized(), which accepts a ViewerOptimizedOptions reference.

Bugfixes:

  • [all] Fixed an issue that could cause the error "locale::facet::_S_create_c_locale name not valid" on certain AWS Lambda runtimes (e.g., Node.js 20.x).
  • [pdf] Fixed an issue where the 32-bit SDK created corrupt output PDF files when the 4GB size limit was exceeded. Now an exception is thrown instead, and the 64-bit SDK has no such size limit.
  • [pdfa] Fixed a PDF/A validation case where the error "Embedded Type1 font descriptor's CharSet string does not list all glyphs." was reported incorrectly.
  • [pdfa] Fixed conversion to PDF/A-1a when a font is missing a ToUnicode table.
  • [pdfa] Fixed the regeneration of the CID font's CIDSet during PDF/A-1a conversion when the same descendant font is shared by more than one CID parent font.
  • [pdf] Fixed an issue where digital signature verification could fail when the OCSP response contains multiple certificates.
  • [xfdf] Fixed a crash that could occur when converting FDF to XFDF if a page's viewport has no BBox entry.
  • [pdf] Fixed a potential timeout issue in the Data Extraction Module. Previously, very large PDFs could trigger the following exception: "Error extracting Form Fields from PDF using the AIPageObjectExtractor module Add-On. This module was not designed to run as a standalone application".

Office Fidelity:

  • [docx] Improved handling of interaction between floating elements and inline tables.
  • [docx] Fixed paragraph spacing when the space before is set to auto.
  • [docx] Fixed incorrect default footer and header sizes in Word documents.
  • [docx] Fixed incorrect layout of vertical text in some shapes.
  • [docx] Improved handling of malformed tables.
  • [ppt] Fixed incorrect spacing between bullet points and text in some PPT documents.
  • [docx] Fixed issues with incorrect content placement in linked text boxes.
  • [docx] Improved layout around rounded rectangles.
  • [xlsx] Fixed a rare null position reference exception occurring during Excel conversion.
  • [office] Fixed incorrect text color in some diagrams.
  • [office] Fixed an issue with missing image outlines in some documents.
  • [office] Fixed incorrect size of shape text boxes in rotated shapes with the "Do not rotate text" flag set.
  • [docx] Fixed an unexpected default stroke added to some shapes.
  • [docx] Fixed displaying of deleted section breaks when change tracking is disabled.
  • [xlsx] Fixed template filling for XLS/XLSX workbooks.
  • [office] Fixed incorrect 3D rotation applied to shapes when the "Isometric Top Up" preset is selected.
  • [xlsx] Fixed the "caller.m_operators.back() == "f"" exception thrown for some Excel documents.
  • [xlsx] Fixed an issue with incorrect evaluation of Excel formulas returning strings as boolean values.
  • [docx] Added support for text outlines in DOCX documents.
  • [xlsx] Fixed incorrect resolution of local hyperlinks in Excel.
  • [xlsx] Avoided adding empty trailing pages to Excel sheets.
  • [ppt] Fixed an issue with missing tab indentations in PPT documents.
  • [doc] Fixed an issue with missing bottom borders of the first row in tables in some DOC documents.
  • [xlsx] Fixed an issue with missing text after images in Excel headers/footers.
  • [docx] Fixed incorrect lengths of lines in some drawings.
  • [pptx] Multi-column text boxes are now balanced when they overflow.
  • [docx] Fixed an issue with extra borders between paragraphs with additional spacing.
  • [pptx] Fixed incorrect text color in some PowerPoint documents.
  • [docx] Fixed incorrect positions of floating shapes anchored inside framed paragraphs.
  • [xlsx] Fixed an issue with horizontally oversized tables inside floating shapes.
  • [doc] Fixed an issue where unexpected list numbers appeared at the beginning of paragraphs in DOC documents.
  • [doc] Fixed an issue where some list numbers were being duplicated in DOC documents.
  • [docx] Fixed the incorrect size of watermarks in some documents.
  • [docx] Fixed an issue where "Keep lines together" paragraphs were moved to the next page unnecessarily.
  • [docx] Content in "New column" sections is now placed on the correct page.
  • [ppt] Fixed an issue where text did not appear bold when the "Times New Roman Bold" font was selected in PPT documents.
  • [docx] Accounted for border thickness when applying relative size and position to floating shapes.
  • [office] Fixed incorrect rounding of percentage values in some chart data labels.
  • [doc] Optimized output PDF size by eliminating duplicate images during DOC document conversion.
  • [docx] Fixed a rare case of a floating table being in the wrong column.
  • [docx] Fixed an issue with incorrect table width inside text boxes.
  • [docx] Fixed incorrect line spacing after continuous section breaks.
  • [office] Fixed an issue where EMF images lost transparency in Office documents.
  • [docx] Fixed incorrect positioning of some shapes in drawings.
  • [xlsx] Added support for page number offsets in Excel headers/footers.
  • [docx] Fixed an issue with incorrect float avoidance for some paragraphs with custom spacing.
  • [office] Fixed an issue where chart labels were misaligned in EMF images.
  • [office] Fixed an issue with unexpected reordering of digits in RTL text.
  • [docx] Fixed an issue with extra paragraph lines placed inside table cells at the end of the page.


Fixes and improvements for the Structured Output Module

  • [docx] Improved column and row detection of hybrid split tables.
  • [office] Implemented recognition of non-standard encoded vertical Japanese characters.
  • [office] Improved the precision of non-standard encoded Arabic character coordinates.
  • [docx] Improved detection of single-column non-table content.
  • [docx] Improved table detection.
  • [office] Improved the rendering of Type 3 font glyphs.
  • [docx] Improved the optical character recognition of large images on 32-bit platforms.
  • [docx] Implemented recognition of lists that use image bullets in PDFs.
  • [json] Implemented an option to ignore the detection of tiled pages in JSON export.
  • [json] Improved JSON export of pages that exceed Microsoft size limitations.
  • [json] Added support for placing nested tables inside their corresponding cell contents.
  • [office] Applied custom language string options for Chinese text recovery.
  • [office] Improved the initialization mode used with the Thai trained data language file.
  • [docx] Improved the detection of list items to prevent inclusion of undesirable footnote content.
  • [office] Implemented automatic rotation detection of Japanese and Korean documents using optical character recognition.
  • [docx] Fixed a bug causing the misdetection of multiple glyph shapes representing a single letter "e" in a document.
  • [docx] Fixed a bug preventing detection of a borderless table when the table contained extended spaces between rows.
  • [docx] Fixed a bug causing the detection of unnecessary column breaks in a document with right-to-left aligned Arabic text.
  • [docx] Fixed a relative height calculation issue preventing a very large document from opening in Microsoft Word.
  • [docx] Fixed a bug preventing the detection of columns when ignoring the tagged table structure of a PDF.
