Version 10.9.0 Changelog (May 1st, 2024)

New In This Release

Version 10.9 is primarily a bugfix release.

Support for MHT and EML

  • If the HTML2PDF module is available, Convert.ToPdf() and HTML2PDF.Convert() will now automatically use this module to convert .mht/.mhtml and .eml files to PDF.

New Digital Signature Utilities

  • Added support for digitally signing using the RSASSA-PSS signature algorithm. (DigitalSignatureField.GenerateCMSSignature() that takes AlgorithmIdentifier, RSASSAPSSParams class)

New Options

  • Added an option to save incrementally to a file when converting from Office to PDF. This saving leads to a significant reduction in memory usage when processing documents with many pages (particularly large Excel documents), but may slow the conversion down if the filesystem is slow. (OfficeToPDFOptions.SetIncrementalSave()).
  • Added options to force or disable OCR processing when converting from PDF to PowerPoint or Excel using the Structured Output Module. These options bring these formats in line with the existing option for conversion to Word. (PowerPointOutputOptions.SetSearchableImageSetting() and ExcelOutputOptions.SetSearchableImageSetting()).
  • Added an option to exclude bookmarks when converting office to PDF (OfficeToPDFOptions.SetIncludeBookmarks()).

Improvements:

  • [html] Added support for HTML2PDF.WebPageSettings.SetAllowJavaScript() when using html2pdf_chromium. This method is no longer deprecated.
  • [pdf] Fixed occasional excessive highlights with semantic text comparison.
  • [pdf] Exposed PDFDoc.FDFExtract() with a PageSet argument on Java and all versions of .Net.
  • [all] Improved connection error messaging for network-enabled consumption license keys.
  • [pdf] Adjusted cloudy FreeText annotation text padding and alignment to more closely match other PDF viewers.
  • [ocr] Improved OCR quality for documents containing low resolution bitmaps that are not aligned with the crop box.
  • [pdf] Improved support for extracting text from corrupt PDF documents with mismatched q/Q operator pairs.
  • [pdf] Improved resilience of digital signature validation against corrupt files containing free references. Previously these could cause an exception to be thrown.

Bugfixes:

  • [node.js] Added missing e_ccitt enum value in PDFNet.Optimizer.MonoImageSettings.CompressionMode for Node.js.
  • [html] Fixed issue when converting HTML to PDF that could lead to unexpectedly large PDF output.
  • [html] Fixed invalid PDF structure tree generated by HTML to PDF conversion.
  • [pdf] Fixed an issue with DataExtractionModule.DetectAndAddFormFieldsToPDF() when a page range is specified.
  • [pdf] Fixed an issue with potentially incorrect text extraction in the case where a symbolic TrueType font is used for Latin text.
  • [pdf] Fixed an issue with GeometryCollection where processing of Form XObjects could fail thereby preventing snapping to objects described within.
  • [pdf] Fixed PDF rendering issues with a rare type of tiling pattern.
  • [pdfa] Fixed a PDF/A validation error after converting a PDF that uses device CMYK colorspace with no default and has overprint enabled in the graphics state.
  • [pdf] Fixed an issue with calling Annot.RefreshAppearance(RefreshOptions) in Python, Ruby, PHP and Go.
  • [xod] Fixed an issue where some content could be missing after converting certain XOD files to PDF.
  • [pdf] Fixed an issue where disabling digest verification would make TimestampingConfiguration.TestConfiguration() report a failure.
  • [pdfa] Fixed an issue with validation and conversion to PDF/A where "/Interpolate true", which is not allowed in PDF/A, could be ignored in the "Mask" entry of images and in the page thumbnail.
  • [pdfa] Fixed an issue when converting some documents to PDF/A-1, where the output could cause errors in other PDF consumers. This occurred when processing corrupt PDF files containing image masks that also have a colorspace.

Office Fidelity:

  • [office] Fixed multiple element placement issues in stacked column charts.
  • [docx] Added clickable hyperlinks to tab leaders in the table of contents.
  • [docx] Fixed an issue where the text within a textbox could appear upside down in Word 2013 documents.
  • [docx] Enabled heading structure tags for PDF output.
  • [xlsx] Reduced peak memory usage for Excel document conversions.
  • [docx] Tuned line height and text baseline placement in a number of different scenarios.
  • [xls] Fixed an issue with incorrect width of some shapes in XLS documents.
  • [docx] Fixed a rare issue with infinite pages caused by floating elements pushed outside page margins.
  • [docx] Fixed a rare "pure virtual method called" error on Linux.
  • [xlsx] Fixed an issue with incorrect handling of Excel documents with more than 65,535 rows.
  • [office] Various fixes for office to PDF accessibility tags for tables.
  • [pdf] Changed font subsetting to also subset glyph widths, reducing PDF file size.
  • [pdf] Improved font processing performance for office PDFs.
  • [docx] Fixed an issue where a floating image could incorrectly overlap a line of text.
  • [xls] Fixed a rare issue with unexpected (extra or missing) cell borders in Excel documents.
  • [office] Added office file metadata to PDF output in office to PDF conversion (title, author. etc.).
  • [docx] Improved text indentation accuracy within paragraphs and lists.
  • [docx] Fixed an issue where floating images could be placed on the wrong page.
  • [office] Fixed an issue with certain Thai accent characters, causing the Adobe "Character encoding" accessibility check to fail.
  • [xlsx] Fixed an issue that could produce incorrect Excel cell margins for merged cells.
  • [docx] Fixed a bug where vertically justified table cell contents were mispositioned.
  • [doc] Fixed incorrect table header styles in some .doc files.
  • [docx] Implemented clipping of floating elements inside table cells.
  • [xls] Fixed an issue with too narrow column widths in some .xls documents.
  • [docx] Fixed issues with incorrect application of kerning in Word documents.
  • [docx] Improved the accuracy of paragraph spacing in Word documents.
  • [office] Added accessibility tags for link annotations.
  • [docx] Fixed a rare issue where an extra line could be added to the end of a table cell.
  • [xlsx] Improved page breaking for large sheets when the ApplyPageBreaksToSheet option is not set.
  • [docx] Improved text layout in justified paragraphs to better match Word's behavior.
  • [xls] Added support for color scale conditional formatting for .xls files.
  • [docx] Fixed an issue with text wrapping around images in table cells.
  • [xls] Improved reading of shapes from malformed .xls documents.
  • [xlsx] Added support for East Asian date formats in Excel.
  • [xlsx] Fixed an issue with the display of numbers with rounding errors in Excel.
  • [pptx] Fixed a rare issue with incorrect shape fill style inherited from the slide master.
  • [docx] Fixed a rare issue where extra table rows were repeated on every page.
  • [office] Fixed incorrect scaling of some SVG images.
  • [xlsx] Fixed a bug where an Excel sheet was extended to its maximum height due to a full-column merged cell.
  • [office] Improved font substitution for multi-language documents.
  • [office] Improved typesetting of Arabic text.
  • [office] Improved font substitution by preferring an exact font match if available in documents containing characters that are not supported by the font.
  • [docx] Added accessibility tags for table header cells.
  • [docx] Fixed a bug where a table row could be be incorrectly interpreted as a repeated header.

Fixes and improvements for the Structured Output Module

  • [docx] Fixed an issue preventing successful conversion of a file.
  • [docx] Fixed an issue preventing one image of many from being correctly rendered.
  • [docx] Fixed an issue preventing successful conversion of a file on Linux operating systems only.
  • [docx] Fixed an issue preventing the detection of a Table of Contents due to the text order of the file.
  • [docx] Improved detection of breaks on scanned documents containing Arabic text.
  • [office] Streamlined optical character recognition workflow of large documents containing non-standard encoded text.
  • [office] Allowed page snapshot deletion where annotations exist.
  • [office] Improved processing of non-standard encoded characters to unicode.
  • [office] Improved detection of combined characters.
  • [office] Improved detection of Arabic diacritic characters.
  • [office] Improved detection of transparent watermarks over scanned pages.
  • [docx] Improved detection of Table of Contents.
  • [docx] Improved detection of shapes when converting to DOCX.
  • [docx] Improved detection of serial images that contain underlines.
  • [office] A limited number of third-party libraries have been updated to include the latest security fixes.

Did you find this helpful?

Trial setup questions?

Ask experts on Discord

Need other help?

Contact Support

Pricing or product questions?

Contact Sales