History
Free Trial Web API version Licensing Request A Quote
HAVE QUESTIONS OR NEED HELP?SUBMIT THE SUPPORT REQUEST FORM or write email toSUPPORT@BYTESCOUT.COM
History (changes log)
ByteScout PDF To HTML SDK history of changes. Legend: ------------------------- - - bug fixed + - new feature = - changed ! - critical ------------------------- 13.4.0.4659 (April 10, 2023) ================================ = Performance improvements - Other minor fixes and improvements. 13.3.0.4514 (September 27, 2022) ================================ = Improved parsing of PDF documents. = Other minor fixes and improvements. 13.2.0.4485 (June 7, 2022) ========================== = Improved parsing of PDF documents. = Other minor fixes and improvements. 13.1.0.4386 (January 24, 2022) ============================== + Added property 'AdditionalCssStyles' allowing to add custom CSS styles. + Added support for profiles. = Improved parsing of PDF documents. = Other minor fixes and improvements. 13.0.0.4253 (October 4, 2021) ============================= + Added support for page ranges. See overloads of methods 'GetHTML' and 'SaveHtml*'. + Added page numbers in the 'page start' and 'page end' HTML comments. = Performance improvements. = Improved parsing of PDF documents. = Other minor fixes and improvements. 12.1.0.4136 (May 18, 2021) ========================== = Improved parsing of PDF documents. = Other minor fixes and improvements. 12.0.0.4062 (February 8, 2021) ============================== = Improved filtering of invisible objects. = Improved detection of bold fonts. = Improved support for external Chinese fonts. = Now the damaged CCITT and JBIG2 images are skipped from rendering avoiding crashes. - Fixed exception in annotation parsing. - Fixed crash on some scaled images in 'HTMLExtractionMode.PlainHTML' mode. = Improved parsing of PDF documents. = Other minor fixes and improvements. 11.3.0.3983 (October 26, 2020) ============================== = Improved HTMLExtractor to not use the Environment.CurrentDirectory to get location for generated image files. Was causing errors in ASP.NET environment. = Improved parsing of PDF documents. - Other minor fixes and improvements. 11.2.0.3919 (June 20, 2020) =========================== + Added 'DetectUnderlineTextStyle' and 'DetectStrikeoutTextStyle' properties. = Improved parsing of PDF documents. - Other minor fixes and improvements. 11.1.0.3845 (March 19, 2020) ============================ = Improved parsing of PDF documents. - Other minor fixes and improvements. 11.0.0.3805 (February 11, 2020) =============================== + Added support for new revision of PDF encryption (ISO 32000-2:2017 compliance). + Added 'LicenseInfo' property providing detailed information about your license. = Improved extraction of duplicated text (shadow-like effect). - Fixed .NET Core compatibility issue. = Improved parsing of PDF documents. - Other minor fixes and improvements. 10.8.0.3732 (December 4, 2019) ============================== = Improved handling of Arabic fonts and charsets. = Improved handling of CJK fonts and charsets. = Improved parsing of PDF documents. - Other minor fixes and improvements. 10.7.0.3697 (November 1, 2019) ============================== = Improved image extraction. = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 10.6.0.3659 (October 1, 2019) ============================= = Improved replacement of not embedded PDF fonts. = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 10.5.0.3637 (September 2, 2019) =============================== = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 10.4.0.3600 (August 6, 2019) ============================ - Fixed file locking: "Reset()" and "Dispose()" methods now correctly release generated files. = Improved conversion of form controls. = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 10.3.0.3566 (July 2, 2019) ========================== = Improved the detection of the visibility of text objects when they are hidden by a overlying opaque vector object. = Improved extraction from cropped PDF pages. - Fixed `OutOfMemoryException` on tiling patterns with very large step or bounding box. - Fixed decoding of UTF-8 encoded text objects. = Improved Japanese fonts decoding. = Improved font substitution when `KeepOriginalFontNames` is true. - Other minor fixes and improvements. 10.2.0.3512 (May 28, 2019) ========================== - Fixed overlapping text objects when 'HighPrecisionTextPositioning' is disabled. = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 10.1.0.3439 (April 4, 2019) =========================== = Improved parsing and processing of PDF documents. - Other minor fixes and improvements. 9.4.0.3398 (March 12, 2019) =========================== = Improved handling of embedded fonts. = Improved parsing of PDF documents. - Fixed image files locking. - Other minor fixes and improvements. 9.3.0.3359 (January 31, 2019) ============================= + Added .NET Core compatible assemblies. = Reduced CSS cluttering. = Improved parsing of PDF documents. = Improved support of Korean fonts. - Removed C++ runtime dependencies. - Other minor fixes and improvements. 9.2.0.3254 (October 22, 2018) ============================= - Improved support of cropped PDF documents. = Performance improvements. - Other minor fixes and improvements. 9.1.0.3163 (July 18, 2018) ========================== = Fixed bugs in PDF parsing. - Other minor fixes and improvements. 9.0.0.3079 (April 11, 2018) =========================== = Improved PDF parsing. - Other minor fixes and improvements. 8.8.0.3015 (January 22, 2018) ============================= = Improved support of encrypted PDF documents. - Other minor fixes and improvements. 8.7.0.2980 (November 8, 2017) ============================= + Added `ParsingError` event allowing to handle parsing errors and interrupt or continue the processing. + Implemented embedding of images into HTML code as `data URI`. = Improved rendering of PDF documents. = Performance improvements. - Other minor fixes and improvements. 8.6.0.2911 (August 1, 2017) =========================== + HTMLExtractor: added KeepOriginalFontNames property. - Fixed crash on 64-bit CMYK images. - Subscript and superscript text objects were merged with normal text. Fixed now. = Other minor fixes and improvements. 8.5.0.2855 (June 1, 2017) ========================= = Converter now keeps empty input text fields. = Improved Japanese text conversion. = Removed obsolete ClientProfile builds. = Performance improvements; - Other minor fixes and improvements. 8.3.0.2794 (March 06, 2017) =========================== = Performance improvements. + HTMLExtractor now able to reflow PDF text in PlainHTML mode. = Other minor improvements and bug fixes. 8.2.0.2697 (January 11, 2017) ============================= = Improved PDF to plain HTML conversion: it's now possible to reflow newspaper layout to single-column HTML text. = Other minor improvements and bug fixes. 8.1.0.2600 (October 21, 2016) ============================= - Fixed extraction of text in Korean charset (KSCms-UHC-H / Code Page 949). = Improved support of colorspaces and soft masks. - Improved extraction of invisible text. = Other minor improvements and bug fixes. 8.0.0.2523 (August 19, 2016) ============================ = Improved text extraction. = Improved performance. - Fixed PDF parsing issues. - Fixed JBIG images decoding. = Other minor improvements and bug fixes. 7.00.0.2474 (May 26, 2016) ========================== - Fixed bug in installer causing some files from previous installation were interfering with updates. = Reworked the registration checking. Now the library will not throw an exception, but work in demo mode if you missed or input wrong RegistrationName and RegistrationKey. + PDF Multitool: Added recent document list to "Open PDF Document" button. + PDF Multitool: Selection can be resized now. + PDF Multitool: Added Extract JSON feature. = PDF Multitool: Improved Table Detector UI. = PDF Multitool: Greatly improved font rendering quality. + PDF Multitool: Added debug option "Show Detected Extraction Columns" to the context menu to display the detected columns on the current page. Becomes visible only after running any extraction against the current displayed page. - PDF Multitool: Fixed font rendering issue on 32-bit Windows. = Other minor improvements and bug fixes. 6.30.0.2421 (March 23, 2016) ============================ = Improved support of ICC color profiles. = Imporved handling of embedded fonts. - Fixed extracted text duplication when using OCRCacheMode.WholePage option. = Other bug fixes and improvements. 6.20.2354 (January 20, 2016) ============================ PDF To HTML conversion improved PDF reading speed improved new property PageDataCaching controlling automatic disposing of previously accessed pages. implementing page caching type example "Memory-Care Processing Of Huge Docs" renamed to "Reduce Memory Usage" new DisposePage() method to HTMLExtractor SkipInvisibleText now skips clipped text (which is not visible) Rendering improves annotations rendering Colors management improved PDF2HTML improves form controls output Images output improved to support more sub-formats Unicode text handling improved minor bug-fixes 6.12.2239 (September 17, 2015) ============================== PDF To HTML conversion improved PDF format support improved images and text support improved minor bug-fixes 6.00.2131 (June 16, 2015) ========================= HTML output improved (issues with links fixed) minor bug-fixes 6.00.2114 (June 3, 2015) ======================== improved PDF To HTML conversion improved PDF format support improved images, fonts, text encoding support minor issues fixes with HTML output minor improvements and fixes 5.10.1750 (November 27, 2014) ============================= improved pdf to html conversion from asp.net and .net issue with overlapping content when converting multiple pages from PDF fixed XHTML output minor fixes Support for text opacity added now outputs unknown character (0 to 32) as "?" improving support of pdf images conversion into html fixing minor issues with output images filenames minor improvements and fixes 5.00.1639 (August 23, 2014) =========================== improved handling of images while converting pdf to html improved support for damaged pdf files HTML text output improved PDF Viewer GUI demo improved minor improvements and fixes 4.00.1481 (May 30, 2014) improved pdf to html conversion improved PDF handling and bug-fixes the issue with missing whitespace while converting text to css multhithreading performance improved licensing updated: Now Licensing is per developer special styles for "print" device causing visible scroll bars in every div were disabled updated samples and improved demo GUI minor bug-fixes 3.40.1339 (March 5, 2014) documentation for the API updated and included now into the SDK improved text processing in PDF improved PDF reading support minor bug-fixes 3.20.1179 (October 22, 2013) pdf to html conversion improved new OutputHTMLWidth, GetOutputHTMLPageHeight, GetPageHeight, GetPageWidth functions and properties to read and set information about width and height for PDF and output HTML demo gui: the issue with automatic scrolling up to top when selecting a region fixed improved support of PDF files with rotated objects and pages minor bug-fixes and processing speed improvements 3.20.1076 (July 11, 2013) improved PDF To HTML conversion improved PDF files support minor bug-fixes 3.10.1052 (June 29, 2013) improved pdf to html conversion functionality improved images support in PDF files speed improvements for pdf to html conversion improved fonts support improved support for various PDF files minor bug-fixes 3.10.904 (May 14, 2013) improved pdf to html conversion functionality improved PDF format support minor bug-fixes 3.00.864 (April 11, 2013) improved PDF handling improved PDF fonts handling demo utility updated source code samples updated to run on any .NET framework by default minor bug-fixes 3.00.829 (March 14, 2013) improved pdf to html conversion demo utility PDF Viewer reworked and updated for better UI experience minor improvements and fixes in PDF support improved PDF stability while working with PDF files with high density vector graphics inside improved support for indexed color pallettes improved embedded fonts rendering better support for Unicode fonts new .Version property to read exact version of the dll minor updates and improvements 2.40.662 (November 5, 2012) pdf to html with vector images conversion improved minor bug-fixes related to vector images representation improved pdf support issue with incorrect folder name for images (was using original pdf filename instead of given html output filename) image files are now using shorter name to prevent issue with too long paths minor bug-fixes 2.40.650 (November 1, 2012) improved PDF to HTML conversion support improved support for Unicode text extraction improved support for PDF/A pdf files issues with white stripes appearing on multiple images combined fixed data extraction internal optimizations improved support for 8 bit images inside PDF vector drawings improved to provide better support for multiple small objects Color representation in images with indexed colors fixed Type2 fonts support improved Improved support for embedded fonts in PDF produced by Ghostscript engine CCIT images compression compression related issues fixed LZW compressed PDF support improved improved support for shading objects improved PDF fonts support improved support for PDF with 4 bit images new .ControlsAsText property. new source code sample: "PDF To HTML with Images" support for annotations and control output has been re-enabled memory and speed optimizations 2.30.568 (June 22, 2012) PDF to HTML conversion in PDF to simple HTML mode improved PDF Viewer demo fixes PDF to HTML mode settings dialog, adds AddFontHTMLTagsToText setting CSS mode is now selected by default new .AddFontStyleHTMLTagsToText to control if PDF To HTML SDK should add font style information to text objects or not PDF To HTML simple HTML mode to reworked and improved new .FontSubstitutionMap property for HTMLExtractor to read font substitution map used for PDF to HTML conversion new .PreserveFormattingOnTextExtraction property to HTML2PDF has been added new .HighPrecisionTextPositioning property for PDF To HTML with CSS mode multithreaded processing support imporved 2.20.487 (March 9, 2012) Fixing issue with black and white images missing from PDF files made by differente PDF writers Fixing inverted images in PDF files generated by some Adobe applications Improved support for corrupted PDF files Some security issues fixed Improved handling of JBIG encoded images Improved handling of PDF built-in fonts Improved handling of superscript text positions Improved support of encrypted PDF files Improved handling of different colorspaces for images 2.20.415 (December 21, 2011) improved fonts support conversion to simple HTML improved performance optimized and pdf processing speed improved improved support for PDF format PDF Viewer now provides ability to turn on/off text formatting support on extraction minor bug-fixes 2.10.356 (November 15, 2011) better support for different PDF files minor bug-fixes and improvements saving to HTML does not attemt to create .images folder anymore 2.10.318 (October 18, 2011) better support for PDF files (vector drawings are supported now) minor bug-fixes and improvements 2.10.304 (October 4, 2011) improved support for PDF files improved HTML output for multiple pages pdf processing speed increased minor bug-fixes and improvements 1.10.168 (May 6 2011) support for password protected PDF documents improved (was not working properly in previous release) minor bug-fixes and improvements 1.10.160 (12 April 2011) XML comments are available now to show hints for methods, classes and properties in Visual Studio New property: .ExtractColumnByColumn (false default), set to True to extract text column by column instead of line by line PDF Viewer freeware utility updated to feature "Extract Text (line by line)" and "Extract Text (column by column)" buttons improved support for single paged PDF documents produced by Acrobat Distiller software clipping issues were fixed fixed hanging on some broken PDF documents improved text decoding support minor bug-fixes 1.10.150 (10 March 2011) * PDF files support improved + now handles PDF files from Google Doc without errors * minor bug-fixes 1.10.144 (26 February 2011) + now works with secured documents (provide passsword if needed in .Password property) + minor bug-fixes and improvements + updated GUI demo application 1.10.121 (11 February 2011) + support for invisible text extraction added + minor bug-fixes and improvements 1.00.30 (9 November 2010) + new version