Link Search Menu Expand Document

Repair Text - VB.NET

PDF Extractor SDK sample in VB.NET demonstrating ‘Repair Text’

Imports Bytescout.PDFExtractor

Module Program

    Sub Main()


            Using extractor As New TextExtractor()

                ' Load PDF document

                ' Set the font repairing OCR mode 
                extractor.OCRMode = OCRMode.TextFromImagesAndVectorsAndRepairedFonts

                ' Set the location of OCR language data files
                extractor.OCRLanguageDataFolder = "c:\Program Files\Bytescout PDF Extractor SDK\ocrdata_best\"

                ' Set OCR language
                extractor.OCRLanguage = "eng" ' "eng" For english, "deu" For German, "fra" For French, "spa" For Spanish etc - according To files In "ocrdata" folder
                ' Find more language files at

                ' Set PDF document rendering resolution
                extractor.OCRResolution = 300

                ' Read all text
                Dim allText = extractor.GetText()

                Console.WriteLine("Extracted Text: ")

            End Using

        Catch ex As Exception
        End Try

        Console.WriteLine("Press any key to exit...")

    End Sub

End Module

Download Source Code (.zip)

Return to the previous page Explore PDF Extractor SDK

Copyright © 2016 - 2023 ByteScout