Link Search Menu Expand Document

Repair Text - VB.NET

PDF Extractor SDK sample in VB.NET demonstrating ‘Repair Text’

Imports Bytescout.PDFExtractor

Module Program

    Sub Main()


            Using extractor As New TextExtractor()

                ' Load PDF document

                ' Set the font repairing OCR mode 
                extractor.OCRMode = OCRMode.TextFromImagesAndVectorsAndRepairedFonts

                ' Set the location of OCR language data files
                extractor.OCRLanguageDataFolder = "c:\Program Files\Bytescout PDF Extractor SDK\ocrdata_best\"

                ' Set OCR language
                extractor.OCRLanguage = "eng" ' "eng" For english, "deu" For German, "fra" For French, "spa" For Spanish etc - according To files In "ocrdata" folder
                ' Find more language files at

                ' Set PDF document rendering resolution
                extractor.OCRResolution = 300

                ' Read all text
                Dim allText = extractor.GetText()

                Console.WriteLine("Extracted Text: ")

            End Using

        Catch ex As Exception
        End Try

        Console.WriteLine("Press any key to exit...")

    End Sub

End Module

Download Source Code (.zip)

Return to the previous page Explore PDF Extractor SDK