Link Search Menu Expand Document

Get Word Coordinates in XML - VBScript

PDF Extractor SDK sample in VBScript demonstrating ‘Get Word Coordinates in XML’

PdfToXml.vbs
' Create Bytescout.PDFExtractor.XMLExtractor object
Set extractor = CreateObject("Bytescout.PDFExtractor.XMLExtractor")
extractor.RegistrationName = "demo"
extractor.RegistrationKey = "demo"

' Load sample PDF document
extractor.LoadDocumentFromFile "sample3.pdf"

' Add the following params to get clean data with word nodes only:
extractor.DetectNewColumnBySpacesRatio = 0.1 ' this splits all text into words
extractor.PreserveFormattingOnTextExtraction = false ' get rid of empty nodes

extractor.SaveXMLToFile "output.xml"

WScript.Echo "Extracted data saved to 'output.xml' file."

Download Source Code (.zip)

Return to the previous page Explore PDF Extractor SDK