PDF Extractor SDK
Extract text, tables, images, attachments, and other data from PDF documents. The SDK reads tables to CSV, extracts text from images, extracts attachments, and supports OCR in one or more languages. Built-in OCR filters handle noisy images and damaged text transparently. Output can be converted to common formats such as TXT, JSON, XLS, XLSX, CSV, or XML. AI-powered table and document analysis functions are also included.
Go To Samples
SDK Reference
- About ByteScout PDF Extractor SDK
- Getting Started
- PDF Extractor SDK API Reference
- Bytescout.PDFExtractor
- BaseTextExtractor Class
- CSVExtractor Class
- DocumentMerger Class
- DocumentOptimizer Class
- DocumentRotator Class
- DocumentSplitter Class
- DocumentSplitter2 Class
- ICSVExtractor Interface
- IDocumentOptimizer Interface
- IDocumentSplitter Interface
- IImageExtractor Interface
- IInfoExtractor Interface
- IJSONExtractor Interface
- ImageExtractor Class
- ImagePreprocessingFiltersCollection Class
- IRemover2 Interface
- ISearchablePDFMaker Interface
- ITextExtractor Interface
- IUnsearchablePDFMaker Interface
- IXLSExtractor Interface
- IXMLExtractor Interface
- JSONExtractor Class
- Remover Class
- Remover2 Class
- SearchablePDFMaker Class
- SensitiveDataDetector Class
- TextExtractor Class
- UnsearchablePDFMaker Class
- XLSExtractor Class
- Installation Tasks (.NET)
- Installation Tasks (ActiveX for VB6 and other)
- History
- License Agreement
- How To Buy
Knowledgebase
ByteScout Extractor SDK fails with ASP.NET on a specific server but works on another
Is there a schema file for the XML reported from PDF documents?
VBScript assembly error when passing bytes array to LoadDocument
Samples
- ASP Classic - Extract Text from PDF
- ASP.NET C# - Batch Process PDF to Text
- ASP.NET C# - Extract Attachments from PDF
- ASP.NET C# - Extract Images from PDF
- ASP.NET C# - Extract Text By Columns from PDF
- ASP.NET C# - Extract Text From Page Area in PDF
- ASP.NET C# - Extract Text from PDF
- ASP.NET C# - Find Text With Hyphens in PDF
- ASP.NET C# - Find Text in PDF
- ASP.NET C# - Get PDF Document Information
- ASP.NET C# - Make PDF Unsearchable
- ASP.NET C# - OCR (Optical Character Recognition) in PDF
- ASP.NET C# - ZUGFeRD Invoice Extraction
- C# - Add Image Stamp in PDF
- C# - Arabic Text Extraction
- C# - Batch check folder against JSON settings
- C# - Census Table from Life and Annuity quote request PDF
- C# - Check If OCR Is Required for PDF
- C# - Compare PDF Documents
- C# - Convert PDF to Black and White
- C# - Convert PDF to Black and White Excluding Some Pages
- C# - Convert Protected PDF Document to XLS
- C# - Correction to Deal with Date Issue after OCR
- C# - Data Masking in PDF
- C# - Detect Lines in PDF
- C# - Display License Info
- C# - Download and Process file
- C# - Extract 3D Animation from PDF
- C# - Extract Attachments from PDF
- C# - Extract Audio from PDF
- C# - Extract CSV from PDF and Fill Database (SQL Server)
- C# - Extract Filled PDF Form Data
- C# - Extract Images by Page from PDF
- C# - Extract Images from PDF
- C# - Extract PDF Pages
- C# - Extract PDF Text From Page Area
- C# - Extract PDF document Info
- C# - Extract PDF text To Stream
- C# - Extract Table Structure from PDF
- C# - Extract Text By Columns from PDF
- C# - Extract Text From PDF By Pages
- C# - Extract Text from Foldable Brochure Booklet
- C# - Extract Text from PDF
- C# - Extract Video from PDF
- C# - Extraction From Complex Borderless Tables
- C# - Filter Watermark Text
- C# - Find Credit Card Number in PDF with Regex
- C# - Find Email Addresses in PDF with Regex
- C# - Find Invoice Total Amount in PDF with Regex
- C# - Find Keyword And Extract Page in PDF
- C# - Find Keyword And Extract Text in PDF
- C# - Find PDF Borderless Table And Extract As CSV
- C# - Find PDF Table And Extract As CSV
- C# - Find PDF Table And Extract As JSON
- C# - Find PDF Table And Extract As Text
- C# - Find PDF Table And Extract As XML
- C# - Find Phone Number in PDF with Regex
- C# - Find SSN in PDF with Regex
- C# - Find Text in PDF
- C# - Find Text in PDF With Hyphens
- C# - Find Text in PDF using Regex
- C# - Find Text in PDF with Smart Match
- C# - Find US Address in PDF with Regex
- C# - Find Website Addresses in PDF with Regex
- C# - Find Zip Code in PDF with Regex
- C# - Get Word Coordinates in JSON
- C# - Get Word Coordinates in XML
- C# - Index PDF Documents In Folder
- C# - Index PDF Files
- C# - Load Unicode CSV to DataTable using OLEDB
- C# - Make Searchable PDF
- C# - Make Searchable PDF Discarding Existing Content
- C# - Make Searchable PDF and Fix Rotated Pages
- C# - Make Searchable PDF from Image
- C# - Make Unsearchable PDF
- C# - Maximize performance and speed
- C# - Merge All Documents Within Folder
- C# - Merge PDF Documents
- C# - Merge Protected PDF Documents
- C# - OCR Analyser
- C# - OCR Modes
- C# - OCR With Best Dataset
- C# - OCR With Fast Dataset
- C# - OCR With Mean Dataset
- C# - OCR with Multiple Languages
- C# - Optimize PDF
- C# - PDF Batch Processing
- C# - PDF Invoice Parsing
- C# - PDF To CSV
- C# - PDF To CSV (Merge multiline text to table cell)
- C# - PDF To CSV By Pages
- C# - PDF To JSON
- C# - PDF To JSON With Images
- C# - PDF To XFDF
- C# - PDF To XLS
- C# - PDF To XLSX
- C# - PDF To XLSX by pages
- C# - PDF To XML
- C# - PDF To XML With Images
- C# - PDF XFA Form To XML
- C# - PDF and OCR (Optical Character Recognition)
- C# - PDF files Parallel Processing
- C# - PDF to Scanned PDF
- C# - PDF-A Compatibility Test
- C# - Read Hindi Text from PDF
- C# - Read Text From Noisy Image
- C# - Read Values from PDF Form Fields
- C# - Reading and Writing to Azure Blob
- C# - Reduce Memory Usage for PDF to Text
- C# - Remove Empty Pages from PDF
- C# - Remove SSN Number from PDF Document
- C# - Remove Text from PDF
- C# - Repair Text in PDF
- C# - Rotate PDF Document
- C# - Scanned PDF to CSV
- C# - Scanned PDF to JSON
- C# - Scanned PDF to Text
- C# - Scanned PDF to XML
- C# - SearchablePDFMaker Progress Indication
- C# - Sensitive Data Detector
- C# - Set Configuration Profiles
- C# - Split Document into Separate Pages
- C# - Split PDF Document
- C# - Split PDF Document By Text
- C# - Split Protected PDF Document
- C# - TextExtractor Progress Indication
- C# - ZUGFeRD Invoice Extraction
- C++ - Compare PDF Documents
- C++ - Convert Protected PDF Document to Excel (C++ CLR)
- C++ - Extract PDF Pages
- C++ - Extract Text from PDF
- C++ - Find Table And Extract As CSV from PDF
- C++ - Merge PDF Documents
- C++ - Merge Protected PDF Documents (C++ CLR)
- C++ - PDF and OCR (Optical Character Recognition)
- C++ - Split PDF Document
- C++ - Split Protected PDF Document (C++ CLR)
- Delphi - Arabic Text Extraction
- Delphi - Convert PDF To CSV
- Delphi - Detect Lines in PDF
- Delphi - OCR Analyser
- Delphi - PDF Batch Processing
- Delphi - Read Text From Noisy Image
- Microsoft Excel - Extract Text From Coordinates from PDF
- Microsoft Excel - Merge PDF Documents
- Powershell - Arabic Text Extraction
- Powershell - Extract Images from PDF
- Powershell - Extract PDF Text From Page Area
- Powershell - Extract Text By Columns from PDF
- Powershell - Extract Text from PDF
- Powershell - Find Email Addresses in PDF with Regex
- Powershell - Find Invoice Total Amount in PDF with Regex
- Powershell - Find Keyword And Extract Page in PDF
- Powershell - Find PDF Table And Extract As CSV
- Powershell - Find PDF Table And Extract As XML
- Powershell - Find Text in PDF
- Powershell - Find Text in PDF using Regex
- Powershell - Make Searchable PDF
- Powershell - Merge All Documents Within Folder
- Powershell - Merge PDF Documents
- Powershell - PDF To CSV
- Powershell - PDF To CSV By Pages
- Powershell - PDF To JSON
- Powershell - PDF To JSON With Images
- Powershell - PDF To XLS
- Powershell - PDF To XLSX
- Powershell - PDF To XML
- Powershell - Split PDF Document
- VB.NET - Add Image Stamp in PDF
- VB.NET - Arabic Text Extraction
- VB.NET - Batch check folder against JSON settings
- VB.NET - Census Table from Life and Annuity quote request PDF
- VB.NET - Check If OCR Is Required for PDF
- VB.NET - Compare PDF Documents
- VB.NET - Convert PDF To CSV
- VB.NET - Convert PDF To CSV (Merge multiline text to table cell)
- VB.NET - Convert PDF To CSV By Pages
- VB.NET - Convert PDF To JSON
- VB.NET - Convert PDF To JSON With Images
- VB.NET - Convert PDF To XFDF
- VB.NET - Convert PDF To XLS
- VB.NET - Convert PDF To XLSX by pages
- VB.NET - Convert PDF To XML
- VB.NET - Convert PDF to Black and White
- VB.NET - Convert PDF to Black and White Excluding Some Pages
- VB.NET - Convert Protected PDF Document to XLS
- VB.NET - Convert XFA Form To XML in PDF
- VB.NET - Correction to Deal with Date Issue after OCR
- VB.NET - Data Masking in PDF
- VB.NET - Detect Lines in PDF
- VB.NET - Display License Info
- VB.NET - Download and Process file
- VB.NET - Extract 3D Animation from PDF
- VB.NET - Extract Attachments from PDF
- VB.NET - Extract Audio from PDF
- VB.NET - Extract CSV from PDF and Fill Database in SQL Server
- VB.NET - Extract Filled PDF Form Data
- VB.NET - Extract Images by Page from PDF
- VB.NET - Extract Images from PDF
- VB.NET - Extract Info about PDF
- VB.NET - Extract PDF to Text as Stream
- VB.NET - Extract Pages from PDF
- VB.NET - Extract Table Structure from PDF
- VB.NET - Extract Text By Columns from PDF
- VB.NET - Extract Text By Pages from PDF
- VB.NET - Extract Text From Page Area in PDF
- VB.NET - Extract Text from Foldable Brochure Booklet
- VB.NET - Extract Text from PDF
- VB.NET - Extract Video from PDF
- VB.NET - Extraction From Complex Borderless Tables
- VB.NET - Filter Watermark Text
- VB.NET - Find Borderless Table in PDF And Extract As CSV
- VB.NET - Find Credit Card Number in PDF with Regex
- VB.NET - Find Email Addresses in PDF using Regex
- VB.NET - Find Invoice Total Amount in PDF with Regex
- VB.NET - Find Keyword in PDF And Extract Page
- VB.NET - Find Keyword in PDF And Extract Text
- VB.NET - Find Phone Number in PDF with Regex
- VB.NET - Find SSN Number in PDF with Regex
- VB.NET - Find Table in PDF And Extract As CSV
- VB.NET - Find Table in PDF And Extract As JSON
- VB.NET - Find Table in PDF And Extract As Text
- VB.NET - Find Table in PDF And Extract As XML
- VB.NET - Find Text With Hyphens in PDF
- VB.NET - Find Text in PDF
- VB.NET - Find Text in PDF with Regex
- VB.NET - Find Text in PDF with Smart Match
- VB.NET - Find US Address in PDF with Regex
- VB.NET - Find Website Addresses in PDF with Regex
- VB.NET - Find Zip Code in PDF with Regex
- VB.NET - Get Word Coordinates in JSON
- VB.NET - Get Word Coordinates in XML
- VB.NET - Index PDF Documents In Folder
- VB.NET - Index PDF Files
- VB.NET - Load Unicode CSV to DataTable using OLEDB
- VB.NET - Make Searchable PDF
- VB.NET - Make Searchable PDF Discarding Existing Content
- VB.NET - Make Searchable PDF and Fix Rotated Pages
- VB.NET - Make Searchable PDF from Image
- VB.NET - Make Unsearchable PDF
- VB.NET - Maximize performance and speed
- VB.NET - Merge All Documents Within Folder
- VB.NET - Merge PDF Documents
- VB.NET - Merge Protected PDF Documents
- VB.NET - OCR (Optical Character Recognition) and PDF
- VB.NET - OCR Analyser in PDF
- VB.NET - OCR Modes
- VB.NET - OCR With Best Dataset
- VB.NET - OCR With Fast Dataset
- VB.NET - OCR With Mean Dataset
- VB.NET - OCR with Multiple Languages
- VB.NET - Optimize PDF
- VB.NET - PDF Invoice Parsing
- VB.NET - PDF To XML With Images
- VB.NET - PDF files Batch Processing
- VB.NET - PDF to Scanned PDF
- VB.NET - PDF-A Compatibility Test
- VB.NET - Parallel Processing of PDF files
- VB.NET - Profiles
- VB.NET - Read Hindi Text
- VB.NET - Read Text From Noisy Image
- VB.NET - Read Values from Form Fields
- VB.NET - Reading and Writing to Azure Blob
- VB.NET - Reduce Memory Usage
- VB.NET - Remove Empty Pages
- VB.NET - Remove SSN Number from PDF Document
- VB.NET - Remove Text
- VB.NET - Repair Text
- VB.NET - Rotate Document
- VB.NET - Scanned PDF To CSV
- VB.NET - Scanned PDF To JSON
- VB.NET - Scanned PDF To Text
- VB.NET - Scanned PDF To XML
- VB.NET - SearchablePDFMaker Progress Indication
- VB.NET - Sensitive Data Detector
- VB.NET - Split Document
- VB.NET - Split Protected PDF Document
- VB.NET - Split document into separate pages
- VB.NET - TextExtractor Progress Indication
- VB.NET - ZUGFeRD Invoice Extraction
- VB.NET - Convert PDF To XLSX
- VB6 - Convert PDF To CSV
- VB6 - Convert PDF To CSV (Merge multiline text to table cell)
- VB6 - Convert PDF To JSON
- VB6 - Convert PDF To Text
- VB6 - Convert PDF To XML
- VB6 - Merge PDF
- VB6 - Scanned PDF To Text
- VB6 - Split PDF
- VBScript - Arabic Text Extraction
- VBScript - Compare PDF Documents
- VBScript - Convert PDF To CSV
- VBScript - Convert PDF To CSV (Merge multiline text to table cell)
- VBScript - Convert PDF To CSV By Pages
- VBScript - Convert PDF To JSON
- VBScript - Convert PDF To JSON With Images
- VBScript - Convert PDF To XFDF
- VBScript - Convert PDF To XLS
- VBScript - Convert PDF To XML
- VBScript - Convert PDF To XML With Images
- VBScript - Convert PDF to Black and White
- VBScript - Convert PDF to Black and White Excluding Some Pages
- VBScript - Display License Info
- VBScript - Extract Attachments from PDF
- VBScript - Extract Image Coordinates By Page from PDF
- VBScript - Extract Images Coordinates from PDF
- VBScript - Extract Images by Page from PDF
- VBScript - Extract Images from PDF
- VBScript - Extract Pages from PDF
- VBScript - Extract Table Structure from PDF
- VBScript - Extract Text By Columns from PDF
- VBScript - Extract Text By Pages from PDF
- VBScript - Extract Text From Page Area from PDF
- VBScript - Extract Text from PDF
- VBScript - Extract PDF Document Info
- VBScript - Extraction From Complex Borderless Tables
- VBScript - Find Hyphenated Text in PDF
- VBScript - Find PDF Table And Extract As CSV
- VBScript - Find PDF Table And Extract As XML
- VBScript - Find Text in PDF
- VBScript - Find Text in PDF Using Regex
- VBScript - Get Word Coordinates in JSON
- VBScript - Get Word Coordinates in XML
- VBScript - Index PDF Files
- VBScript - Make Searchable PDF
- VBScript - Make Searchable PDF Discarding Existing Content
- VBScript - Make Searchable PDF and Fix Rotated Pages
- VBScript - Make Unsearchable PDF
- VBScript - Maximize performance and speed
- VBScript - Merge All Documents Within Folder
- VBScript - Merge PDF Documents
- VBScript - OCR Analyser for PDF
- VBScript - OCR With Best Dataset
- VBScript - OCR With Fast Dataset
- VBScript - OCR With Mean Dataset
- VBScript - OCR with Multiple Languages
- VBScript - PDF Extraction Profiles
- VBScript - PDF OCR (Optical Character Recognition)
- VBScript - PDF XFA Form To XML
- VBScript - PDF files Batch Processing
- VBScript - PDF to Scanned PDF
- VBScript - Reduce Memory Usage for PDF Extraction
- VBScript - Remove Text from PDF
- VBScript - Rotate PDF Document
- VBScript - Split PDF Document
- VBScript - ZUGFeRD Invoice Extraction