IBaseTextExtractor Methods

The IBaseTextExtractor type exposes the following members.

Methods

	Name	Description
	AddFilter(String, Boolean, Boolean)	Adds a filter to remove a text from extracted data.
	AddFilter(String, Int32, Boolean)	Adds filter to exclude text objects with specified attributes.
	AddFilter(String, Int32, Color, Boolean)	Adds filter to exclude text objects with specified attributes.
	AddFilter(String, String, Boolean, Boolean)	Adds a filter to replace a text in extracted data.
	AddFilter(String, Int32, Int32, Int32, Int32, Boolean)	Adds filter to exclude text objects with specified attributes.
	DisposePage	Disposes the page object. Uses this method carefully to destroy the page object that should not be used further. Useful to free allocated memory when processing huge PDF documents.
	GetPreprocessedPagePreview	Returns preview image of document page with preprocessing filters applied.
	IsOCRRecommendedForPage	Detects whether OCR is recommended for specified page. OCR (Optical Character Recognition) is recommended when pages has no text objects bat has an image that might contain text. (Inherited from IBaseOCRExtractor.)
	ResetFilters	Reset text filters.
	SavePreprocessedPagePreview	Saves preview image of document page with preprocessing filters applied. Image is saved in PNG format.
	SetCustomExtractionColumns	Helper method to set CustomExtractionColumns property when using the extractor though COM from VC++ VB, VBA, VBScript, or Delphi.