Return to previous page Explore PDF To HTML SDK

HTMLExtractor Methods

Free Trial Web API version Licensing Request A Quote

HAVE QUESTIONS OR NEED HELP?SUBMIT THE SUPPORT REQUEST FORM or write email toSUPPORT@BYTESCOUT.COM

The HTMLExtractor type exposes the following members.

Methods

	Name	Description
	CreateProfile(String, Boolean, Boolean, Boolean)	Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.)
	CreateProfile(String, String, Boolean, Boolean, Boolean)	Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.)
	Dispose	Releases the unmanaged resources used by the instance and optionally releases the managed resources. (Inherited from BaseExtractor.)
	DisposePage	Disposes the page object. Uses this method carefully to destroy the page object that should not be used further. Useful to free allocated memory when processing large PDF documents.
	Equals	(Inherited from Object.)
	Finalize	(Inherited from Object.)
	FireParsingError	(Inherited from BaseExtractor.)
	GetHashCode	(Inherited from Object.)
	GetHTML	Extracts HTML from the entire document.
	GetHTML(IListInt32)	Extracts HTML from specified pages.
	GetHTML(String)	Extracts HTML from specified page ranges.
	GetHTML(Int32, Int32)	Extracts HTML from specified page range.
	GetHTMLPage	Extracts HTML from specified document page.
	GetOutputHTMLPageHeight	Get height of the output page rendered in HTML format.
	GetPageCount	(Inherited from BaseExtractor.)
	GetPageHeight	Height of the PDF page (in pdf units).
	GetPageRect_Height	(Inherited from BaseExtractor.)
	GetPageRect_Left	(Inherited from BaseExtractor.)
	GetPageRect_Top	(Inherited from BaseExtractor.)
	GetPageRect_Width	(Inherited from BaseExtractor.)
	GetPageRectangle(Int32)	(Inherited from BaseExtractor.)
	GetPageRectangle(Int32, Boolean)	(Inherited from BaseExtractor.)
	GetPageWidth	Width of the PDF page (in pdf units).
	GetType	(Inherited from Object.)
	LoadAndApplyProfiles	Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction. (Inherited from BaseExtractor.)
	LoadDocumentFromFile	(Inherited from BaseExtractor.)
	LoadDocumentFromStream	(Inherited from BaseExtractor.)
	LoadProfiles	Loads profiles from JSON file. (Inherited from BaseExtractor.)
	LoadProfilesFromString	Loads profiles from JSON string. (Inherited from BaseExtractor.)
	MemberwiseClone	(Inherited from Object.)
	Reset	Resets the instance, disposes internal resources and releases the file. Use this method before loading another PDF file. (Overrides BaseExtractorReset.)
	ResetExtractionArea	(Inherited from BaseExtractor.)
	SaveHtmlPageToFile	Extracts HTML from specified page to stream.
	SaveHtmlPageToStream	Extracts HTML from specified page to stream.
	SaveHtmlToFile(String)	Extracts HTML from the entire document to file.
	SaveHtmlToFile(IListInt32, String)	Extracts HTML from specified pages to file.
	SaveHtmlToFile(String, String)	Extracts HTML from specified page ranges to file.
	SaveHtmlToFile(Int32, Int32, String)	Extracts HTML from specified page range to file.
	SaveHtmlToStream(Stream)	Extracts HTML from the entire document to stream.
	SaveHtmlToStream(IListInt32, Stream)	Extracts HTML from specified pages to stream.
	SaveHtmlToStream(String, Stream)	Extracts HTML from specified page ranges to stream.
	SaveHtmlToStream(Int32, Int32, Stream)	Extracts HTML from specified page range to stream.
	SetExtractionArea(RectangleF)	(Inherited from BaseExtractor.)
	SetExtractionArea(Double, Double, Double, Double)	(Inherited from BaseExtractor.)
	SetExtractionArea(Single, Single, Single, Single)	(Inherited from BaseExtractor.)
	ToString	(Inherited from Object.)

Top

Reference

HTMLExtractor Class

Bytescout.PDF2HTML Namespace