HTMLExtractor Methods
The HTMLExtractor type exposes the following members.
Methods
Name | Description | |
---|---|---|
CreateProfile(String, Boolean, Boolean, Boolean) | Creates a JSON profile with all extractor properties set to their current values. (Inherited from BaseExtractor.) | |
CreateProfile(String, String, Boolean, Boolean, Boolean) | Creates a JSON profile with all extractor properties set to their current values. (Inherited from BaseExtractor.) | |
Dispose | Releases the unmanaged resources used by the instance and optionally releases the managed resources. (Inherited from BaseExtractor.) | |
DisposePage | Disposes the page object. Use this method carefully: it destroys the page object, which should not be used afterwards. Useful for freeing allocated memory when processing large PDF documents. | |
Equals | (Inherited from Object.) | |
Finalize | (Inherited from Object.) | |
FireParsingError | (Inherited from BaseExtractor.) | |
GetHashCode | (Inherited from Object.) | |
GetHTML | Extracts HTML from the entire document. | |
GetHTML(IList<Int32>) | Extracts HTML from specified pages. | |
GetHTML(String) | Extracts HTML from specified page ranges. | |
GetHTML(Int32, Int32) | Extracts HTML from specified page range. | |
GetHTMLPage | Extracts HTML from specified document page. | |
GetOutputHTMLPageHeight | Gets the height of the output page rendered in HTML format. | |
GetPageCount | (Inherited from BaseExtractor.) | |
GetPageHeight | Gets the height of the PDF page (in PDF units). | |
GetPageRect_Height | (Inherited from BaseExtractor.) | |
GetPageRect_Left | (Inherited from BaseExtractor.) | |
GetPageRect_Top | (Inherited from BaseExtractor.) | |
GetPageRect_Width | (Inherited from BaseExtractor.) | |
GetPageRectangle(Int32) | (Inherited from BaseExtractor.) | |
GetPageRectangle(Int32, Boolean) | (Inherited from BaseExtractor.) | |
GetPageWidth | Gets the width of the PDF page (in PDF units). | |
GetType | (Inherited from Object.) | |
LoadAndApplyProfiles | Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction. (Inherited from BaseExtractor.) | |
LoadDocumentFromFile | (Inherited from BaseExtractor.) | |
LoadDocumentFromStream | (Inherited from BaseExtractor.) | |
LoadProfiles | Loads profiles from JSON file. (Inherited from BaseExtractor.) | |
LoadProfilesFromString | Loads profiles from JSON string. (Inherited from BaseExtractor.) | |
MemberwiseClone | (Inherited from Object.) | |
Reset | Resets the instance, disposes internal resources and releases the file. Use this method before loading another PDF file. (Overrides BaseExtractor.Reset.) | |
ResetExtractionArea | (Inherited from BaseExtractor.) | |
SaveHtmlPageToFile | Extracts HTML from specified page to file. | |
SaveHtmlPageToStream | Extracts HTML from specified page to stream. | |
SaveHtmlToFile(String) | Extracts HTML from the entire document to file. | |
SaveHtmlToFile(IList<Int32>, String) | Extracts HTML from specified pages to file. | |
SaveHtmlToFile(String, String) | Extracts HTML from specified page ranges to file. | |
SaveHtmlToFile(Int32, Int32, String) | Extracts HTML from specified page range to file. | |
SaveHtmlToStream(Stream) | Extracts HTML from the entire document to stream. | |
SaveHtmlToStream(IList<Int32>, Stream) | Extracts HTML from specified pages to stream. | |
SaveHtmlToStream(String, Stream) | Extracts HTML from specified page ranges to stream. | |
SaveHtmlToStream(Int32, Int32, Stream) | Extracts HTML from specified page range to stream. | |
SetExtractionArea(RectangleF) | (Inherited from BaseExtractor.) | |
SetExtractionArea(Double, Double, Double, Double) | (Inherited from BaseExtractor.) | |
SetExtractionArea(Single, Single, Single, Single) | (Inherited from BaseExtractor.) | |
ToString | (Inherited from Object.) |
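
The following is a minimal sketch of how several of the methods listed above might be combined. It is not taken from the SDK documentation. It assumes the `Bytescout.PDFExtractor` namespace, a parameterless `HTMLExtractor` constructor, zero-based page indexes, and that `GetHTMLPage` and `DisposePage` each take a single `Int32` page index; the file names are placeholders. Licensing and registration setup is omitted.

```csharp
using System;
using System.IO;
using Bytescout.PDFExtractor; // assumed namespace of the SDK assembly

class HtmlExtractionSketch
{
    static void Main()
    {
        // Assumption: HTMLExtractor has a public parameterless constructor.
        HTMLExtractor extractor = new HTMLExtractor();
        try
        {
            // Load the source PDF (placeholder file name).
            extractor.LoadDocumentFromFile("sample.pdf");

            // Whole document to a single HTML file: SaveHtmlToFile(String).
            extractor.SaveHtmlToFile("output.html");

            // Page-by-page extraction, releasing each page after use
            // to limit memory consumption on large documents.
            int pageCount = extractor.GetPageCount();
            for (int i = 0; i < pageCount; i++) // assumption: zero-based index
            {
                // Assumption: GetHTMLPage(Int32) returns the page HTML as a string.
                string html = extractor.GetHTMLPage(i);
                File.WriteAllText("page" + i + ".html", html);

                // Assumption: DisposePage(Int32); the page must not be used afterwards.
                extractor.DisposePage(i);
            }

            // Release the file before loading another PDF with the same instance.
            extractor.Reset();
        }
        finally
        {
            // Release unmanaged resources held by the extractor.
            extractor.Dispose();
        }
    }
}
```

Calling `Reset` before a second `LoadDocumentFromFile` follows the note in the `Reset` description. `DisposePage` is called only after the page's HTML has been written out, since the description warns that a disposed page should not be used again.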