HTMLExtractor Methods
Free Trial Web API version Licensing Request A Quote
HAVE QUESTIONS OR NEED HELP?SUBMIT THE SUPPORT REQUEST FORM or write email toSUPPORT@BYTESCOUT.COM
The HTMLExtractor type exposes the following members.
Methods
| Name | Description | |
|---|---|---|
| CreateProfile(String, Boolean, Boolean, Boolean) | Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.) | |
| CreateProfile(String, String, Boolean, Boolean, Boolean) | Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.) | |
| Dispose | Releases the unmanaged resources used by the instance and optionally releases the managed resources. (Inherited from BaseExtractor.) | |
| DisposePage | Disposes the page object. Uses this method carefully to destroy the page object that should not be used further. Useful to free allocated memory when processing large PDF documents. | |
| Equals | (Inherited from Object.) | |
| Finalize | (Inherited from Object.) | |
| FireParsingError | (Inherited from BaseExtractor.) | |
| GetHashCode | (Inherited from Object.) | |
| GetHTML | Extracts HTML from the entire document. | |
| GetHTML(IListInt32) | Extracts HTML from specified pages. | |
| GetHTML(String) | Extracts HTML from specified page ranges. | |
| GetHTML(Int32, Int32) | Extracts HTML from specified page range. | |
| GetHTMLPage | Extracts HTML from specified document page. | |
| GetOutputHTMLPageHeight | Get height of the output page rendered in HTML format. | |
| GetPageCount | (Inherited from BaseExtractor.) | |
| GetPageHeight | Height of the PDF page (in pdf units). | |
| GetPageRect_Height | (Inherited from BaseExtractor.) | |
| GetPageRect_Left | (Inherited from BaseExtractor.) | |
| GetPageRect_Top | (Inherited from BaseExtractor.) | |
| GetPageRect_Width | (Inherited from BaseExtractor.) | |
| GetPageRectangle(Int32) | (Inherited from BaseExtractor.) | |
| GetPageRectangle(Int32, Boolean) | (Inherited from BaseExtractor.) | |
| GetPageWidth | Width of the PDF page (in pdf units). | |
| GetType | (Inherited from Object.) | |
| LoadAndApplyProfiles | Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction. (Inherited from BaseExtractor.) | |
| LoadDocumentFromFile | (Inherited from BaseExtractor.) | |
| LoadDocumentFromStream | (Inherited from BaseExtractor.) | |
| LoadProfiles | Loads profiles from JSON file. (Inherited from BaseExtractor.) | |
| LoadProfilesFromString | Loads profiles from JSON string. (Inherited from BaseExtractor.) | |
| MemberwiseClone | (Inherited from Object.) | |
| Reset | Resets the instance, disposes internal resources and releases the file. Use this method before loading another PDF file. (Overrides BaseExtractorReset.) | |
| ResetExtractionArea | (Inherited from BaseExtractor.) | |
| SaveHtmlPageToFile | Extracts HTML from specified page to stream. | |
| SaveHtmlPageToStream | Extracts HTML from specified page to stream. | |
| SaveHtmlToFile(String) | Extracts HTML from the entire document to file. | |
| SaveHtmlToFile(IListInt32, String) | Extracts HTML from specified pages to file. | |
| SaveHtmlToFile(String, String) | Extracts HTML from specified page ranges to file. | |
| SaveHtmlToFile(Int32, Int32, String) | Extracts HTML from specified page range to file. | |
| SaveHtmlToStream(Stream) | Extracts HTML from the entire document to stream. | |
| SaveHtmlToStream(IListInt32, Stream) | Extracts HTML from specified pages to stream. | |
| SaveHtmlToStream(String, Stream) | Extracts HTML from specified page ranges to stream. | |
| SaveHtmlToStream(Int32, Int32, Stream) | Extracts HTML from specified page range to stream. | |
| SetExtractionArea(RectangleF) | (Inherited from BaseExtractor.) | |
| SetExtractionArea(Double, Double, Double, Double) | (Inherited from BaseExtractor.) | |
| SetExtractionArea(Single, Single, Single, Single) | (Inherited from BaseExtractor.) | |
| ToString | (Inherited from Object.) |
See Also