HTMLExtractor Methods
Free Trial Web API version Licensing Request A Quote
HAVE QUESTIONS OR NEED HELP?SUBMIT THE SUPPORT REQUEST FORM or write email toSUPPORT@BYTESCOUT.COM
The HTMLExtractor type exposes the following members.
Methods
Name | Description | |
---|---|---|
![]() | CreateProfile(String, Boolean, Boolean, Boolean) | Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.) |
![]() | CreateProfile(String, String, Boolean, Boolean, Boolean) | Creates JSON profile will all extractor properties with current values. (Inherited from BaseExtractor.) |
![]() | Dispose | Releases the unmanaged resources used by the instance and optionally releases the managed resources. (Inherited from BaseExtractor.) |
![]() | DisposePage | Disposes the page object. Uses this method carefully to destroy the page object that should not be used further. Useful to free allocated memory when processing large PDF documents. |
![]() | Equals | (Inherited from Object.) |
![]() | Finalize | (Inherited from Object.) |
![]() | FireParsingError | (Inherited from BaseExtractor.) |
![]() | GetHashCode | (Inherited from Object.) |
![]() | GetHTML | Extracts HTML from the entire document. |
![]() | GetHTML(IListInt32) | Extracts HTML from specified pages. |
![]() | GetHTML(String) | Extracts HTML from specified page ranges. |
![]() | GetHTML(Int32, Int32) | Extracts HTML from specified page range. |
![]() | GetHTMLPage | Extracts HTML from specified document page. |
![]() | GetOutputHTMLPageHeight | Get height of the output page rendered in HTML format. |
![]() | GetPageCount | (Inherited from BaseExtractor.) |
![]() | GetPageHeight | Height of the PDF page (in pdf units). |
![]() | GetPageRect_Height | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Left | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Top | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Width | (Inherited from BaseExtractor.) |
![]() | GetPageRectangle(Int32) | (Inherited from BaseExtractor.) |
![]() | GetPageRectangle(Int32, Boolean) | (Inherited from BaseExtractor.) |
![]() | GetPageWidth | Width of the PDF page (in pdf units). |
![]() | GetType | (Inherited from Object.) |
![]() | LoadAndApplyProfiles | Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction. (Inherited from BaseExtractor.) |
![]() | LoadDocumentFromFile | (Inherited from BaseExtractor.) |
![]() | LoadDocumentFromStream | (Inherited from BaseExtractor.) |
![]() | LoadProfiles | Loads profiles from JSON file. (Inherited from BaseExtractor.) |
![]() | LoadProfilesFromString | Loads profiles from JSON string. (Inherited from BaseExtractor.) |
![]() | MemberwiseClone | (Inherited from Object.) |
![]() | Reset | Resets the instance, disposes internal resources and releases the file. Use this method before loading another PDF file. (Overrides BaseExtractorReset.) |
![]() | ResetExtractionArea | (Inherited from BaseExtractor.) |
![]() | SaveHtmlPageToFile | Extracts HTML from specified page to stream. |
![]() | SaveHtmlPageToStream | Extracts HTML from specified page to stream. |
![]() | SaveHtmlToFile(String) | Extracts HTML from the entire document to file. |
![]() | SaveHtmlToFile(IListInt32, String) | Extracts HTML from specified pages to file. |
![]() | SaveHtmlToFile(String, String) | Extracts HTML from specified page ranges to file. |
![]() | SaveHtmlToFile(Int32, Int32, String) | Extracts HTML from specified page range to file. |
![]() | SaveHtmlToStream(Stream) | Extracts HTML from the entire document to stream. |
![]() | SaveHtmlToStream(IListInt32, Stream) | Extracts HTML from specified pages to stream. |
![]() | SaveHtmlToStream(String, Stream) | Extracts HTML from specified page ranges to stream. |
![]() | SaveHtmlToStream(Int32, Int32, Stream) | Extracts HTML from specified page range to stream. |
![]() | SetExtractionArea(RectangleF) | (Inherited from BaseExtractor.) |
![]() | SetExtractionArea(Double, Double, Double, Double) | (Inherited from BaseExtractor.) |
![]() | SetExtractionArea(Single, Single, Single, Single) | (Inherited from BaseExtractor.) |
![]() | ToString | (Inherited from Object.) |
See Also