Link Search Menu Expand Document

HTMLExtractor Methods

The HTMLExtractor type exposes the following members.

Methods
NameDescription
Public methodCreateProfile(String, Boolean, Boolean, Boolean)
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.)
Public methodCreateProfile(String, String, Boolean, Boolean, Boolean)
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.)
Public methodDispose
Releases the unmanaged resources used by the instance and optionally releases the managed resources.
(Inherited from BaseExtractor.)
Public methodDisposePage
Disposes the page object. Uses this method carefully to destroy the page object that should not be used further. Useful to free allocated memory when processing large PDF documents.
Public methodEquals (Inherited from Object.)
Protected methodFinalize (Inherited from Object.)
Protected methodFireParsingError (Inherited from BaseExtractor.)
Public methodGetHashCode (Inherited from Object.)
Public methodGetHTML
Extracts HTML from the entire document.
Public methodGetHTML(IListInt32)
Extracts HTML from specified pages.
Public methodGetHTML(String)
Extracts HTML from specified page ranges.
Public methodGetHTML(Int32, Int32)
Extracts HTML from specified page range.
Public methodGetHTMLPage
Extracts HTML from specified document page.
Public methodGetOutputHTMLPageHeight
Get height of the output page rendered in HTML format.
Public methodGetPageCount (Inherited from BaseExtractor.)
Public methodGetPageHeight
Height of the PDF page (in pdf units).
Public methodGetPageRect_Height (Inherited from BaseExtractor.)
Public methodGetPageRect_Left (Inherited from BaseExtractor.)
Public methodGetPageRect_Top (Inherited from BaseExtractor.)
Public methodGetPageRect_Width (Inherited from BaseExtractor.)
Public methodGetPageRectangle(Int32) (Inherited from BaseExtractor.)
Public methodGetPageRectangle(Int32, Boolean) (Inherited from BaseExtractor.)
Public methodGetPageWidth
Width of the PDF page (in pdf units).
Public methodGetType (Inherited from Object.)
Public methodLoadAndApplyProfiles
Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction.
(Inherited from BaseExtractor.)
Public methodLoadDocumentFromFile (Inherited from BaseExtractor.)
Public methodLoadDocumentFromStream (Inherited from BaseExtractor.)
Public methodLoadProfiles
Loads profiles from JSON file.
(Inherited from BaseExtractor.)
Public methodLoadProfilesFromString
Loads profiles from JSON string.
(Inherited from BaseExtractor.)
Protected methodMemberwiseClone (Inherited from Object.)
Public methodReset
Resets the instance, disposes internal resources and releases the file. Use this method before loading another PDF file.
(Overrides BaseExtractorReset.)
Public methodResetExtractionArea (Inherited from BaseExtractor.)
Public methodSaveHtmlPageToFile
Extracts HTML from specified page to stream.
Public methodSaveHtmlPageToStream
Extracts HTML from specified page to stream.
Public methodSaveHtmlToFile(String)
Extracts HTML from the entire document to file.
Public methodSaveHtmlToFile(IListInt32, String)
Extracts HTML from specified pages to file.
Public methodSaveHtmlToFile(String, String)
Extracts HTML from specified page ranges to file.
Public methodSaveHtmlToFile(Int32, Int32, String)
Extracts HTML from specified page range to file.
Public methodSaveHtmlToStream(Stream)
Extracts HTML from the entire document to stream.
Public methodSaveHtmlToStream(IListInt32, Stream)
Extracts HTML from specified pages to stream.
Public methodSaveHtmlToStream(String, Stream)
Extracts HTML from specified page ranges to stream.
Public methodSaveHtmlToStream(Int32, Int32, Stream)
Extracts HTML from specified page range to stream.
Public methodSetExtractionArea(RectangleF) (Inherited from BaseExtractor.)
Public methodSetExtractionArea(Double, Double, Double, Double) (Inherited from BaseExtractor.)
Public methodSetExtractionArea(Single, Single, Single, Single) (Inherited from BaseExtractor.)
Public methodToString (Inherited from Object.)
Top
See Also

Reference


Copyright © 2016 - 2021 ByteScout