Package org.htmlunit.html.parser
Interface HTMLParser
- All Known Implementing Classes:
HtmlUnitNekoHtmlParser
public interface HTMLParser
Interface for the parser used to parse HTML into an HtmlUnit-specific DOM (HU-DOM) tree.
-
Method Summary
Modifier and Type | Method | Description
ElementFactory | getElementFactory(SgmlPage page, String namespaceURI, String qualifiedName, boolean insideSvg, boolean svgSupport) | INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK. Returns the pre-registered element factory corresponding to the specified tag, or an UnknownElementFactory.
ElementFactory | getFactory(String tagName) | INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK.
ElementFactory | getSvgFactory() | INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK.
void | parse(WebResponse webResponse, HtmlPage page, boolean xhtml, boolean createdByJavascript) | Parses the WebResponse into an object tree representation.
void | parseFragment(DomNode parent, String source) | Parses the HTML content from the given string into an object tree representation.
void | parseFragment(DomNode parent, DomNode context, String source, boolean createdByJavascript) | Parses the HTML content from the given string into an object tree representation.
-
Method Details
-
getFactory
ElementFactory getFactory(String tagName)
INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK.
- Parameters:
tagName - an HTML element tag name
- Returns:
a factory for creating HtmlElements representing the given tag
-
getSvgFactory
ElementFactory getSvgFactory()
INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK.
- Returns:
a factory for creating SvgElements
-
getElementFactory
ElementFactory getElementFactory(SgmlPage page, String namespaceURI, String qualifiedName, boolean insideSvg, boolean svgSupport)
INTERNAL API - SUBJECT TO CHANGE AT ANY TIME - USE AT YOUR OWN RISK.
Returns the pre-registered element factory corresponding to the specified tag, or an UnknownElementFactory.
- Parameters:
page - the page
namespaceURI - the namespace URI
qualifiedName - the qualified name
insideSvg - whether the node is inside an SVG node
svgSupport - true if called from JavaScript createElementNS
- Returns:
the pre-registered element factory corresponding to the specified tag, or an UnknownElementFactory
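
The "pre-registered factory, or an unknown-element fallback" contract described above can be illustrated with a small self-contained sketch. It does not use HtmlUnit: `SketchFactory`, `FactoryLookupSketch`, the registered tag names, and the lower-casing of the name are all hypothetical choices made for illustration, and the real lookup may also take the namespace and SVG flags into account.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

/** Hypothetical stand-in for an element factory; not HtmlUnit's ElementFactory. */
interface SketchFactory {
    String describe(String qualifiedName);
}

/** Hypothetical registry: return the registered factory, else a shared fallback. */
final class FactoryLookupSketch {
    private static final Map<String, SketchFactory> REGISTERED = new HashMap<>();

    /** Fallback playing the role of an "unknown element" factory in this sketch. */
    static final SketchFactory UNKNOWN = name -> "unknown element <" + name + ">";

    static {
        // Tags registered up front (illustrative only).
        REGISTERED.put("div", name -> "block element <" + name + ">");
        REGISTERED.put("span", name -> "inline element <" + name + ">");
    }

    private FactoryLookupSketch() { }

    static SketchFactory lookup(String qualifiedName) {
        // Assumption of this sketch: tag names are matched case-insensitively.
        SketchFactory factory = REGISTERED.get(qualifiedName.toLowerCase(Locale.ROOT));
        return factory != null ? factory : UNKNOWN;
    }

    public static void main(String[] args) {
        System.out.println(lookup("div").describe("div"));
        System.out.println(lookup("blink").describe("blink"));
    }
}
```

Returning a shared fallback object instead of `null` means callers never need a null check before creating an element, which is one reason a lookup of this kind is often written this way.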
-
parseFragment
void parseFragment(DomNode parent, String source) throws SAXException, IOException
Parses the HTML content from the given string into an object tree representation.
- Parameters:
parent - the parent for the new nodes
source - the (X)HTML to be parsed
- Throws:
SAXException - if a SAX error occurs
IOException - if an IO error occurs
-
parseFragment
void parseFragment(DomNode parent, DomNode context, String source, boolean createdByJavascript) throws SAXException, IOException
Parses the HTML content from the given string into an object tree representation.
- Parameters:
parent - the node to which the newly parsed nodes are added
context - the context node used to build the fragment context stack
source - the (X)HTML to be parsed
createdByJavascript - if true, the (script) tag was created by JavaScript
- Throws:
SAXException - if a SAX error occurs
IOException - if an IO error occurs
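
This reference does not say how the two parseFragment overloads relate. One plausible arrangement, shown here purely as an assumption, is that the two-argument form delegates to the four-argument form with the parent reused as the context and createdByJavascript set to false. The sketch is self-contained and does not use HtmlUnit: nodes are modelled as plain strings, and `FragmentParserSketch` and `RecordingParserSketch` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical parser shape; nodes are plain Strings here, not HtmlUnit's DomNode. */
interface FragmentParserSketch {
    void parseFragment(String parent, String context, String source, boolean createdByJavascript);

    // Assumed convenience overload: the parent doubles as the context,
    // and the fragment is treated as not created by JavaScript.
    default void parseFragment(String parent, String source) {
        parseFragment(parent, parent, source, false);
    }
}

/** Records each four-argument call so the delegation can be observed. */
final class RecordingParserSketch implements FragmentParserSketch {
    final List<String> calls = new ArrayList<>();

    @Override
    public void parseFragment(String parent, String context, String source, boolean createdByJavascript) {
        calls.add(parent + "|" + context + "|" + source + "|" + createdByJavascript);
    }

    public static void main(String[] args) {
        RecordingParserSketch parser = new RecordingParserSketch();
        parser.parseFragment("body", "<p>hi</p>");
        System.out.println(parser.calls);
    }
}
```

Under this assumption, callers that have no separate context node and are not acting on behalf of a script can use the shorter overload and get the same behaviour as the longer one.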
-
parse
void parse(WebResponse webResponse, HtmlPage page, boolean xhtml, boolean createdByJavascript) throws IOException
Parses the WebResponse into an object tree representation.
- Parameters:
webResponse - the response data
page - the HtmlPage to which the nodes are added
xhtml - if true, use the XHTML parser
createdByJavascript - if true, the (script) tag was created by JavaScript
- Throws:
IOException - if there is an IO error
-