Package org.htmlcleaner
Class descriptions:

Base token interface.
Base class for all tokens.
Browser compact XML serializer - creates resulting XML by stripping whitespace wherever possible, but preserving a single whitespace character where at least one exists.
Contains information about nodes that were closed due to their child nodes.
Properties defining the cleaner's behaviour.
Contains the transformation collection.
Thread-safe handling of private instance variables from HtmlCleaner.
Command line usage class.
HTML comment token.
Compact HTML serializer - creates resulting HTML by stripping whitespace wherever possible.
Compact XML serializer - creates resulting XML by stripping whitespace.
Configuration file tag provider - reads an XML file in the specified format and creates a Tag Provider.
HTML text token.
The default tag provider for HTML Cleaner. Note that this is no longer generated from XML; see https://sourceforge.net/p/htmlcleaner/bugs/81/
Most HTML 4 elements permitted within the BODY are classified as either block-level elements or inline elements.
HTML doctype token.
DOM serializer - creates an XML DOM.
HTML tag end token.
Main HtmlCleaner class.
General HtmlCleaner runtime exception.
Support for ANT.
Marker interface denoting nodes of the document tree.
Abstract HTML serializer - contains common logic for descendants.
Main HTML tokenizer.
Provides a set of TagInfo instances.
JDom serializer - creates an XML JDom instance out of the TagNode.
Contains the MathML tags to use with Html5 tags.
Nesting state - wrapper for the current HtmlCleaner cleaning state, keeping together the set of open tags and breaks in the current state.
Contains information and methods for managing the list of open but unhandled tags.
Pretty HTML serializer - creates resulting HTML with indented lines.
Pretty XML serializer - creates resulting XML with indented lines.
A TagNode that only really holds whitespace or comments - allows using ContentNode in places where a TagNode is expected.
Basic abstract serializer - contains common logic for descendants (methods writeXXX()).
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
Simple XML serializer - creates resulting XML without indenting lines.
Contains a map of the special entities used in HTML and their Unicode values.
Contains information about a single HTML tag. It also contains rules for tag balancing.
XML node tag - basic node of the cleaned HTML tree.
Defines an action to be performed on TagNodes.
Contains information about a single open tag.
HTML tag token - descendants are the start token (TagNode) and the end token (EndTagToken).
Describes how a specified tag is transformed to another one, or is ignored during parsing.
A traversal-based serializer for DOM; used to avoid recursion and stack overflow for large HTML documents.
Common utilities.
Abstract XML serializer - contains common logic for descendants.
Depth-first node traverser.
Node visitor interface.
Utility for searching the cleaned document tree with XPath expressions.
Exception that can occur during XPather evaluation.
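
The whitespace rule in the browser compact XML serializer description above (strip whitespace wherever possible, but keep a single whitespace character where at least one exists) can be illustrated with a small self-contained sketch. This is not the library's implementation. The class name WhitespaceCollapseSketch and the method collapse are hypothetical names chosen for illustration, and treating "whitespace" as anything accepted by Character.isWhitespace is an assumption.

```java
// Hypothetical sketch, not part of org.htmlcleaner: shows only the
// "keep one whitespace where at least one exists" rule on a plain string.
public class WhitespaceCollapseSketch {

    // Replaces every run of whitespace characters with a single space.
    // Text that contains no whitespace is returned unchanged.
    public static String collapse(String text) {
        StringBuilder out = new StringBuilder(text.length());
        boolean inWhitespaceRun = false;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                // Emit one space for the first character of a run, skip the rest.
                if (!inWhitespaceRun) {
                    out.append(' ');
                    inWhitespaceRun = true;
                }
            } else {
                out.append(c);
                inWhitespaceRun = false;
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Brackets make the preserved leading and trailing space visible.
        System.out.println("[" + collapse("  Hello \n\t world  ") + "]");
    }
}
```

Unlike a plain trim, the leading and trailing runs each survive as one space, which matters when the text sits between inline elements and a browser would otherwise join adjacent words.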