A B C D E F G H I J K L M N O P R S T U V W X _ 
All Classes All Packages

A

addAttribute(String, String) - Method in class org.htmlcleaner.EndTagToken
 
addAttribute(String, String) - Method in class org.htmlcleaner.TagNode
Adds specified attribute to this tag or overrides existing one.
addAttribute(String, String) - Method in class org.htmlcleaner.TagToken
 
addAttributePatternTransformation(Pattern, String) - Method in class org.htmlcleaner.TagTransformation
 
addAttributePatternTransformation(Pattern, Pattern, String) - Method in class org.htmlcleaner.TagTransformation
 
addAttributePatternTransformation(AttributeTransformation) - Method in class org.htmlcleaner.TagTransformation
 
addAttributesToTag(TagNode, Map<String, String>) - Method in class org.htmlcleaner.HtmlCleaner
Add attributes from specified map to the specified tag.
addAttributeTransformation(String) - Method in class org.htmlcleaner.TagTransformation
Adds new attribute transformation in which destination attrbute will not exists (simply removes it from list of attributes).
addAttributeTransformation(String, String) - Method in class org.htmlcleaner.TagTransformation
Adds new attribute transformation to this tag transformation.
addBreak(TagPos, TagPos) - Method in class org.htmlcleaner.ChildBreaks
Adds the break info to the top of the stacks.
addChild(Object) - Method in class org.htmlcleaner.TagNode
 
addChildren(List) - Method in class org.htmlcleaner.TagNode
Add all elements from specified list to this node.
addGlobalTransformation(AttributeTransformation) - Method in class org.htmlcleaner.CleanerTransformations
 
addHtmlModificationListener(HtmlModificationListener) - Method in class org.htmlcleaner.CleanerProperties
Adds a listener to the list of objects that will be notified about changes that cleaner does during cleanup process.
addIfNeededToPruneSet(TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
addItemForMoving(Object) - Method in class org.htmlcleaner.TagNode
 
addNamespaceDeclaration(String, String) - Method in class org.htmlcleaner.TagNode
Adds namespace declaration to the node
addNewlineToHeadAndBody - Variable in class org.htmlcleaner.CleanerProperties
"cause the cleaner cannot keep track of whitespace at that level", there are 2 lists built: one for the head , one for the body.
addPossibleHeadCandidate(TagInfo, TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Checks if specified tag with specified info is candidate for moving to head section.
addPruneNode(TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
addPruneTagNodeCondition(ITagNodeCondition) - Method in class org.htmlcleaner.CleanerProperties
Adds the condition to existing prune tag set.
addSavedAsContent() - Method in class org.htmlcleaner.HtmlTokenizer
 
addTag(String, TagInfo, int, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
 
addTagNameConditions(Set<ITagNodeCondition>, String) - Method in class org.htmlcleaner.CleanerProperties
 
addText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
addToken(BaseToken) - Method in class org.htmlcleaner.HtmlTokenizer
 
addTransformation(TagTransformation) - Method in class org.htmlcleaner.CleanerTransformations
Adds specified tag transformation to the collection.
advancedxmlescape - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
advancedXmlEscape - Variable in class org.htmlcleaner.CleanerProperties
If this parameter is set to true, ampersand sign (&) that proceeds valid XML character sequences (&XXX;) will not be escaped with &XXX;
afterTagLineBreakNeeded - Variable in enum org.htmlcleaner.Display
 
all - org.htmlcleaner.ContentType
 
allowhtmlinsideattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
allowHtmlInsideAttributes - Variable in class org.htmlcleaner.CleanerProperties
 
allowInvalidAttributeNames - Variable in class org.htmlcleaner.CleanerProperties
 
allowInvalidAttributeNames - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
allowmultiwordattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
allowMultiWordAttributes - Variable in class org.htmlcleaner.CleanerProperties
 
allowsAnything() - Method in class org.htmlcleaner.TagInfo
 
allowsBody() - Method in class org.htmlcleaner.TagInfo
 
allowsItem(BaseToken) - Method in class org.htmlcleaner.TagInfo
 
allowTags - Variable in class org.htmlcleaner.CleanerProperties
 
allowTagSet - Variable in class org.htmlcleaner.CleanerProperties
the list of allowed tags (whitelist approach v.
allowTagSet - Variable in class org.htmlcleaner.CleanTimeValues
 
allTags - Variable in class org.htmlcleaner.CleanTimeValues
 
alwaysOutput - org.htmlcleaner.OptionalOutput
Always outputed, if information is not supplied in the source a default is created.
ampNcr - Static variable in class org.htmlcleaner.Utils
 
any - org.htmlcleaner.Display
The following elements may be used as either block-level elements or inline elements.
applyTagTransformations(Map<String, String>) - Method in class org.htmlcleaner.TagTransformation
 
areCopiedTokensEqual(TagNode, TagNode) - Static method in class org.htmlcleaner.HtmlCleaner
Determines if two copied tokens are equal.
ASCII_CHAR - Static variable in class org.htmlcleaner.Utils
 
assumedNamespace - Variable in class org.htmlcleaner.TagInfo
 
assumedNamespacePrefix - Variable in class org.htmlcleaner.TagInfo
 
attName - Variable in class org.htmlcleaner.conditional.TagNodeAttExistsCondition
 
attName - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
 
attNamePattern - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
 
attNameRegex - Variable in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
 
attributePatternTransformations - Variable in class org.htmlcleaner.TagTransformation
 
attributes - Variable in class org.htmlcleaner.TagNode
 
attributesToLowerCase() - Method in class org.htmlcleaner.TagNode
Returns a copy of the set of attributes for this node with lowercase names.
AttributeTransformation - Interface in org.htmlcleaner
 
AttributeTransformationPatternImpl - Class in org.htmlcleaner
 
AttributeTransformationPatternImpl(String, String, String) - Constructor for class org.htmlcleaner.AttributeTransformationPatternImpl
 
AttributeTransformationPatternImpl(Pattern, Pattern, String) - Constructor for class org.htmlcleaner.AttributeTransformationPatternImpl
 
attributeTransformations - Variable in class org.htmlcleaner.TagTransformation
 
attributeValue() - Method in class org.htmlcleaner.HtmlTokenizer
Parses a single tag attribute - it is expected to be in one of the forms: name=value name="value" name='value' name
attValue - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
 
attValuePattern - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
 
attValueRegex - Variable in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
 
autoGenerated - Variable in class org.htmlcleaner.TagNode
Used to indicate a start tag that was auto generated because TagInfo.isContinueAfter(String)(closedTag.getName()) returned true For example,

B

BaseHtmlNode - Class in org.htmlcleaner
 
BaseHtmlNode() - Constructor for class org.htmlcleaner.BaseHtmlNode
 
BaseToken - Interface in org.htmlcleaner
Base token interface.
BaseTokenImpl - Class in org.htmlcleaner
Base class for all tokens.
BaseTokenImpl() - Constructor for class org.htmlcleaner.BaseTokenImpl
 
BaseTokenImpl(int, int) - Constructor for class org.htmlcleaner.BaseTokenImpl
 
basicElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
bchomp(String) - Static method in class org.htmlcleaner.Utils
Removes the first newline and last newline (if present) of a string
BEGIN_CDATA - Static variable in class org.htmlcleaner.CData
 
belongsTo - Variable in class org.htmlcleaner.TagInfo
 
BelongsTo - Enum in org.htmlcleaner
 
BelongsTo(String) - Constructor for enum org.htmlcleaner.BelongsTo
 
blank - Variable in class org.htmlcleaner.ContentNode
 
block - org.htmlcleaner.Display
Block-level elements typically contain inline elements and other block-level elements.
BODY - org.htmlcleaner.BelongsTo
 
bodyNode - Variable in class org.htmlcleaner.CleanTimeValues
 
bodyNode - Variable in class org.htmlcleaner.ProxyTagNode
 
BOOL_ATT_EMPTY - Static variable in class org.htmlcleaner.CleanerProperties
 
BOOL_ATT_SELF - Static variable in class org.htmlcleaner.CleanerProperties
 
BOOL_ATT_TRUE - Static variable in class org.htmlcleaner.CleanerProperties
 
booleanAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
 
booleanatts - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
BR_TAG - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
 
BR_TAG - Static variable in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
 
breakingTags - Variable in class org.htmlcleaner.ChildBreaks
 
BrowserCompactXmlSerializer - Class in org.htmlcleaner
Browser compact XML serializer - creates resulting XML by stripping whitespaces wherever possible, but preserving single whitespace where at least one exists.
BrowserCompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.BrowserCompactXmlSerializer
 

C

C0 - Static variable in class org.htmlcleaner.XPather
 
C9 - Static variable in class org.htmlcleaner.XPather
 
calculateRootNode(CleanTimeValues, Set<String>) - Method in class org.htmlcleaner.HtmlCleaner
Assigns root node to internal variable and adds neccessery xmlns attributes if cleaner is namespace-aware.
CD - Static variable in class org.htmlcleaner.XPather
 
cdata() - Method in class org.htmlcleaner.HtmlTokenizer
 
CData - Class in org.htmlcleaner
 
CData(String) - Constructor for class org.htmlcleaner.CData
 
characters(char[], int, int) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
charset - Variable in class org.htmlcleaner.CleanerProperties
 
charValue() - Method in class org.htmlcleaner.SpecialEntity
 
checkSublist(int, int, List) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
 
childBreaks - Variable in class org.htmlcleaner.NestingState
 
ChildBreaks - Class in org.htmlcleaner
Contains information about nodes that were closed due to their child nodes.
ChildBreaks() - Constructor for class org.htmlcleaner.ChildBreaks
 
children - Variable in class org.htmlcleaner.TagNode
 
childTags - Variable in class org.htmlcleaner.TagInfo
 
chomp(String) - Static method in class org.htmlcleaner.Utils
Removes the last newline (if present) of a string
className - Static variable in class org.htmlcleaner.CommandLine
 
clean(File) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(File, String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(InputStream) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(InputStream, String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(Reader) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(Reader, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Basic version of the cleaning call.
clean(String) - Method in class org.htmlcleaner.DoctypeToken
 
clean(String) - Method in class org.htmlcleaner.HtmlCleaner
 
clean(URL) - Method in class org.htmlcleaner.HtmlCleaner
Deprecated.
clean(URL, String) - Method in class org.htmlcleaner.HtmlCleaner
Deprecated.
cleaner - Variable in class org.htmlcleaner.HtmlTokenizer
 
CleanerProperties - Class in org.htmlcleaner
Properties defining cleaner's behaviour
CleanerProperties() - Constructor for class org.htmlcleaner.CleanerProperties
 
CleanerProperties(ITagInfoProvider) - Constructor for class org.htmlcleaner.CleanerProperties
 
cleanerTransformations - Variable in class org.htmlcleaner.CleanerProperties
 
CleanerTransformations - Class in org.htmlcleaner
Contains transformation collection.
CleanerTransformations() - Constructor for class org.htmlcleaner.CleanerTransformations
 
CleanerTransformations(Map) - Constructor for class org.htmlcleaner.CleanerTransformations
 
cleanTimeValues - Variable in class org.htmlcleaner.HtmlTokenizer
 
CleanTimeValues - Class in org.htmlcleaner
This class is for thread-safe handling of private instance variables from HtmlCleaner
CleanTimeValues() - Constructor for class org.htmlcleaner.CleanTimeValues
 
clear() - Method in class org.htmlcleaner.CleanerTransformations
 
CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
 
CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
 
CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
 
CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
 
CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
 
CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
 
CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.MathMLTagProvider
 
closeAll(List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Close all unclosed tags if there are any.
closedByChildBreak - Variable in class org.htmlcleaner.ChildBreaks
 
closeSnippet(List, TagPos, Object, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Forced closing
closeTag - Variable in class org.htmlcleaner.TagInfo
 
CloseTag - Enum in org.htmlcleaner
 
CloseTag(boolean, boolean) - Constructor for enum org.htmlcleaner.CloseTag
 
CM - Static variable in class org.htmlcleaner.XPather
 
col - Variable in class org.htmlcleaner.BaseTokenImpl
 
collectNamespacePrefixesOnPath(Set<String>) - Method in class org.htmlcleaner.TagNode
Collect all prefixes in namespace declarations up the path to the document root from the specified node
CommandLine - Class in org.htmlcleaner
Command line usage class.
CommandLine() - Constructor for class org.htmlcleaner.CommandLine
 
comment - Variable in class org.htmlcleaner.ProxyTagNode
 
comment() - Method in class org.htmlcleaner.HtmlTokenizer
 
CommentNode - Class in org.htmlcleaner
HTML comment token.
CommentNode(String) - Constructor for class org.htmlcleaner.CommentNode
 
CompactHtmlSerializer - Class in org.htmlcleaner
Compact HTML serializer - creates resulting HTML by stripping whitespaces wherever possible.
CompactHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactHtmlSerializer
 
CompactXmlSerializer - Class in org.htmlcleaner
Compact XML serializer - creates resulting XML by stripping whitespaces.
CompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactXmlSerializer
 
compileUnicodePattern(String) - Static method in class org.htmlcleaner.Utils
 
ConfigFileTagProvider - Class in org.htmlcleaner
Configuration file tag provider - reads XML file in specified format and creates a Tag Provider.
ConfigFileTagProvider() - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(File) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(URL) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider(InputSource) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
 
ConfigFileTagProvider.ConfigParser - Class in org.htmlcleaner
SAX parser for tag configuration files.
ConfigParser(Map) - Constructor for class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
containsEndCData() - Method in class org.htmlcleaner.HtmlTokenizer
 
content - Variable in class org.htmlcleaner.CommentNode
 
content - Variable in class org.htmlcleaner.ContentNode
 
content() - Method in class org.htmlcleaner.HtmlTokenizer
 
ContentNode - Class in org.htmlcleaner
HTML text token.
ContentNode(String) - Constructor for class org.htmlcleaner.ContentNode
 
contentType - Variable in class org.htmlcleaner.TagInfo
 
ContentType - Enum in org.htmlcleaner
 
ContentType(String) - Constructor for enum org.htmlcleaner.ContentType
 
continueAfterTags - Variable in class org.htmlcleaner.TagInfo
 
convert_To_Entity_Name(String, boolean, boolean, boolean, StringBuilder, int) - Static method in class org.htmlcleaner.Utils
 
convertToUnicode(String, boolean, boolean, boolean, StringBuilder, int) - Static method in class org.htmlcleaner.Utils
 
copyTags - Variable in class org.htmlcleaner.TagInfo
 
CP - Static variable in class org.htmlcleaner.XPather
 
createDocument(TagNode) - Method in class org.htmlcleaner.DomBuilder
 
createDocument(TagNode) - Method in class org.htmlcleaner.DomSerializer
 
createDocumentNodes(List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
createDOM(TagNode) - Method in class org.htmlcleaner.DomSerializer
 
createDOM(TagNode) - Method in class org.htmlcleaner.TraversalDomSerializer
 
createElement(TagNode) - Method in class org.htmlcleaner.JDomSerializer
 
createJDom(TagNode) - Method in class org.htmlcleaner.JDomSerializer
 
createSubnodes(Element, List<? extends BaseToken>) - Method in class org.htmlcleaner.JDomSerializer
 
createSubnodes(Document, Element, List<? extends BaseToken>) - Method in class org.htmlcleaner.DomSerializer
Serialize a given HTML Cleaner node.
createTagNode(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
 
creatingHtmlDom - Variable in class org.htmlcleaner.XmlSerializer
 
CS - Static variable in class org.htmlcleaner.XPather
 
CSS_COMMENT_END - Static variable in class org.htmlcleaner.DomSerializer
 
CSS_COMMENT_END - Static variable in class org.htmlcleaner.JDomSerializer
 
CSS_COMMENT_START - Static variable in class org.htmlcleaner.DomBuilder
 
CSS_COMMENT_START - Static variable in class org.htmlcleaner.DomSerializer
 
CSS_COMMENT_START - Static variable in class org.htmlcleaner.JDomSerializer
 

D

dbCode - Variable in enum org.htmlcleaner.BelongsTo
 
dbCode - Variable in enum org.htmlcleaner.ContentType
 
DECIMAL - Static variable in class org.htmlcleaner.Utils
 
DEFAULT_CHARSET - Static variable in class org.htmlcleaner.CleanerProperties
 
DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyHtmlSerializer
 
DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyXmlSerializer
 
DefaultTagProvider - Class in org.htmlcleaner
This is the default tag provider for HTML Cleaner Note this is no longer generated from XML - see https://sourceforge.net/p/htmlcleaner/bugs/81/
DefaultTagProvider() - Constructor for class org.htmlcleaner.DefaultTagProvider
 
defineAllowedChildrenTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseBeforeCopyInsideTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseBeforeTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineCloseInsideCopyAfterTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineFatalTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineForbiddenTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineHigherLevelTags(String) - Method in class org.htmlcleaner.TagInfo
 
defineNamespaceDeclarations(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
 
defineRequiredEnclosingTags(String) - Method in class org.htmlcleaner.TagInfo
 
dependencyName - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
deprecated - Variable in class org.htmlcleaner.TagInfo
 
Deprecated - org.htmlcleaner.audit.ErrorType
The tag was deprecated and current cleaner mode doesn't allows this.
deserializeCdataEntities - Variable in class org.htmlcleaner.DomBuilder
 
deserializeCdataEntities - Variable in class org.htmlcleaner.DomSerializer
 
deserializeCdataEntities - Variable in class org.htmlcleaner.TraversalDomSerializer
 
deserializeCdataEntities(String) - Method in class org.htmlcleaner.DomBuilder
 
deserializeCdataEntities(String) - Method in class org.htmlcleaner.DomSerializer
 
deserializeEntities - Variable in class org.htmlcleaner.CleanerProperties
 
deserializeEntities(String, boolean) - Static method in class org.htmlcleaner.Utils
 
dest - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
destinationElement - Variable in class org.htmlcleaner.DomBuilder
 
destTag - Variable in class org.htmlcleaner.TagTransformation
 
display - Variable in class org.htmlcleaner.TagInfo
 
Display - Enum in org.htmlcleaner
Most HTML 4 elements permitted within the BODY are classified as either block-level elements or inline elements.
Display(boolean, boolean) - Constructor for enum org.htmlcleaner.Display
 
doctype() - Method in class org.htmlcleaner.HtmlTokenizer
 
docType - Variable in class org.htmlcleaner.TagNode
 
DoctypeToken - Class in org.htmlcleaner
HTML doctype token.
DoctypeToken(String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
 
DoctypeToken(String, String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
 
document - Variable in class org.htmlcleaner.DomBuilder
 
DomBuilder - Class in org.htmlcleaner
 
DomBuilder(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.DomBuilder
 
DomSerializer - Class in org.htmlcleaner
DOM serializer - creates xml DOM.
DomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.DomSerializer
 
DomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.DomSerializer
 
DomSerializer(CleanerProperties, boolean, boolean) - Constructor for class org.htmlcleaner.DomSerializer
 
DomSerializer(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.DomSerializer
 
dontEscape(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
 
dontEscape(TagNode) - Method in class org.htmlcleaner.XmlSerializer
 
dontEscape(Element) - Method in class org.htmlcleaner.DomSerializer
encapsulate content with <[CDATA[ ]]> for things like script and style elements

E

editTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 edits tags (2 total)
embeddedContentTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
 
END_CDATA - Static variable in class org.htmlcleaner.CData
 
endElement(String, String, String) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
endTagPermitted - Variable in enum org.htmlcleaner.CloseTag
 
EndTagToken - Class in org.htmlcleaner
HTML tag end token.
EndTagToken() - Constructor for class org.htmlcleaner.EndTagToken
 
EndTagToken(String) - Constructor for class org.htmlcleaner.EndTagToken
 
entities - Variable in class org.htmlcleaner.SpecialEntities
key is the SpecialEntity.getKey() ( i.e.
entitiesByUnicodeCharcode - Variable in class org.htmlcleaner.SpecialEntities
Key is the Integer returned by SpecialEntity.intValue()
ErrorType - Enum in org.htmlcleaner.audit
Possible error codes (read messages) that cleaner uses to inform clients about reasons/actions that modification involves.
ErrorType() - Constructor for enum org.htmlcleaner.audit.ErrorType
 
escape - Variable in class org.htmlcleaner.SimpleHtmlSerializer
 
escapedXmlString - Variable in class org.htmlcleaner.SpecialEntity
 
escapeHtml(String, CleanerProperties) - Static method in class org.htmlcleaner.Utils
Escapes HTML string
escapeText(String) - Method in class org.htmlcleaner.HtmlSerializer
 
escapeXml - Variable in class org.htmlcleaner.DomBuilder
 
escapeXml - Variable in class org.htmlcleaner.DomSerializer
Whether XML entities should be escaped or not.
escapeXml - Variable in class org.htmlcleaner.JDomSerializer
 
escapeXml - Variable in class org.htmlcleaner.TraversalDomSerializer
Whether XML entities should be escaped or not.
escapeXml(String) - Method in class org.htmlcleaner.XmlSerializer
 
escapeXml(String, boolean, boolean, boolean, boolean, boolean, boolean) - Static method in class org.htmlcleaner.Utils
change notes: 1) convert ascii characters encoded using &#xx; format to the ascii characters -- may be an attempt to slip in malicious html 2) convert &#xxx; format characters to " style representation if available for the character.
escapeXml(String, boolean, boolean, boolean, boolean, boolean, boolean, boolean) - Static method in class org.htmlcleaner.Utils
change notes: 1) convert ascii characters encoded using &#xx; format to the ascii characters -- may be an attempt to slip in malicious html 2) convert &#xxx; format characters to " style representation if available for the character.
escapeXml(String, CleanerProperties, boolean) - Static method in class org.htmlcleaner.Utils
Escapes XML string.
evaluateAgainst(Collection, int, int, boolean, int, int, boolean, Collection) - Method in class org.htmlcleaner.XPather
 
evaluateAgainstNode(TagNode) - Method in class org.htmlcleaner.XPather
Main public method for this class - a way to execute XPath expression against specified TagNode instance.
evaluateFunction(Collection, int, int, int, int, boolean) - Method in class org.htmlcleaner.XPather
Evaluates specified function.
evaluateLogic(Collection, Collection, String) - Method in class org.htmlcleaner.XPather
Evaluates logic operation on two collections.
evaluateTemplate(String, Map<String, String>) - Method in class org.htmlcleaner.TagTransformation
Evaluates string template for specified map of variables.
evaluateXPath(String) - Method in class org.htmlcleaner.TagNode
Evaluates XPath expression on give node.
execute() - Method in class org.htmlcleaner.HtmlCleanerForAnt
Implementation of Ant task execution.
extractCharCode(String, int, boolean, StringBuilder) - Static method in class org.htmlcleaner.Utils
(earlier code was failing on this) - ŠA; is converted by FF to 3 characters: Š + 'A' + ';' �x138A; is converted by FF to 6? 7? characters: � 'x'+'1'+'3'+ '8' + 'A' + ';' #0 is displayed kind of weird ᎊ is a single character

F

factory - Variable in class org.htmlcleaner.JDomSerializer
 
FatalTagMissing - org.htmlcleaner.audit.ErrorType
Tag which existence is critical for the current is missing.
fatalTags - Variable in class org.htmlcleaner.TagInfo
 
filterByCondition(Collection, int, int) - Method in class org.htmlcleaner.XPather
Filter nodes satisfying the condition
findClosingIndex(int, int) - Method in class org.htmlcleaner.XPather
 
findElement(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
Finds first element in the tree that satisfy specified condition.
findElementByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
findElementByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
findElementHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
findFirstTagPos() - Method in class org.htmlcleaner.OpenTags
 
findMatchingTagNodes(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
Get all elements in the tree that satisfy specified condition.
findTag(String, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
 
findTagToPlaceRubbish() - Method in class org.htmlcleaner.OpenTags
 
fireConditionModification(ITagNodeCondition, TagNode) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
Fired when cleaner modifies html due to ITagNodeCondition match.
fireConditionModification(ITagNodeCondition, TagNode) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
fireConditionModification(ITagNodeCondition, TagNode) - Method in class org.htmlcleaner.CleanerProperties
 
fireHtmlError(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
Fired when cleaner fixes some error in html syntax.
fireHtmlError(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
fireHtmlError(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
 
fireUglyHtml(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
Fired when cleaner fixes ugly html -- when syntax was correct but task was implemented by weird code.
fireUglyHtml(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
fireUglyHtml(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
 
fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
Fired when cleaner modifies html due to user specified rules.
fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
 
flatten(int, int) - Method in class org.htmlcleaner.XPather
 
flattenNestedList(List) - Method in class org.htmlcleaner.HtmlCleaner
Flattens a list of tagnodes
FLOW_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
Most elements that are used in the body of documents and applications are categorized as flow content.
forbidden - org.htmlcleaner.CloseTag
is not permitted
foreignMarkupFlagSet - Variable in class org.htmlcleaner.TagNode
This flag is set if foreignMarkup is set; if it is false it means that the tagnode tree has not been built and so it isn't known whether this node is a HTML node or foreign markup such as SVG.
formattingElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
formElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
formTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 forms tags (15 total)
fullUrl(String, String) - Static method in class org.htmlcleaner.Utils
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.

G

generateCode - Variable in class org.htmlcleaner.ConfigFileTagProvider
 
getAllChildren() - Method in class org.htmlcleaner.TagNode
 
getAllElements(boolean) - Method in class org.htmlcleaner.TagNode
 
getAllElementsList(boolean) - Method in class org.htmlcleaner.TagNode
 
getAllowTags() - Method in class org.htmlcleaner.CleanerProperties
 
getAllowTagSet() - Method in class org.htmlcleaner.CleanerProperties
 
getAllowTagSet(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
getAllTags(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
getAmpNcr() - Static method in class org.htmlcleaner.Utils
 
getArgValue(String[], String, String) - Static method in class org.htmlcleaner.CommandLine
 
getAsString(String) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, boolean) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, String) - Method in class org.htmlcleaner.Serializer
 
getAsString(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
 
getAssumedNamespace() - Method in class org.htmlcleaner.TagInfo
 
getAssumedNamespacePrefix() - Method in class org.htmlcleaner.TagInfo
 
getAttributeByName(String) - Method in class org.htmlcleaner.TagNode
 
getAttributes() - Method in class org.htmlcleaner.TagNode
Returns the attributes of the tagnode.
getAttributesInLowerCase() - Method in class org.htmlcleaner.TagNode
Returns the attributes of the tagnode in lower case.
getAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
 
getBelongsTo() - Method in class org.htmlcleaner.TagInfo
 
getBooleanAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
 
getCharset() - Method in class org.htmlcleaner.CleanerProperties
 
getChildBreaks() - Method in class org.htmlcleaner.NestingState
 
getChildBreaks(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
getChildIndex(HtmlNode) - Method in class org.htmlcleaner.TagNode
 
getChildren() - Method in class org.htmlcleaner.TagNode
Deprecated.
use TagNode.getChildTagList(), will be refactored and possibly removed in future versions. TODO This method should be refactored because is does not properly match the commonly used Java's getter/setter strategy.
getChildTagList() - Method in class org.htmlcleaner.TagNode
 
getChildTags() - Method in class org.htmlcleaner.TagInfo
 
getChildTags() - Method in class org.htmlcleaner.TagNode
 
getCleanerTransformations() - Method in class org.htmlcleaner.CleanerProperties
 
getCol() - Method in interface org.htmlcleaner.BaseToken
 
getCol() - Method in class org.htmlcleaner.BaseTokenImpl
 
getCommentedContent() - Method in class org.htmlcleaner.CommentNode
 
getContent() - Method in class org.htmlcleaner.CData
 
getContent() - Method in class org.htmlcleaner.CommentNode
 
getContent() - Method in class org.htmlcleaner.ContentNode
 
getContent() - Method in class org.htmlcleaner.DoctypeToken
 
getContent() - Method in class org.htmlcleaner.ProxyTagNode
 
getContentType() - Method in class org.htmlcleaner.TagInfo
 
getContentWithoutStartAndEndTokens() - Method in class org.htmlcleaner.CData
 
getContentWithStartAndEndTokens() - Method in class org.htmlcleaner.CData
 
getContinueAfterTags() - Method in class org.htmlcleaner.TagInfo
 
getCopyTags() - Method in class org.htmlcleaner.TagInfo
 
getDbCode() - Method in enum org.htmlcleaner.BelongsTo
 
getDbCode() - Method in enum org.htmlcleaner.ContentType
 
getDecimalNCR() - Method in class org.htmlcleaner.SpecialEntity
 
getDestTag() - Method in class org.htmlcleaner.TagTransformation
 
getDisplay() - Method in class org.htmlcleaner.TagInfo
 
getDocType() - Method in class org.htmlcleaner.HtmlTokenizer
 
getDocType() - Method in class org.htmlcleaner.TagNode
 
getDocument() - Method in class org.htmlcleaner.DomBuilder
 
getElementList(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
Get all elements in the tree that satisfy specified condition.
getElementListByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementListByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementListHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElements(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByName(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getElementsByName(Collection, int, int, boolean, boolean) - Method in class org.htmlcleaner.XPather
For the given source collection and specified name, returns collection of subnodes or attribute values.
getElementsHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
 
getEscaped(boolean) - Method in class org.htmlcleaner.SpecialEntity
 
getEscapedValue() - Method in class org.htmlcleaner.SpecialEntity
 
getEscapedXmlString() - Method in class org.htmlcleaner.SpecialEntity
 
getFatalTags() - Method in class org.htmlcleaner.TagInfo
 
getHexNCR() - Method in class org.htmlcleaner.SpecialEntity
 
getHigherTags() - Method in class org.htmlcleaner.TagInfo
 
getHtmlString() - Method in class org.htmlcleaner.SpecialEntity
 
getHtmlVersion() - Method in class org.htmlcleaner.CleanerProperties
Return the html version
getHyphenReplacementInComment() - Method in class org.htmlcleaner.CleanerProperties
 
getIndent(int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getIndent(int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getInnerHtml(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
For the specified node, returns it's content as string.
getInvalidAttributeNamePrefix() - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
getInvalidXmlAttributeNamePrefix() - Method in class org.htmlcleaner.CleanerProperties
Get the prefix to use to try to make valid attribute names
getItemsToMove() - Method in class org.htmlcleaner.TagNode
 
getKey() - Method in class org.htmlcleaner.SpecialEntity
 
getLastBreakingTag() - Method in class org.htmlcleaner.ChildBreaks
 
getLastBreakingTagPosition() - Method in class org.htmlcleaner.ChildBreaks
 
getLastTagPos() - Method in class org.htmlcleaner.OpenTags
 
getMaxDepth() - Method in class org.htmlcleaner.CleanerProperties
 
getMaxEntityLength() - Method in class org.htmlcleaner.SpecialEntities
 
getMustCloseTags() - Method in class org.htmlcleaner.TagInfo
 
getName() - Method in class org.htmlcleaner.DoctypeToken
 
getName() - Method in class org.htmlcleaner.TagInfo
 
getName() - Method in class org.htmlcleaner.TagNode
 
getName() - Method in class org.htmlcleaner.TagToken
 
getNamespaceDeclarations() - Method in class org.htmlcleaner.TagNode
 
getNamespacePrefixes() - Method in class org.htmlcleaner.HtmlTokenizer
 
getNamespaceURIOnPath(String) - Method in class org.htmlcleaner.TagNode
 
getOpenTags() - Method in class org.htmlcleaner.NestingState
 
getOpenTags(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
getParent() - Method in class org.htmlcleaner.BaseHtmlNode
 
getParent() - Method in interface org.htmlcleaner.HtmlNode
 
getParent() - Method in class org.htmlcleaner.ProxyTagNode
 
getPart1() - Method in class org.htmlcleaner.DoctypeToken
 
getPart2() - Method in class org.htmlcleaner.DoctypeToken
 
getPart3() - Method in class org.htmlcleaner.DoctypeToken
Deprecated.
getPart4() - Method in class org.htmlcleaner.DoctypeToken
Deprecated.
getPermittedTags() - Method in class org.htmlcleaner.TagInfo
 
getPreferredChildTag() - Method in class org.htmlcleaner.TagInfo
 
getProperties() - Method in class org.htmlcleaner.HtmlCleaner
 
getPruneTags() - Method in class org.htmlcleaner.CleanerProperties
 
getPruneTagSet() - Method in class org.htmlcleaner.CleanerProperties
 
getPruneTagSet(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
getPublicId() - Method in class org.htmlcleaner.DoctypeToken
This will retrieve the public ID of an externally referenced DTD, or an empty String if none is referenced.
getRequiredParentTags() - Method in class org.htmlcleaner.TagInfo
 
getRow() - Method in interface org.htmlcleaner.BaseToken
 
getRow() - Method in class org.htmlcleaner.BaseTokenImpl
 
getSiblings() - Method in class org.htmlcleaner.BaseHtmlNode
 
getSiblings() - Method in interface org.htmlcleaner.HtmlNode
 
getSingleLineOfChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
getSingleLineOfChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
getSourceTag() - Method in class org.htmlcleaner.TagTransformation
 
getSpecialEntity(String) - Method in class org.htmlcleaner.SpecialEntities
 
getSpecialEntityByUnicode(int) - Method in class org.htmlcleaner.SpecialEntities
 
getSwitchArgument(String[], String) - Static method in class org.htmlcleaner.CommandLine
If the specified argument name exists without a value, return true.
getSystemId() - Method in class org.htmlcleaner.DoctypeToken
This will retrieve the system ID of an externally referenced DTD, or an empty String if none is referenced.
getTagInfo(String) - Method in class org.htmlcleaner.ConfigFileTagProvider
 
getTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
 
getTagInfo(String) - Method in class org.htmlcleaner.Html4TagProvider
 
getTagInfo(String) - Method in class org.htmlcleaner.Html5TagProvider
It returns the tag information.
getTagInfo(String) - Method in interface org.htmlcleaner.ITagInfoProvider
 
getTagInfo(String, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
getTagInfo(String, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Returns a TagInfo object for the specified tag name.
getTagInfoProvider() - Method in class org.htmlcleaner.CleanerProperties
 
getTagInfoProvider() - Method in class org.htmlcleaner.HtmlCleaner
 
getTagName(String) - Method in class org.htmlcleaner.CleanerTransformations
 
getTemplate() - Method in interface org.htmlcleaner.AttributeTransformation
 
getTemplate() - Method in class org.htmlcleaner.AttributeTransformationPatternImpl
 
getText() - Method in class org.htmlcleaner.TagNode
 
getToken() - Method in class org.htmlcleaner.ProxyTagNode
 
getTokenList() - Method in class org.htmlcleaner.HtmlTokenizer
 
getTransformation(String) - Method in class org.htmlcleaner.CleanerTransformations
 
getTransformations() - Method in class org.htmlcleaner.HtmlCleaner
 
getType() - Method in class org.htmlcleaner.DoctypeToken
This will retrieve an integer representing the identified DocType
getUseCdataFor() - Method in class org.htmlcleaner.CleanerProperties
 
getXmlAsString(TagNode) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use getAsString() instead.
getXmlAsString(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use getAsString() instead.
getXmlName(String) - Static method in class org.htmlcleaner.Utils
 
getXmlNSPrefix(String) - Static method in class org.htmlcleaner.Utils
 
getXmlVersion() - Method in class org.htmlcleaner.DomSerializer
 
globalTransformations - Variable in class org.htmlcleaner.CleanerTransformations
 
go() - Method in class org.htmlcleaner.HtmlTokenizer
 
go(int) - Method in class org.htmlcleaner.HtmlTokenizer
 
greek - Variable in class org.htmlcleaner.SpecialEntities
 
groupingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 grouping tags (14 total)

H

handleEndTagToken(BaseToken, ListIterator<BaseToken>, List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Process rules for a new end tag token in the HTML tree.
handleInterruption() - Method in class org.htmlcleaner.HtmlCleaner
Called whenever the thread is interrupted.
handleInterruption() - Method in class org.htmlcleaner.HtmlTokenizer
Called whenver the thread is interrupted.
handleInterruption() - Method in class org.htmlcleaner.TagNode
Called whenver the thread is interrupted.
handleStartTagToken(BaseToken, ListIterator<BaseToken>, List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Processes all the rules associated with a new opening tag in the HTML tree
hasAttribute(String) - Method in class org.htmlcleaner.TagNode
Checks existence of specified attribute.
hasAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
 
hasChildren() - Method in class org.htmlcleaner.TagNode
 
hasCopyTags() - Method in class org.htmlcleaner.TagInfo
 
hasIdAttributeSet(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
hasPermittedTags() - Method in class org.htmlcleaner.TagInfo
 
hasTransformationForTag(String) - Method in class org.htmlcleaner.CleanerTransformations
 
head(HtmlNode, int) - Method in class org.htmlcleaner.DomBuilder
 
head(HtmlNode, int) - Method in interface org.htmlcleaner.XmlVisitor
Callback for when a node is first visited.
HEAD - org.htmlcleaner.BelongsTo
 
HEAD_AND_BODY - org.htmlcleaner.BelongsTo
 
HeadlessTagNode(TagNode) - Constructor for class org.htmlcleaner.Serializer.HeadlessTagNode
 
headNode - Variable in class org.htmlcleaner.CleanTimeValues
 
HEX_RELAXED - Static variable in class org.htmlcleaner.Utils
 
HEX_STRICT - Static variable in class org.htmlcleaner.Utils
 
higherTags - Variable in class org.htmlcleaner.TagInfo
 
HTML_4 - Static variable in class org.htmlcleaner.HtmlCleaner
 
HTML_5 - Static variable in class org.htmlcleaner.HtmlCleaner
 
HTML4_0 - Static variable in class org.htmlcleaner.DoctypeToken
 
HTML4_01 - Static variable in class org.htmlcleaner.DoctypeToken
 
HTML4_01_FRAMESET - Static variable in class org.htmlcleaner.DoctypeToken
 
HTML4_01_STRICT - Static variable in class org.htmlcleaner.DoctypeToken
 
HTML4_01_TRANSITIONAL - Static variable in class org.htmlcleaner.DoctypeToken
 
Html4TagProvider - Class in org.htmlcleaner
 
Html4TagProvider() - Constructor for class org.htmlcleaner.Html4TagProvider
 
HTML5 - Static variable in class org.htmlcleaner.DoctypeToken
 
HTML5_LEGACY_TOOL_COMPATIBLE - Static variable in class org.htmlcleaner.DoctypeToken
 
Html5TagProvider - Class in org.htmlcleaner
 
Html5TagProvider() - Constructor for class org.htmlcleaner.Html5TagProvider
 
htmlCleaner - Variable in class org.htmlcleaner.OpenTags
 
HtmlCleaner - Class in org.htmlcleaner
Main HtmlCleaner class.
HtmlCleaner() - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates cleaner instance with default tag info provider,default version and default properties.
HtmlCleaner(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with default tag info provider and specified properties
HtmlCleaner(ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with specified tag info provider and default properties
HtmlCleaner(ITagInfoProvider, CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
Constructor - creates the instance with specified tag info provider and specified properties
HtmlCleanerException - Exception in org.htmlcleaner
General HtmlCleaner runtime exception.
HtmlCleanerException() - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(String) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(String, Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerException(Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
 
HtmlCleanerForAnt - Class in org.htmlcleaner
Support for ANT.
HtmlCleanerForAnt() - Constructor for class org.htmlcleaner.HtmlCleanerForAnt
 
HtmlModificationListener - Interface in org.htmlcleaner.audit
Implementors can be registered on CleanerProperties to receive notifications about modifications made by html cleaner.
HtmlModificationListenerLogger - Class in org.htmlcleaner.audit
 
HtmlModificationListenerLogger(Logger) - Constructor for class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
htmlModificationListeners - Variable in class org.htmlcleaner.CleanerProperties
 
htmlNode - Variable in class org.htmlcleaner.CleanTimeValues
 
HtmlNode - Interface in org.htmlcleaner
Marker interface denoting nodes of the document tree
HtmlSerializer - Class in org.htmlcleaner
Abstract HTML serializer - contains common logic for descendants.
HtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlSerializer
 
htmlSpecialEntity - Variable in class org.htmlcleaner.SpecialEntity
 
htmlString - Variable in class org.htmlcleaner.SpecialEntity
 
HtmlTokenizer - Class in org.htmlcleaner
Main HTML tokenizer.
HtmlTokenizer(HtmlCleaner, Reader, CleanTimeValues) - Constructor for class org.htmlcleaner.HtmlTokenizer
Constructor - creates instance of the parser with specified content.
htmlVersion - Variable in class org.htmlcleaner.CleanerProperties
 
hyphenreplacement - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
hyphenReplacementInComment - Variable in class org.htmlcleaner.CleanerProperties
 

I

ID_ATTRIBUTE_NAME - Static variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
identifier(boolean) - Method in class org.htmlcleaner.HtmlTokenizer
Parses an identifier from the current position.
ignoreqe - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
ignoreQuestAndExclam - Variable in class org.htmlcleaner.CleanerProperties
 
ignoreUntil(char) - Method in class org.htmlcleaner.HtmlTokenizer
 
imgElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
incharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
indents - Variable in class org.htmlcleaner.PrettyHtmlSerializer
 
indents - Variable in class org.htmlcleaner.PrettyXmlSerializer
 
indentString - Variable in class org.htmlcleaner.PrettyHtmlSerializer
 
indentString - Variable in class org.htmlcleaner.PrettyXmlSerializer
 
info - Variable in class org.htmlcleaner.TagPos
 
initCleanerTransformations(Map) - Method in class org.htmlcleaner.HtmlCleaner
 
inline - org.htmlcleaner.Display
Inline elements typically may only contain text and other inline elements.
insertChild(int, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node at specified position in array of children
insertChildAfter(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node in the list of children after specified child
insertChildBefore(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
Inserts specified node in the list of children before specified child
INSTANCE - Static variable in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
 
INSTANCE - Static variable in class org.htmlcleaner.DefaultTagProvider
 
INSTANCE - Static variable in class org.htmlcleaner.Html4TagProvider
 
INSTANCE - Static variable in class org.htmlcleaner.Html5TagProvider
 
INSTANCE - Static variable in class org.htmlcleaner.SpecialEntities
 
INSTANCE2 - Variable in class org.htmlcleaner.Html5TagProvider
 
intCode - Variable in class org.htmlcleaner.SpecialEntity
 
interactiveTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 Interactive tags (4 total)
intValue() - Method in class org.htmlcleaner.SpecialEntity
 
invalidAttributeNamePrefix - Variable in class org.htmlcleaner.CleanerProperties
 
invalidAttributeNamePrefix - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
isAddNewlineToHeadAndBody() - Method in class org.htmlcleaner.CleanerProperties
 
isAdvancedXmlEscape() - Method in class org.htmlcleaner.CleanerProperties
 
isAfterTagLineBreakNeeded() - Method in enum org.htmlcleaner.Display
 
isAllowedAsForeignMarkup(String, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Checks whether we can allow a tag as "foreign markup".
isAllowedInLastOpenTag(BaseToken, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
isAllowHtmlInsideAttributes() - Method in class org.htmlcleaner.CleanerProperties
 
isAllowInvalidAttributeNames() - Method in class org.htmlcleaner.CleanerProperties
If false, when outputting XML, if an attribute name is not valid, attempt to fix it by using a prefix and removing invalid characters.
isAllowInvalidAttributeNames() - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
isAllowMultiWordAttributes() - Method in class org.htmlcleaner.CleanerProperties
 
isAllRead() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if end of the content is reached.
isAtt(String) - Method in class org.htmlcleaner.XPather
Checks if token is attribute (starts with @)
isAutoGenerated() - Method in class org.htmlcleaner.TagNode
 
isBlank() - Method in class org.htmlcleaner.ContentNode
 
isBrNode(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
 
isCaseSensitive - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
 
isChar(char) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position is equal to specified char.
isChar(int, char) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position is equal to specified char.
isContentOrInline(Object) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
 
isContinueAfter(String) - Method in class org.htmlcleaner.TagInfo
 
isCopiedTokenEqualToNextThreeCopiedTokens(TagNode, ListIterator<BaseToken>) - Static method in class org.htmlcleaner.HtmlCleaner
Determines if a copied token is equal to the next 3 tokens in the iterator.
isCopy - Variable in class org.htmlcleaner.TagNode
Indicates that the node is a copy of another node.
isCopy() - Method in class org.htmlcleaner.TagNode
 
isCopy(String) - Method in class org.htmlcleaner.TagInfo
 
isCreatingHtmlDom() - Method in class org.htmlcleaner.XmlSerializer
 
isDeprecated() - Method in class org.htmlcleaner.TagInfo
 
isDeserializeEntities() - Method in class org.htmlcleaner.CleanerProperties
 
isElementIdentifierStartChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position can be identifier start.
isEmpty() - Method in class org.htmlcleaner.ChildBreaks
 
isEmpty() - Method in class org.htmlcleaner.OpenTags
 
isEmpty() - Method in class org.htmlcleaner.TagNode
 
isEmptyString(Object) - Static method in class org.htmlcleaner.Utils
 
isEmptyTag() - Method in class org.htmlcleaner.TagInfo
 
isEndTagPermitted() - Method in enum org.htmlcleaner.CloseTag
 
isFatalTag(String) - Method in class org.htmlcleaner.TagInfo
 
isFatalTagSatisfied(TagInfo, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Checks if open fatal tag is missing if there is a fatal tag for the specified tag.
isForbiddenAttribute(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
Override to add additional conditions.
isForbiddenTag(TagNode) - Method in class org.htmlcleaner.XmlSerializer
 
isForeignMarkup - Variable in class org.htmlcleaner.TagNode
This flag is set if we are using namespace aware setting, and the tagnode belongs to a non-HTML namespace.
isForeignMarkup() - Method in class org.htmlcleaner.TagNode
 
isFormed - Variable in class org.htmlcleaner.TagNode
 
isFormed() - Method in class org.htmlcleaner.TagNode
 
isFullUrl(String) - Static method in class org.htmlcleaner.Utils
Checks if specified link is full URL.
isFunctionCall(int, int) - Method in class org.htmlcleaner.XPather
Checks if tokens in specified range represents valid function call.
isHeadAndBodyTag() - Method in class org.htmlcleaner.TagInfo
 
isHeadOrBody(String) - Method in class org.htmlcleaner.XmlSerializer
 
isHeadTag() - Method in class org.htmlcleaner.TagInfo
 
isHigher(String) - Method in class org.htmlcleaner.TagInfo
 
isHtmlAttributeIdentifierChar() - Method in class org.htmlcleaner.HtmlTokenizer
 
isHtmlAttributeIdentifierChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
Check whether the character at the specified position in the stream is a valid character for part of an attribute identifier in HTML
isHtmlAttributeIdentifierStartChar() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position can be identifier start.
isHtmlElementIdentifier() - Method in class org.htmlcleaner.HtmlTokenizer
 
isHtmlElementIdentifier(int) - Method in class org.htmlcleaner.HtmlTokenizer
 
isHtmlSpecialEntity() - Method in class org.htmlcleaner.SpecialEntity
 
isIdentifier(String) - Method in class org.htmlcleaner.XPather
Checks if given string is valid identifier.
isIgnoreQuestAndExclam() - Method in class org.htmlcleaner.CleanerProperties
 
isKeepWhitespaceAndCommentsInHead() - Method in class org.htmlcleaner.CleanerProperties
 
isLeadingAndEndWhitespacesAllowed() - Method in enum org.htmlcleaner.Display
 
isMinimizedTagPermitted() - Method in enum org.htmlcleaner.CloseTag
 
isMinimizedTagPermitted() - Method in class org.htmlcleaner.TagInfo
 
isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
 
isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.XmlSerializer
 
isMustCloseTag(TagInfo) - Method in class org.htmlcleaner.TagInfo
 
isNamespacesAware() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitCdataOutsideScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitComments() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitDeprecatedTags() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitDoctypeDeclaration() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitHtmlEnvelope() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitUnknownTags() - Method in class org.htmlcleaner.CleanerProperties
 
isOmitXmlDeclaration() - Method in class org.htmlcleaner.CleanerProperties
 
isPreserveSourceAttributes() - Method in class org.htmlcleaner.TagTransformation
 
isPruned() - Method in class org.htmlcleaner.TagNode
 
isRecognizeUnicodeChars() - Method in class org.htmlcleaner.CleanerProperties
 
isRemovingNodeReasonablySafe(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
 
isReservedTag(String) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if specified tag name is one of the reserved tags: HTML, HEAD or BODY
isScriptOrStyle(TagNode) - Method in class org.htmlcleaner.Serializer
 
isScriptOrStyle(Element) - Method in class org.htmlcleaner.DomSerializer
 
isStartToken(Object) - Method in class org.htmlcleaner.HtmlCleaner
 
isTagStartOrEnd() - Method in class org.htmlcleaner.HtmlTokenizer
Not all '<' (lt) symbols mean tag start or end.
isToken(String, int) - Method in class org.htmlcleaner.XPather
 
isTranslateSpecialEntities() - Method in class org.htmlcleaner.CleanerProperties
 
isTransResCharsToNCR() - Method in class org.htmlcleaner.CleanerProperties
 
isTransSpecialEntitiesToNCR() - Method in class org.htmlcleaner.CleanerProperties
 
isTreatDeprecatedTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
 
isTreatUnknownTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
 
isTrimAttributeValues - Variable in class org.htmlcleaner.TagNode
This flag is set if attribute values should be trimmed.
isTrimAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
 
isTrimAttributeValues() - Method in class org.htmlcleaner.TagNode
 
isUnique() - Method in class org.htmlcleaner.TagInfo
 
isUseCdataFor(String) - Method in class org.htmlcleaner.CleanerProperties
 
isUseCdataForScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
 
isUseEmptyElementTags() - Method in class org.htmlcleaner.CleanerProperties
 
isValid() - Method in class org.htmlcleaner.DoctypeToken
 
isValidDouble(String) - Method in class org.htmlcleaner.XPather
 
isValidHtmlAttributeName(String) - Static method in class org.htmlcleaner.Utils
 
isValidInt(String, int) - Static method in class org.htmlcleaner.Utils
 
isValidInteger(String) - Static method in class org.htmlcleaner.XPather
 
isValidXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
Checks whether specified string can be valid tag name or attribute name in xml.
isValidXmlIdentifierStartChar(String) - Static method in class org.htmlcleaner.Utils
Determines whether the initial character of an identifier is valid for XML
isWhitespace() - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at current runtime position is whitespace.
isWhitespace(int) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if character at specified position is whitespace.
isWhitespaceString(Object) - Method in class org.htmlcleaner.CompactXmlSerializer
Checks whether specified object's string representation is empty string (containing of only whitespaces).
isWhitespaceString(Object) - Static method in class org.htmlcleaner.Utils
Checks whether specified object's string representation is empty string (containing of only whitespaces).
isXmlReservedCharacter(String) - Static method in class org.htmlcleaner.Utils
 
ITagInfoProvider - Interface in org.htmlcleaner
Provides set of TagInfo instances.
ITagNodeCondition - Interface in org.htmlcleaner.conditional
Used as base for different node checkers.
itemsToMove - Variable in class org.htmlcleaner.TagNode
 

J

JDomSerializer - Class in org.htmlcleaner
JDom serializer - creates xml JDom instance out of the TagNode.
JDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.JDomSerializer
 
JDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.JDomSerializer
 

K

keepWhitespaceAndCommentsInHead - Variable in class org.htmlcleaner.CleanerProperties
Tries to keep inside head all whitespace and comments that were originally there
key - Variable in class org.htmlcleaner.SpecialEntity
 

L

last - Variable in class org.htmlcleaner.OpenTags
 
layoutElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
lchomp(String) - Static method in class org.htmlcleaner.Utils
Removes the first newline (if present) of a string
leadingAndEndWhitespacesAllowed - Variable in enum org.htmlcleaner.Display
 
LINE_BREAK - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
 
linkElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
list - Variable in class org.htmlcleaner.OpenTags
 
listElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
log - Variable in class org.htmlcleaner.audit.HtmlModificationListenerLogger
 
logger - Static variable in class org.htmlcleaner.CommandLine
 
ltrim(String) - Method in class org.htmlcleaner.PrettyXmlSerializer
Trims specified string from left.
ltrim(String) - Static method in class org.htmlcleaner.Utils
Trims specified string from left.

M

main(String[]) - Static method in class org.htmlcleaner.CommandLine
 
main(String[]) - Static method in class org.htmlcleaner.ConfigFileTagProvider
Generates code for tag provider class from specified configuration XML file.
makeCopy() - Method in class org.htmlcleaner.TagNode
 
makeTree(List, ListIterator<BaseToken>, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
This method generally mutates flattened list of tokens into tree structure.
mappings - Variable in class org.htmlcleaner.CleanerTransformations
 
MARKER_ATTRIBUTE - Static variable in class org.htmlcleaner.HtmlCleaner
Marker attribute added to aid with part of the cleaning process.
markNodesToPrune(List, CleanTimeValues, int) - Method in class org.htmlcleaner.HtmlCleaner
 
math - Variable in class org.htmlcleaner.SpecialEntities
 
MathMLTagProvider - Class in org.htmlcleaner
It contains the MathML tags to use with Html5 tags
MathMLTagProvider(TagInfo, ConcurrentMap<String, TagInfo>) - Constructor for class org.htmlcleaner.MathMLTagProvider
 
maxDepth - Variable in class org.htmlcleaner.CleanerProperties
Provides an arbitrary recursion depth
maxEntityLength - Variable in class org.htmlcleaner.SpecialEntities
 
MEDIA_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
HTML5 Media Tags
MEDIA_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
HTML5 Media Tags
mediaTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
Html5 media-embedded tags (12 tags)
metadataTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
HTML5 Document metadata tags
minimizedTagPermitted - Variable in enum org.htmlcleaner.CloseTag
 
mustAddRequiredParent(TagInfo, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
Check if specified tag requires parent tag, but that parent tag is missing in the appropriate context.
mustCloseTags - Variable in class org.htmlcleaner.TagInfo
 

N

name - Variable in class org.htmlcleaner.conditional.TagNodeNameCondition
 
name - Variable in class org.htmlcleaner.TagInfo
 
name - Variable in class org.htmlcleaner.TagPos
 
name - Variable in class org.htmlcleaner.TagToken
 
namespace - Variable in class org.htmlcleaner.CleanTimeValues
A stack of namespaces for currently open tags.
namespaceMap - Variable in class org.htmlcleaner.CleanTimeValues
A map of all the namespace prefixes and URIs declared within the document.
namespacesaware - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
namespacesAware - Variable in class org.htmlcleaner.CleanerProperties
 
NestingState - Class in org.htmlcleaner
Nesting State Wrapper for a current HtmlCleaner cleaning state, keeping together the set of open tags and breaks in the current state.
NestingState(OpenTags, ChildBreaks) - Constructor for class org.htmlcleaner.NestingState
 
nestingStates - Variable in class org.htmlcleaner.CleanTimeValues
 
NEW_LINE - Static variable in class org.htmlcleaner.DomSerializer
 
NEW_LINE - Static variable in class org.htmlcleaner.JDomSerializer
 
newTagNode(String) - Method in class org.htmlcleaner.HtmlCleaner
 
nodebyxpath - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
NON_BREAKABLE_SPACE - Static variable in class org.htmlcleaner.SpecialEntities
 
none - org.htmlcleaner.ContentType
elements that have no children or content ( for example ).
none - org.htmlcleaner.Display
Elements that are not actually inline or block, usually such elements are not rendered at all.
NotAllowedTag - org.htmlcleaner.audit.ErrorType
The tag wasn't found on list of allowed tags, thus it was removed.
nsDeclarations - Variable in class org.htmlcleaner.TagNode
 

O

olderElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
omit - org.htmlcleaner.OptionalOutput
Never outputed even if supplied in the source.
omitCdataOutsideScriptAndStyle - Variable in class org.htmlcleaner.CleanerProperties
 
omitcomments - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitComments - Variable in class org.htmlcleaner.CleanerProperties
 
omitDeprecatedTags - Variable in class org.htmlcleaner.CleanerProperties
 
omitdeprtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitdoctypedecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitDoctypeDeclaration - Variable in class org.htmlcleaner.CleanerProperties
 
omithtmlenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitHtmlEnvelope - Variable in class org.htmlcleaner.CleanerProperties
 
omitunknowntags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
omitUnknownTags - Variable in class org.htmlcleaner.CleanerProperties
 
omitxmldecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
OMITXMLDECL - Static variable in class org.htmlcleaner.CommandLine
 
omitXmlDeclaration - Variable in class org.htmlcleaner.CleanerProperties
 
openPreTags - Variable in class org.htmlcleaner.CompactHtmlSerializer
 
openTags - Variable in class org.htmlcleaner.NestingState
 
OpenTags - Class in org.htmlcleaner
Class that contains information and methods for managing list of open, but unhandled tags.
OpenTags(HtmlCleaner) - Constructor for class org.htmlcleaner.OpenTags
 
optional - org.htmlcleaner.CloseTag
or is permitted
OptionalOutput - Enum in org.htmlcleaner
 
OptionalOutput() - Constructor for enum org.htmlcleaner.OptionalOutput
 
org.htmlcleaner - package org.htmlcleaner
 
org.htmlcleaner.audit - package org.htmlcleaner.audit
 
org.htmlcleaner.conditional - package org.htmlcleaner.conditional
 
outcharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
outputCData(CData) - Method in class org.htmlcleaner.DomSerializer
 
outputtype - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 

P

parent - Variable in class org.htmlcleaner.BaseHtmlNode
 
parse(InputSource) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
parserFactory - Static variable in class org.htmlcleaner.ConfigFileTagProvider
 
part1 - Variable in class org.htmlcleaner.DoctypeToken
 
part2 - Variable in class org.htmlcleaner.DoctypeToken
 
part3 - Variable in class org.htmlcleaner.DoctypeToken
 
part4 - Variable in class org.htmlcleaner.DoctypeToken
 
permittedTags - Variable in class org.htmlcleaner.TagInfo
 
PHRASING_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content
PHRASING_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content
PHRASING_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content.
phrasingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
Html5 phrasing tags --text level semantics (31 total) thelw data
pop() - Method in class org.htmlcleaner.ChildBreaks
pops out latest broken tag position.
popNesting(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
position - Variable in class org.htmlcleaner.TagPos
 
PRE_TAG - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
 
preferredChildTag - Variable in class org.htmlcleaner.TagInfo
 
presentationMarkup(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
preserve - org.htmlcleaner.OptionalOutput
outputed ONLY if supplied in the source.
preserveSourceAttributes - Variable in class org.htmlcleaner.TagTransformation
 
PrettyHtmlSerializer - Class in org.htmlcleaner
Pretty HTML serializer - creates resulting HTML with indenting lines.
PrettyHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
 
PrettyHtmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
 
PrettyXmlSerializer - Class in org.htmlcleaner
Pretty XML serializer - creates resulting XML with indenting lines.
PrettyXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
 
PrettyXmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
 
properties - Variable in class org.htmlcleaner.HtmlCleaner
 
props - Variable in class org.htmlcleaner.DomBuilder
 
props - Variable in class org.htmlcleaner.DomSerializer
The HTML Cleaner properties set by the user to control the HTML cleaning.
props - Variable in class org.htmlcleaner.HtmlTokenizer
 
props - Variable in class org.htmlcleaner.JDomSerializer
 
props - Variable in class org.htmlcleaner.Serializer
 
props - Variable in class org.htmlcleaner.TraversalDomSerializer
 
ProxyTagNode - Class in org.htmlcleaner
A TagNode that only really holds whitespace or comments - allows using ContentNode in places where a TagNode is expected.
ProxyTagNode(CommentNode, TagNode) - Constructor for class org.htmlcleaner.ProxyTagNode
 
ProxyTagNode(ContentNode, TagNode) - Constructor for class org.htmlcleaner.ProxyTagNode
 
pruned - Variable in class org.htmlcleaner.TagNode
Indicates that the node was marked to be pruned out of the tree.
pruneNodeSet - Variable in class org.htmlcleaner.CleanTimeValues
 
prunetags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
pruneTags - Variable in class org.htmlcleaner.CleanerProperties
 
pruneTagSet - Variable in class org.htmlcleaner.CleanerProperties
blacklist of tags
pruneTagSet - Variable in class org.htmlcleaner.CleanTimeValues
 
pushNesting(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
put(String, TagInfo) - Method in class org.htmlcleaner.DefaultTagProvider
 
put(String, TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
put(String, TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
It inserts the tag node into the tagInfoMap.
put(String, TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
put(SpecialEntity) - Method in class org.htmlcleaner.SpecialEntities
 

R

readIfNeeded(int) - Method in class org.htmlcleaner.HtmlTokenizer
 
readUrl(URL, String) - Static method in class org.htmlcleaner.Utils
Deprecated.
recognizeUnicodeChars - Variable in class org.htmlcleaner.CleanerProperties
 
removeAllChildren() - Method in class org.htmlcleaner.TagNode
Removes all children (subelements and text content).
removeAttribute(String) - Method in class org.htmlcleaner.TagNode
Removes specified attribute from this tag.
removeChild(Object) - Method in class org.htmlcleaner.TagNode
Remove specified child element from this node.
removeFromTree() - Method in class org.htmlcleaner.ProxyTagNode
 
removeFromTree() - Method in class org.htmlcleaner.TagNode
Remove this node from the tree.
removeTag(String) - Method in class org.htmlcleaner.OpenTags
 
reopenBrokenNode(ListIterator<BaseToken>, TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
replaceAttributes(Map<String, String>) - Method in class org.htmlcleaner.TagNode
Clears existing attributes and puts replacement attributes
replaceInvalidXmlIdentifierCharacters(String, String) - Static method in class org.htmlcleaner.Utils
Strips out invalid characters from names used for XML Elements and replaces them with the specified character.
required - org.htmlcleaner.CloseTag
is required.
RequiredParentMissing - org.htmlcleaner.audit.ErrorType
Missing parent tag was added for current (i.e.
requiredParentTags - Variable in class org.htmlcleaner.TagInfo
 
reset() - Method in class org.htmlcleaner.CleanerProperties
advancedXmlEscape = true; setUseCdataFor("script,style"); translateSpecialEntities = true; recognizeUnicodeChars = true; omitUnknownTags = false; treatUnknownTagsAsContent = false; omitDeprecatedTags = false; treatDeprecatedTagsAsContent = false; omitComments = false; omitXmlDeclaration = OptionalOutput.alwaysOutput; omitDoctypeDeclaration = OptionalOutput.alwaysOutput; omitHtmlEnvelope = OptionalOutput.alwaysOutput; useEmptyElementTags = true; allowMultiWordAttributes = true; allowHtmlInsideAttributes = false; ignoreQuestAndExclam = true; namespacesAware = true; keepHeadWhitespace = true; addNewlineToHeadAndBody = true; hyphenReplacementInComment = "="; pruneTags = null; allowTags = null; booleanAttributeValues = BOOL_ATT_SELF; collapseNullHtml = CollapseHtml.none charset = "UTF-8"; trimAttributeValues = true; tagInfoProvider = HTML5TagProvider.INSTANCE maxDepth = 1000
resetPruneTagSet() - Method in class org.htmlcleaner.CleanerProperties
 
rootNode - Variable in class org.htmlcleaner.CleanTimeValues
 
row - Variable in class org.htmlcleaner.BaseTokenImpl
 
rtrim(String) - Method in class org.htmlcleaner.PrettyXmlSerializer
Trims specified string from right.
rtrim(String) - Static method in class org.htmlcleaner.Utils
Trims specified string from right.

S

SAFE_BEGIN_CDATA - Static variable in class org.htmlcleaner.CData
 
SAFE_BEGIN_CDATA_ALT - Static variable in class org.htmlcleaner.CData
 
SAFE_END_CDATA - Static variable in class org.htmlcleaner.CData
 
SAFE_END_CDATA_ALT - Static variable in class org.htmlcleaner.CData
 
sanitizeHtmlAttributeName(String) - Static method in class org.htmlcleaner.Utils
 
sanitizeXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
 
sanitizeXmlIdentifier(String, String) - Static method in class org.htmlcleaner.Utils
 
sanitizeXmlIdentifier(String, String, String) - Static method in class org.htmlcleaner.Utils
Attempts to replace invalid attribute names with valid ones.
satisfy(String, String) - Method in interface org.htmlcleaner.AttributeTransformation
 
satisfy(String, String) - Method in class org.htmlcleaner.AttributeTransformationPatternImpl
 
satisfy(TagNode) - Method in interface org.htmlcleaner.conditional.ITagNodeCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagAllCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttExistsCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttValueCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
 
satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeNameCondition
 
satisfy(TagNode, boolean) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
save(char) - Method in class org.htmlcleaner.HtmlTokenizer
Saves specified character to the temporary buffer.
saveCurrent() - Method in class org.htmlcleaner.HtmlTokenizer
Saves character at current runtime position to the temporary buffer.
saveCurrent(int) - Method in class org.htmlcleaner.HtmlTokenizer
Saves specified number of characters at current runtime position to the temporary buffer.
saveToLastOpenTag(List, Object, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
 
SCRIPT_SUPPORTING_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
 
scriptElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
scriptElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
scriptingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
HTML5 scripting tags
semanticFlowTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 semantic flow tags-Sectioning tags (15 total)
serialize(Serializer, Writer) - Method in class org.htmlcleaner.BaseHtmlNode
 
serialize(Serializer, Writer) - Method in interface org.htmlcleaner.BaseToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.CommentNode
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.ContentNode
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.DoctypeToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.EndTagToken
 
serialize(Serializer, Writer) - Method in class org.htmlcleaner.TagNode
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.Serializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleHtmlSerializer
 
serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleXmlSerializer
 
serializeAttribute(TagNode, Writer, String, String) - Method in class org.htmlcleaner.XmlSerializer
This allows overriding to eliminate forbidden attributes (for example javascript attributes onclick, onblur, etc.
serializeCData(CData, TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
Serialize a CDATA section.
serializeContentToken(ContentNode, TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
Serialize a content token, escaping where necessary.
serializeEndTag(TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
 
serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
 
serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
 
serializeOpenTag(TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
 
serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
 
serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
 
serializePrettyHtml(TagNode, Writer, int, boolean, boolean) - Method in class org.htmlcleaner.PrettyHtmlSerializer
 
serializePrettyXml(TagNode, Writer, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
 
Serializer - Class in org.htmlcleaner
Basic abstract serializer - contains common logic for descendants (methods writeXXX().
Serializer(CleanerProperties) - Constructor for class org.htmlcleaner.Serializer
 
Serializer.HeadlessTagNode - Class in org.htmlcleaner
Used to implement serialization with missing envelope - omiting open and close tags, just serialize children.
set - Variable in class org.htmlcleaner.OpenTags
 
setAddNewlineToHeadAndBody(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAdvancedxmlescape(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAdvancedXmlEscape(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowhtmlinsideattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAllowHtmlInsideAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowInvalidAttributeNames(boolean) - Method in class org.htmlcleaner.CleanerProperties
Set whether to allow invalid attribute names, or to try to fix or omit them
setAllowInvalidAttributeNames(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAllowmultiwordattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setAllowMultiWordAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowTags(String) - Method in class org.htmlcleaner.CleanerProperties
 
setAllowTagSet(String) - Method in class org.htmlcleaner.CleanerProperties
 
setAssumedNamespace(String) - Method in class org.htmlcleaner.TagInfo
 
setAssumedNamespacePrefix(String) - Method in class org.htmlcleaner.TagInfo
 
setAttributes(Map<String, String>) - Method in class org.htmlcleaner.TagNode
Replace the current set of attributes with a new set.
setAttributes(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
 
setAutoGenerated(boolean) - Method in class org.htmlcleaner.TagNode
 
setBelongsTo(BelongsTo) - Method in class org.htmlcleaner.TagInfo
 
setBooleanAttributeValues(String) - Method in class org.htmlcleaner.CleanerProperties
 
setBooleanatts(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setCharset(String) - Method in class org.htmlcleaner.CleanerProperties
 
setChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.TagNode
 
setChildTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setCleanerTransformations(CleanerTransformations) - Method in class org.htmlcleaner.CleanerProperties
 
setCol(int) - Method in interface org.htmlcleaner.BaseToken
 
setCol(int) - Method in class org.htmlcleaner.BaseTokenImpl
 
setContinueAfterTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setCopyTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setCreatingHtmlDom(boolean) - Method in class org.htmlcleaner.XmlSerializer
 
setDeprecated(boolean) - Method in class org.htmlcleaner.TagInfo
 
setDeserializeEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setDest(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setDisplay(Display) - Method in class org.htmlcleaner.TagInfo
 
setDocType(DoctypeToken) - Method in class org.htmlcleaner.TagNode
 
setFatalTag(String) - Method in class org.htmlcleaner.TagInfo
 
setForeignMarkup(boolean) - Method in class org.htmlcleaner.TagNode
 
setFormed() - Method in class org.htmlcleaner.TagNode
 
setFormed(boolean) - Method in class org.htmlcleaner.TagNode
 
setHigherTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setHtmlVersion(int) - Method in class org.htmlcleaner.CleanerProperties
Sets the html version according to the parameter.Also,it sets the tag provider to the appropriate version.
setHyphenreplacement(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setHyphenReplacementInComment(String) - Method in class org.htmlcleaner.CleanerProperties
 
setIgnoreqe(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setIgnoreQuestAndExclam(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setIncharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setInnerHtml(TagNode, String) - Method in class org.htmlcleaner.HtmlCleaner
For the specified tag node, defines it's html content.
setInvalidAttributeNamePrefix(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setInvalidXmlAttributeNamePrefix(String) - Method in class org.htmlcleaner.CleanerProperties
Sets the prefix to use for xml attributes that are invalid
setItemsToMove(List<BaseToken>) - Method in class org.htmlcleaner.TagNode
 
setKeepWhitespaceAndCommentsInHead(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setMaxDepth(int) - Method in class org.htmlcleaner.CleanerProperties
 
setMustCloseTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setName(String) - Method in class org.htmlcleaner.TagInfo
 
setNamespacesaware(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setNamespacesAware(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setNodebyxpath(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitCdataOutsideScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitcomments(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitComments(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitDeprecatedTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitdeprtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitdoctypedecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitDoctypeDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmithtmlenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitHtmlEnvelope(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitunknowntags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitUnknownTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOmitxmldecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOmitXmlDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setOutcharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setOutputtype(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setParent(TagNode) - Method in class org.htmlcleaner.BaseHtmlNode
 
setParent(TagNode) - Method in interface org.htmlcleaner.HtmlNode
 
setPermittedTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
 
setPreferredChildTag(String) - Method in class org.htmlcleaner.TagInfo
 
setPruned(boolean) - Method in class org.htmlcleaner.TagNode
 
setPrunetags(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setPruneTags(String) - Method in class org.htmlcleaner.CleanerProperties
Resets prune tags set and adds tag name conditions to it.
setRecognizeUnicodeChars(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setRequiredParent(String) - Method in class org.htmlcleaner.TagInfo
 
setRow(int) - Method in interface org.htmlcleaner.BaseToken
 
setRow(int) - Method in class org.htmlcleaner.BaseTokenImpl
 
setSpecialentities(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setSrc(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTaginfofile(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTagInfoProvider(ITagInfoProvider) - Method in class org.htmlcleaner.CleanerProperties
 
setText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTransform(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTranslateSpecialEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
TODO : use OptionalOutput
setTransResCharsToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTransSpecialEntitiesToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTreatDeprecatedTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTreatdeprtagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTreatunknowntagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setTreatUnknownTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTrimAttributeValues(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setTrimAttributeValues(boolean) - Method in class org.htmlcleaner.TagNode
 
setUnicodechars(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUnique(boolean) - Method in class org.htmlcleaner.TagInfo
 
setUsecdata(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUsecdatafor(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUseCdataFor(String) - Method in class org.htmlcleaner.CleanerProperties
 
setUseCdataForScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setUseemptyelementtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
 
setUseEmptyElementTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
 
setXmlVersion(String) - Method in class org.htmlcleaner.DomSerializer
 
shouldEscapeOrTranslateEntities() - Method in class org.htmlcleaner.DomBuilder
 
SimpleHtmlSerializer - Class in org.htmlcleaner
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
SimpleHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
 
SimpleHtmlSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
 
SimpleXmlSerializer - Class in org.htmlcleaner
Simple XML serializer - creates resulting XML without indenting lines.
SimpleXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleXmlSerializer
 
singleton(Object) - Method in class org.htmlcleaner.XPather
Creates one-element collection for the specified object.
skipWhitespaces() - Method in class org.htmlcleaner.HtmlTokenizer
Skips whitespaces at current position and moves foreward until non-whitespace character is found or the end of content is reached.
someAlreadyOpen(Set<String>) - Method in class org.htmlcleaner.OpenTags
Checks if any of tags specified in the set are already open.
sourceTag - Variable in class org.htmlcleaner.TagTransformation
 
specialentities - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
SpecialEntities - Class in org.htmlcleaner
This class contains map with special entities used in HTML and their unicodes.
SpecialEntities(boolean, boolean) - Constructor for class org.htmlcleaner.SpecialEntities
 
SpecialEntity - Class in org.htmlcleaner
 
SpecialEntity(String, int, String, boolean) - Constructor for class org.htmlcleaner.SpecialEntity
 
src - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
start() - Method in class org.htmlcleaner.HtmlTokenizer
Starts parsing HTML.
startElement(String, String, String, Attributes) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
startsWith(String) - Method in class org.htmlcleaner.HtmlTokenizer
Checks if content starts with specified value at the current position.
strictErrorChecking - Variable in class org.htmlcleaner.DomBuilder
 
strictErrorChecking - Variable in class org.htmlcleaner.DomSerializer
 
strictErrorChecking - Variable in class org.htmlcleaner.TraversalDomSerializer
 
STRONG - Static variable in class org.htmlcleaner.DefaultTagProvider
 
STRONG - Static variable in class org.htmlcleaner.Html4TagProvider
 
STRONG - Static variable in class org.htmlcleaner.Html5TagProvider
 
styleElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 

T

tableElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
 
tableElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
tableTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
The HTML5 table tags (12 total)
TagAllCondition - Class in org.htmlcleaner.conditional
All nodes.
TagAllCondition() - Constructor for class org.htmlcleaner.conditional.TagAllCondition
 
tagAttributes() - Method in class org.htmlcleaner.HtmlTokenizer
Parses list tag attributes from the current position.
tagEncountered(String) - Method in class org.htmlcleaner.OpenTags
 
tagEnd() - Method in class org.htmlcleaner.HtmlTokenizer
Parses end of the tag.
tagExists(String, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
 
tagInfo - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
TagInfo - Class in org.htmlcleaner
Class contains information about single HTML tag.
It also contains rules for tag balancing.
TagInfo(String, ContentType, BelongsTo, boolean, boolean, boolean, CloseTag, Display) - Constructor for class org.htmlcleaner.TagInfo
 
taginfofile - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
tagInfoMap - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
 
tagInfoMap - Variable in class org.htmlcleaner.DefaultTagProvider
 
tagInfoMap - Variable in class org.htmlcleaner.Html4TagProvider
 
tagInfoMap - Variable in class org.htmlcleaner.Html5TagProvider
 
tagInfoProvider - Variable in class org.htmlcleaner.CleanerProperties
 
tagInfoProvider - Variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
TagNode - Class in org.htmlcleaner
XML node tag - basic node of the cleaned HTML tree.
TagNode(String) - Constructor for class org.htmlcleaner.TagNode
 
TagNode(String, boolean) - Constructor for class org.htmlcleaner.TagNode
 
TagNodeAttExistsCondition - Class in org.htmlcleaner.conditional
Checks if node contains specified attribute.
TagNodeAttExistsCondition(String) - Constructor for class org.htmlcleaner.conditional.TagNodeAttExistsCondition
 
TagNodeAttNameValueRegexCondition - Class in org.htmlcleaner.conditional
Checks if node has specified attribute with specified value.
TagNodeAttNameValueRegexCondition(Pattern, Pattern) - Constructor for class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
 
TagNodeAttValueCondition - Class in org.htmlcleaner.conditional
Checks if node has specified attribute with specified value.
TagNodeAttValueCondition(String, String, boolean) - Constructor for class org.htmlcleaner.conditional.TagNodeAttValueCondition
 
TagNodeAutoGeneratedCondition - Class in org.htmlcleaner.conditional
Remove empty autogenerated nodes.
TagNodeAutoGeneratedCondition() - Constructor for class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
 
TagNodeEmptyContentCondition - Class in org.htmlcleaner.conditional
Checks if node is an inline 0r block element and has empty contents or white/non-breakable spaces only.
TagNodeEmptyContentCondition(ITagInfoProvider) - Constructor for class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
 
TagNodeInsignificantBrCondition - Class in org.htmlcleaner.conditional
Checks if node is an insignificant br tag -- is placed at the end or at the start of a block.
TagNodeInsignificantBrCondition() - Constructor for class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
 
TagNodeNameCondition - Class in org.htmlcleaner.conditional
Checks if node has specified name.
TagNodeNameCondition(String) - Constructor for class org.htmlcleaner.conditional.TagNodeNameCondition
 
TagNodeVisitor - Interface in org.htmlcleaner
Defines action to be performed on TagNodes
TagPos - Class in org.htmlcleaner
Contains information about a single open tag
TagPos(int, String, TagInfo, CleanTimeValues) - Constructor for class org.htmlcleaner.TagPos
 
tagStart() - Method in class org.htmlcleaner.HtmlTokenizer
Parses start of the tag.
TagToken - Class in org.htmlcleaner
HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
TagToken() - Constructor for class org.htmlcleaner.TagToken
 
TagToken(String) - Constructor for class org.htmlcleaner.TagToken
 
TagTransformation - Class in org.htmlcleaner
Describes how specified tag is transformed to another one, or is ignored during parsing
TagTransformation() - Constructor for class org.htmlcleaner.TagTransformation
 
TagTransformation(String) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation in which specified tag will be skipped (ignored) during parsing process.
TagTransformation(String, String) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation from source tag to target tag preserving all source tag attributes.
TagTransformation(String, String, boolean) - Constructor for class org.htmlcleaner.TagTransformation
Creates new tag transformation from source tag to target tag specifying whether source tag attributes are preserved.
tail(HtmlNode, int) - Method in class org.htmlcleaner.DomBuilder
 
tail(HtmlNode, int) - Method in interface org.htmlcleaner.XmlVisitor
Callback for when a node is last visited, after all of its descendants have been visited.
template - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
 
text - org.htmlcleaner.ContentType
 
text - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
throwStandardException() - Method in class org.htmlcleaner.XPather
 
toBoolean(String) - Static method in class org.htmlcleaner.CommandLine
 
token - Variable in class org.htmlcleaner.ProxyTagNode
 
tokenArray - Variable in class org.htmlcleaner.XPather
 
tokenElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
 
tokenize(String, String) - Static method in class org.htmlcleaner.Utils
 
toString() - Method in class org.htmlcleaner.BaseTokenImpl
 
toString() - Method in class org.htmlcleaner.CData
 
toString() - Method in class org.htmlcleaner.CommentNode
 
toString() - Method in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
 
toString() - Method in class org.htmlcleaner.ContentNode
 
toString() - Method in class org.htmlcleaner.DoctypeToken
 
toString() - Method in class org.htmlcleaner.EndTagToken
 
toString() - Method in class org.htmlcleaner.TagToken
 
toString(Document) - Static method in class org.htmlcleaner.TraversalDomSerializer
 
toText(Object) - Method in class org.htmlcleaner.XPather
 
toValue(Object) - Static method in enum org.htmlcleaner.BelongsTo
 
toValue(Object) - Static method in enum org.htmlcleaner.ContentType
 
transform - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
transformations - Variable in class org.htmlcleaner.HtmlCleaner
 
transformations - Variable in class org.htmlcleaner.HtmlTokenizer
 
transformAttributes(String, Map<String, String>) - Method in class org.htmlcleaner.CleanerTransformations
 
translateSpecialEntities - Variable in class org.htmlcleaner.CleanerProperties
 
transResCharsToNCR - Variable in class org.htmlcleaner.CleanerProperties
 
transSpecialEntitiesToNCR - Variable in class org.htmlcleaner.CleanerProperties
 
TraversalDomSerializer - Class in org.htmlcleaner
A traversal-based serializer for DOM; used to avoid recursion and stack overflow for large HTML documents.
TraversalDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.TraversalDomSerializer
 
TraversalDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
 
TraversalDomSerializer(CleanerProperties, boolean, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
 
TraversalDomSerializer(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
 
traverse(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
Traverses the tree and performs visitor's action on each node.
traverse(XmlVisitor, HtmlNode) - Static method in class org.htmlcleaner.XmlTraversor
Start a depth-first traverse of the root and all of its descendants.
traverseInternally(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
 
treatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
 
treatdeprtagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
treatunknowntagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
treatUnknownTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
 
trimAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
 
type - Variable in class org.htmlcleaner.DoctypeToken
The identified DocType, if any

U

UnclosedTag - org.htmlcleaner.audit.ErrorType
No matching close token was found for the open tag.
unicodechars - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
unique - Variable in class org.htmlcleaner.TagInfo
 
UniqueTagDuplicated - org.htmlcleaner.audit.ErrorType
Second instance of an unique tag was found, most likely it was removed.
Unknown - org.htmlcleaner.audit.ErrorType
The tag is unknown and current cleaner mode doesn't allows this.
UNKNOWN - Static variable in class org.htmlcleaner.DoctypeToken
 
UnpermittedChild - org.htmlcleaner.audit.ErrorType
This tag have bad child that shouldn't be here.
unsafeBlockElements - Static variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
Removal of element from this set can affect layout too hard.
updateCoordinates(char) - Method in class org.htmlcleaner.HtmlTokenizer
Looks onto the char passed and updates current position coordinates.
updateTagTransformations(String, String) - Method in class org.htmlcleaner.CleanerTransformations
 
updateTagTransformations(Map) - Method in class org.htmlcleaner.CleanerTransformations
 
usecdata - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
usecdatafor - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
useCdataFor - Variable in class org.htmlcleaner.CleanerProperties
 
useCdataForList - Variable in class org.htmlcleaner.CleanerProperties
 
useemptyelementtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
 
useEmptyElementTags - Variable in class org.htmlcleaner.CleanerProperties
 
Utils - Class in org.htmlcleaner
Common utilities.
Utils() - Constructor for class org.htmlcleaner.Utils
 

V

valid - Variable in class org.htmlcleaner.DoctypeToken
 
VALID_XML_IDENTIFIER_CHAR_PATTERN - Static variable in class org.htmlcleaner.Utils
 
VALID_XML_IDENTIFIER_CHAR_REGEX - Static variable in class org.htmlcleaner.Utils
 
VALID_XML_IDENTIFIER_START_CHAR_PATTERN - Static variable in class org.htmlcleaner.Utils
 
VALID_XML_IDENTIFIER_START_CHAR_REGEX - Static variable in class org.htmlcleaner.Utils
 
validate() - Method in class org.htmlcleaner.DoctypeToken
Checks the doctype according to W3C parsing rules and tries to identify the type and validity See: http://www.w3.org/TR/html-markup/syntax.html#doctype-syntax http://dev.w3.org/html5/html-author/#doctype-declaration
valueOf(String) - Static method in enum org.htmlcleaner.audit.ErrorType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.htmlcleaner.BelongsTo
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.htmlcleaner.CloseTag
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.htmlcleaner.ContentType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.htmlcleaner.Display
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.htmlcleaner.OptionalOutput
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.htmlcleaner.audit.ErrorType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.htmlcleaner.BelongsTo
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.htmlcleaner.CloseTag
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.htmlcleaner.ContentType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.htmlcleaner.Display
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.htmlcleaner.OptionalOutput
Returns an array containing the constants of this enum type, in the order they are declared.
VAR_END - Static variable in class org.htmlcleaner.TagTransformation
 
VAR_START - Static variable in class org.htmlcleaner.TagTransformation
 
visit(TagNode, HtmlNode) - Method in interface org.htmlcleaner.TagNodeVisitor
Action to be performed on single node in the tree
visitor - Variable in class org.htmlcleaner.XmlTraversor
 

W

WORKING_BUFFER_SIZE - Static variable in class org.htmlcleaner.HtmlTokenizer
 
write(TagNode, Writer, String) - Method in class org.htmlcleaner.Serializer
Writes specified node using specified writer.
write(TagNode, Writer, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified node using specified writer.
writeToFile(TagNode, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using system default charset.
writeToFile(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeToFile(TagNode, String, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset.
writeToFile(TagNode, String, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using system default charset.
writeToStream(TagNode, OutputStream, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using system default charset and optionally omits node envelope (skips open and close tags of the node).
writeToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using specified charset.
writeToStream(TagNode, OutputStream, String, boolean) - Method in class org.htmlcleaner.Serializer
Writes specified TagNode to the output stream, using specified charset and optionally omits node envelope (skips open and close tags of the node).
writeXml(TagNode, Writer, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use write() instead.
writeXmlToFile(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToFile() instead.
writeXmlToFile(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToFile() instead.
writeXmlToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToStream() instead.
writeXmlToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.XmlSerializer
Deprecated.
Use writeToStream() instead.

X

XHTML1_0_FRAMESET - Static variable in class org.htmlcleaner.DoctypeToken
 
XHTML1_0_STRICT - Static variable in class org.htmlcleaner.DoctypeToken
 
XHTML1_0_TRANSITIONAL - Static variable in class org.htmlcleaner.DoctypeToken
 
XHTML1_1 - Static variable in class org.htmlcleaner.DoctypeToken
 
XHTML1_1_BASIC - Static variable in class org.htmlcleaner.DoctypeToken
 
XML_10 - Static variable in class org.htmlcleaner.DomSerializer
 
XML_11 - Static variable in class org.htmlcleaner.DomSerializer
 
XMLNS_NAMESPACE - Static variable in class org.htmlcleaner.XmlSerializer
 
XmlSerializer - Class in org.htmlcleaner
Abstract XML serializer - contains common logic for descendants.
XmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.XmlSerializer
 
XmlTraversor - Class in org.htmlcleaner
Depth-first node traversor.
XmlTraversor() - Constructor for class org.htmlcleaner.XmlTraversor
 
xmlVersion - Variable in class org.htmlcleaner.DomSerializer
 
XmlVisitor - Interface in org.htmlcleaner
Node visitor interface.
XPather - Class in org.htmlcleaner
Utility for searching cleaned document tree with XPath expressions.
XPather(String) - Constructor for class org.htmlcleaner.XPather
Constructor - creates XPather instance with specified XPath expression.
XPatherException - Exception in org.htmlcleaner
Exception that could occure during XPather evaluation.
XPatherException() - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(String) - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(String, Throwable) - Constructor for exception org.htmlcleaner.XPatherException
 
XPatherException(Throwable) - Constructor for exception org.htmlcleaner.XPatherException
 

_

_asExpected - Variable in class org.htmlcleaner.HtmlTokenizer
 
_bodyOpened - Variable in class org.htmlcleaner.CleanTimeValues
 
_col - Variable in class org.htmlcleaner.HtmlTokenizer
 
_currentTagToken - Variable in class org.htmlcleaner.HtmlTokenizer
 
_docType - Variable in class org.htmlcleaner.HtmlTokenizer
 
_headOpened - Variable in class org.htmlcleaner.CleanTimeValues
 
_headTags - Variable in class org.htmlcleaner.CleanTimeValues
 
_isLateForDoctype - Variable in class org.htmlcleaner.HtmlTokenizer
 
_isSpecialContext - Variable in class org.htmlcleaner.HtmlTokenizer
 
_isSpecialContextName - Variable in class org.htmlcleaner.HtmlTokenizer
 
_len - Variable in class org.htmlcleaner.HtmlTokenizer
 
_namespacePrefixes - Variable in class org.htmlcleaner.HtmlTokenizer
 
_pos - Variable in class org.htmlcleaner.HtmlTokenizer
 
_put(SpecialEntity) - Method in class org.htmlcleaner.SpecialEntities
 
_reader - Variable in class org.htmlcleaner.HtmlTokenizer
 
_row - Variable in class org.htmlcleaner.HtmlTokenizer
 
_saved - Variable in class org.htmlcleaner.HtmlTokenizer
 
_tokenList - Variable in class org.htmlcleaner.HtmlTokenizer
 
_working - Variable in class org.htmlcleaner.HtmlTokenizer
 
A B C D E F G H I J K L M N O P R S T U V W X _ 
All Classes All Packages