A B C D E F G H I J K L M N O P R S T U V W X _
All Classes All Packages
All Classes All Packages
All Classes All Packages
A
- addAttribute(String, String) - Method in class org.htmlcleaner.EndTagToken
- addAttribute(String, String) - Method in class org.htmlcleaner.TagNode
-
Adds specified attribute to this tag or overrides existing one.
- addAttribute(String, String) - Method in class org.htmlcleaner.TagToken
- addAttributePatternTransformation(Pattern, String) - Method in class org.htmlcleaner.TagTransformation
- addAttributePatternTransformation(Pattern, Pattern, String) - Method in class org.htmlcleaner.TagTransformation
- addAttributePatternTransformation(AttributeTransformation) - Method in class org.htmlcleaner.TagTransformation
- addAttributesToTag(TagNode, Map<String, String>) - Method in class org.htmlcleaner.HtmlCleaner
-
Add attributes from specified map to the specified tag.
- addAttributeTransformation(String) - Method in class org.htmlcleaner.TagTransformation
-
Adds new attribute transformation in which destination attrbute will not exists (simply removes it from list of attributes).
- addAttributeTransformation(String, String) - Method in class org.htmlcleaner.TagTransformation
-
Adds new attribute transformation to this tag transformation.
- addBreak(TagPos, TagPos) - Method in class org.htmlcleaner.ChildBreaks
-
Adds the break info to the top of the stacks.
- addChild(Object) - Method in class org.htmlcleaner.TagNode
- addChildren(List) - Method in class org.htmlcleaner.TagNode
-
Add all elements from specified list to this node.
- addGlobalTransformation(AttributeTransformation) - Method in class org.htmlcleaner.CleanerTransformations
- addHtmlModificationListener(HtmlModificationListener) - Method in class org.htmlcleaner.CleanerProperties
-
Adds a listener to the list of objects that will be notified about changes that cleaner does during cleanup process.
- addIfNeededToPruneSet(TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- addItemForMoving(Object) - Method in class org.htmlcleaner.TagNode
- addNamespaceDeclaration(String, String) - Method in class org.htmlcleaner.TagNode
-
Adds namespace declaration to the node
- addNewlineToHeadAndBody - Variable in class org.htmlcleaner.CleanerProperties
-
"cause the cleaner cannot keep track of whitespace at that level", there are 2 lists built: one for the head , one for the body.
- addPossibleHeadCandidate(TagInfo, TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Checks if specified tag with specified info is candidate for moving to head section.
- addPruneNode(TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- addPruneTagNodeCondition(ITagNodeCondition) - Method in class org.htmlcleaner.CleanerProperties
-
Adds the condition to existing prune tag set.
- addSavedAsContent() - Method in class org.htmlcleaner.HtmlTokenizer
- addTag(String, TagInfo, int, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
- addTagNameConditions(Set<ITagNodeCondition>, String) - Method in class org.htmlcleaner.CleanerProperties
- addText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- addToken(BaseToken) - Method in class org.htmlcleaner.HtmlTokenizer
- addTransformation(TagTransformation) - Method in class org.htmlcleaner.CleanerTransformations
-
Adds specified tag transformation to the collection.
- advancedxmlescape - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- advancedXmlEscape - Variable in class org.htmlcleaner.CleanerProperties
-
If this parameter is set to true, ampersand sign (&) that proceeds valid XML character sequences (&XXX;) will not be escaped with &XXX;
- afterTagLineBreakNeeded - Variable in enum org.htmlcleaner.Display
- all - org.htmlcleaner.ContentType
- allowhtmlinsideattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- allowHtmlInsideAttributes - Variable in class org.htmlcleaner.CleanerProperties
- allowInvalidAttributeNames - Variable in class org.htmlcleaner.CleanerProperties
- allowInvalidAttributeNames - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- allowmultiwordattributes - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- allowMultiWordAttributes - Variable in class org.htmlcleaner.CleanerProperties
- allowsAnything() - Method in class org.htmlcleaner.TagInfo
- allowsBody() - Method in class org.htmlcleaner.TagInfo
- allowsItem(BaseToken) - Method in class org.htmlcleaner.TagInfo
- allowTags - Variable in class org.htmlcleaner.CleanerProperties
- allowTagSet - Variable in class org.htmlcleaner.CleanerProperties
-
the list of allowed tags (whitelist approach v.
- allowTagSet - Variable in class org.htmlcleaner.CleanTimeValues
- allTags - Variable in class org.htmlcleaner.CleanTimeValues
- alwaysOutput - org.htmlcleaner.OptionalOutput
-
Always outputed, if information is not supplied in the source a default is created.
- ampNcr - Static variable in class org.htmlcleaner.Utils
- any - org.htmlcleaner.Display
-
The following elements may be used as either block-level elements or inline elements.
- applyTagTransformations(Map<String, String>) - Method in class org.htmlcleaner.TagTransformation
- areCopiedTokensEqual(TagNode, TagNode) - Static method in class org.htmlcleaner.HtmlCleaner
-
Determines if two copied tokens are equal.
- ASCII_CHAR - Static variable in class org.htmlcleaner.Utils
- assumedNamespace - Variable in class org.htmlcleaner.TagInfo
- assumedNamespacePrefix - Variable in class org.htmlcleaner.TagInfo
- attName - Variable in class org.htmlcleaner.conditional.TagNodeAttExistsCondition
- attName - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
- attNamePattern - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
- attNameRegex - Variable in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
- attributePatternTransformations - Variable in class org.htmlcleaner.TagTransformation
- attributes - Variable in class org.htmlcleaner.TagNode
- attributesToLowerCase() - Method in class org.htmlcleaner.TagNode
-
Returns a copy of the set of attributes for this node with lowercase names.
- AttributeTransformation - Interface in org.htmlcleaner
- AttributeTransformationPatternImpl - Class in org.htmlcleaner
- AttributeTransformationPatternImpl(String, String, String) - Constructor for class org.htmlcleaner.AttributeTransformationPatternImpl
- AttributeTransformationPatternImpl(Pattern, Pattern, String) - Constructor for class org.htmlcleaner.AttributeTransformationPatternImpl
- attributeTransformations - Variable in class org.htmlcleaner.TagTransformation
- attributeValue() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses a single tag attribute - it is expected to be in one of the forms: name=value name="value" name='value' name
- attValue - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
- attValuePattern - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
- attValueRegex - Variable in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
- autoGenerated - Variable in class org.htmlcleaner.TagNode
-
Used to indicate a start tag that was auto generated because
TagInfo.isContinueAfter(String)
(closedTag.getName()) returned true For example,
B
- BaseHtmlNode - Class in org.htmlcleaner
- BaseHtmlNode() - Constructor for class org.htmlcleaner.BaseHtmlNode
- BaseToken - Interface in org.htmlcleaner
-
Base token interface.
- BaseTokenImpl - Class in org.htmlcleaner
-
Base class for all tokens.
- BaseTokenImpl() - Constructor for class org.htmlcleaner.BaseTokenImpl
- BaseTokenImpl(int, int) - Constructor for class org.htmlcleaner.BaseTokenImpl
- basicElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- bchomp(String) - Static method in class org.htmlcleaner.Utils
-
Removes the first newline and last newline (if present) of a string
- BEGIN_CDATA - Static variable in class org.htmlcleaner.CData
- belongsTo - Variable in class org.htmlcleaner.TagInfo
- BelongsTo - Enum in org.htmlcleaner
- BelongsTo(String) - Constructor for enum org.htmlcleaner.BelongsTo
- blank - Variable in class org.htmlcleaner.ContentNode
- block - org.htmlcleaner.Display
-
Block-level elements typically contain inline elements and other block-level elements.
- BODY - org.htmlcleaner.BelongsTo
- bodyNode - Variable in class org.htmlcleaner.CleanTimeValues
- bodyNode - Variable in class org.htmlcleaner.ProxyTagNode
- BOOL_ATT_EMPTY - Static variable in class org.htmlcleaner.CleanerProperties
- BOOL_ATT_SELF - Static variable in class org.htmlcleaner.CleanerProperties
- BOOL_ATT_TRUE - Static variable in class org.htmlcleaner.CleanerProperties
- booleanAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
- booleanatts - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- BR_TAG - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
- BR_TAG - Static variable in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
- breakingTags - Variable in class org.htmlcleaner.ChildBreaks
- BrowserCompactXmlSerializer - Class in org.htmlcleaner
-
Browser compact XML serializer - creates resulting XML by stripping whitespaces wherever possible, but preserving single whitespace where at least one exists.
- BrowserCompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.BrowserCompactXmlSerializer
C
- C0 - Static variable in class org.htmlcleaner.XPather
- C9 - Static variable in class org.htmlcleaner.XPather
- calculateRootNode(CleanTimeValues, Set<String>) - Method in class org.htmlcleaner.HtmlCleaner
-
Assigns root node to internal variable and adds neccessery xmlns attributes if cleaner is namespace-aware.
- CD - Static variable in class org.htmlcleaner.XPather
- cdata() - Method in class org.htmlcleaner.HtmlTokenizer
- CData - Class in org.htmlcleaner
- CData(String) - Constructor for class org.htmlcleaner.CData
- characters(char[], int, int) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- charset - Variable in class org.htmlcleaner.CleanerProperties
- charValue() - Method in class org.htmlcleaner.SpecialEntity
- checkSublist(int, int, List) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
- childBreaks - Variable in class org.htmlcleaner.NestingState
- ChildBreaks - Class in org.htmlcleaner
-
Contains information about nodes that were closed due to their child nodes.
- ChildBreaks() - Constructor for class org.htmlcleaner.ChildBreaks
- children - Variable in class org.htmlcleaner.TagNode
- childTags - Variable in class org.htmlcleaner.TagInfo
- chomp(String) - Static method in class org.htmlcleaner.Utils
-
Removes the last newline (if present) of a string
- className - Static variable in class org.htmlcleaner.CommandLine
- clean(File) - Method in class org.htmlcleaner.HtmlCleaner
- clean(File, String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(InputStream) - Method in class org.htmlcleaner.HtmlCleaner
- clean(InputStream, String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(Reader) - Method in class org.htmlcleaner.HtmlCleaner
- clean(Reader, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Basic version of the cleaning call.
- clean(String) - Method in class org.htmlcleaner.DoctypeToken
- clean(String) - Method in class org.htmlcleaner.HtmlCleaner
- clean(URL) - Method in class org.htmlcleaner.HtmlCleaner
-
Deprecated.
- clean(URL, String) - Method in class org.htmlcleaner.HtmlCleaner
-
Deprecated.
- cleaner - Variable in class org.htmlcleaner.HtmlTokenizer
- CleanerProperties - Class in org.htmlcleaner
-
Properties defining cleaner's behaviour
- CleanerProperties() - Constructor for class org.htmlcleaner.CleanerProperties
- CleanerProperties(ITagInfoProvider) - Constructor for class org.htmlcleaner.CleanerProperties
- cleanerTransformations - Variable in class org.htmlcleaner.CleanerProperties
- CleanerTransformations - Class in org.htmlcleaner
-
Contains transformation collection.
- CleanerTransformations() - Constructor for class org.htmlcleaner.CleanerTransformations
- CleanerTransformations(Map) - Constructor for class org.htmlcleaner.CleanerTransformations
- cleanTimeValues - Variable in class org.htmlcleaner.HtmlTokenizer
- CleanTimeValues - Class in org.htmlcleaner
-
This class is for thread-safe handling of private instance variables from HtmlCleaner
- CleanTimeValues() - Constructor for class org.htmlcleaner.CleanTimeValues
- clear() - Method in class org.htmlcleaner.CleanerTransformations
- CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
- CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
- CLOSE_BEFORE_COPY_INSIDE_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
- CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
- CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
- CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
- CLOSE_BEFORE_TAGS - Static variable in class org.htmlcleaner.MathMLTagProvider
- closeAll(List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Close all unclosed tags if there are any.
- closedByChildBreak - Variable in class org.htmlcleaner.ChildBreaks
- closeSnippet(List, TagPos, Object, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Forced closing
- closeTag - Variable in class org.htmlcleaner.TagInfo
- CloseTag - Enum in org.htmlcleaner
- CloseTag(boolean, boolean) - Constructor for enum org.htmlcleaner.CloseTag
- CM - Static variable in class org.htmlcleaner.XPather
- col - Variable in class org.htmlcleaner.BaseTokenImpl
- collectNamespacePrefixesOnPath(Set<String>) - Method in class org.htmlcleaner.TagNode
-
Collect all prefixes in namespace declarations up the path to the document root from the specified node
- CommandLine - Class in org.htmlcleaner
-
Command line usage class.
- CommandLine() - Constructor for class org.htmlcleaner.CommandLine
- comment - Variable in class org.htmlcleaner.ProxyTagNode
- comment() - Method in class org.htmlcleaner.HtmlTokenizer
- CommentNode - Class in org.htmlcleaner
-
HTML comment token.
- CommentNode(String) - Constructor for class org.htmlcleaner.CommentNode
- CompactHtmlSerializer - Class in org.htmlcleaner
-
Compact HTML serializer - creates resulting HTML by stripping whitespaces wherever possible.
- CompactHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactHtmlSerializer
- CompactXmlSerializer - Class in org.htmlcleaner
-
Compact XML serializer - creates resulting XML by stripping whitespaces.
- CompactXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.CompactXmlSerializer
- compileUnicodePattern(String) - Static method in class org.htmlcleaner.Utils
- ConfigFileTagProvider - Class in org.htmlcleaner
-
Configuration file tag provider - reads XML file in specified format and creates a Tag Provider.
- ConfigFileTagProvider() - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(File) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(URL) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider(InputSource) - Constructor for class org.htmlcleaner.ConfigFileTagProvider
- ConfigFileTagProvider.ConfigParser - Class in org.htmlcleaner
-
SAX parser for tag configuration files.
- ConfigParser(Map) - Constructor for class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- containsEndCData() - Method in class org.htmlcleaner.HtmlTokenizer
- content - Variable in class org.htmlcleaner.CommentNode
- content - Variable in class org.htmlcleaner.ContentNode
- content() - Method in class org.htmlcleaner.HtmlTokenizer
- ContentNode - Class in org.htmlcleaner
-
HTML text token.
- ContentNode(String) - Constructor for class org.htmlcleaner.ContentNode
- contentType - Variable in class org.htmlcleaner.TagInfo
- ContentType - Enum in org.htmlcleaner
- ContentType(String) - Constructor for enum org.htmlcleaner.ContentType
- continueAfterTags - Variable in class org.htmlcleaner.TagInfo
- convert_To_Entity_Name(String, boolean, boolean, boolean, StringBuilder, int) - Static method in class org.htmlcleaner.Utils
- convertToUnicode(String, boolean, boolean, boolean, StringBuilder, int) - Static method in class org.htmlcleaner.Utils
- copyTags - Variable in class org.htmlcleaner.TagInfo
- CP - Static variable in class org.htmlcleaner.XPather
- createDocument(TagNode) - Method in class org.htmlcleaner.DomBuilder
- createDocument(TagNode) - Method in class org.htmlcleaner.DomSerializer
- createDocumentNodes(List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- createDOM(TagNode) - Method in class org.htmlcleaner.DomSerializer
- createDOM(TagNode) - Method in class org.htmlcleaner.TraversalDomSerializer
- createElement(TagNode) - Method in class org.htmlcleaner.JDomSerializer
- createJDom(TagNode) - Method in class org.htmlcleaner.JDomSerializer
- createSubnodes(Element, List<? extends BaseToken>) - Method in class org.htmlcleaner.JDomSerializer
- createSubnodes(Document, Element, List<? extends BaseToken>) - Method in class org.htmlcleaner.DomSerializer
-
Serialize a given HTML Cleaner node.
- createTagNode(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
- creatingHtmlDom - Variable in class org.htmlcleaner.XmlSerializer
- CS - Static variable in class org.htmlcleaner.XPather
- CSS_COMMENT_END - Static variable in class org.htmlcleaner.DomSerializer
- CSS_COMMENT_END - Static variable in class org.htmlcleaner.JDomSerializer
- CSS_COMMENT_START - Static variable in class org.htmlcleaner.DomBuilder
- CSS_COMMENT_START - Static variable in class org.htmlcleaner.DomSerializer
- CSS_COMMENT_START - Static variable in class org.htmlcleaner.JDomSerializer
D
- dbCode - Variable in enum org.htmlcleaner.BelongsTo
- dbCode - Variable in enum org.htmlcleaner.ContentType
- DECIMAL - Static variable in class org.htmlcleaner.Utils
- DEFAULT_CHARSET - Static variable in class org.htmlcleaner.CleanerProperties
- DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyHtmlSerializer
- DEFAULT_INDENTATION_STRING - Static variable in class org.htmlcleaner.PrettyXmlSerializer
- DefaultTagProvider - Class in org.htmlcleaner
-
This is the default tag provider for HTML Cleaner Note this is no longer generated from XML - see https://sourceforge.net/p/htmlcleaner/bugs/81/
- DefaultTagProvider() - Constructor for class org.htmlcleaner.DefaultTagProvider
- defineAllowedChildrenTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseBeforeCopyInsideTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseBeforeTags(String) - Method in class org.htmlcleaner.TagInfo
- defineCloseInsideCopyAfterTags(String) - Method in class org.htmlcleaner.TagInfo
- defineFatalTags(String) - Method in class org.htmlcleaner.TagInfo
- defineForbiddenTags(String) - Method in class org.htmlcleaner.TagInfo
- defineHigherLevelTags(String) - Method in class org.htmlcleaner.TagInfo
- defineNamespaceDeclarations(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
- defineRequiredEnclosingTags(String) - Method in class org.htmlcleaner.TagInfo
- dependencyName - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- deprecated - Variable in class org.htmlcleaner.TagInfo
- Deprecated - org.htmlcleaner.audit.ErrorType
-
The tag was deprecated and current cleaner mode doesn't allows this.
- deserializeCdataEntities - Variable in class org.htmlcleaner.DomBuilder
- deserializeCdataEntities - Variable in class org.htmlcleaner.DomSerializer
- deserializeCdataEntities - Variable in class org.htmlcleaner.TraversalDomSerializer
- deserializeCdataEntities(String) - Method in class org.htmlcleaner.DomBuilder
- deserializeCdataEntities(String) - Method in class org.htmlcleaner.DomSerializer
- deserializeEntities - Variable in class org.htmlcleaner.CleanerProperties
- deserializeEntities(String, boolean) - Static method in class org.htmlcleaner.Utils
- dest - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- destinationElement - Variable in class org.htmlcleaner.DomBuilder
- destTag - Variable in class org.htmlcleaner.TagTransformation
- display - Variable in class org.htmlcleaner.TagInfo
- Display - Enum in org.htmlcleaner
-
Most HTML 4 elements permitted within the BODY are classified as either block-level elements or inline elements.
- Display(boolean, boolean) - Constructor for enum org.htmlcleaner.Display
- doctype() - Method in class org.htmlcleaner.HtmlTokenizer
- docType - Variable in class org.htmlcleaner.TagNode
- DoctypeToken - Class in org.htmlcleaner
-
HTML doctype token.
- DoctypeToken(String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
- DoctypeToken(String, String, String, String, String) - Constructor for class org.htmlcleaner.DoctypeToken
- document - Variable in class org.htmlcleaner.DomBuilder
- DomBuilder - Class in org.htmlcleaner
- DomBuilder(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.DomBuilder
- DomSerializer - Class in org.htmlcleaner
-
DOM serializer - creates xml DOM.
- DomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.DomSerializer
- DomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.DomSerializer
- DomSerializer(CleanerProperties, boolean, boolean) - Constructor for class org.htmlcleaner.DomSerializer
- DomSerializer(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.DomSerializer
- dontEscape(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
- dontEscape(TagNode) - Method in class org.htmlcleaner.XmlSerializer
- dontEscape(Element) - Method in class org.htmlcleaner.DomSerializer
-
encapsulate content with <[CDATA[ ]]> for things like script and style elements
E
- editTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 edits tags (2 total)
- embeddedContentTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
- END_CDATA - Static variable in class org.htmlcleaner.CData
- endElement(String, String, String) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- endTagPermitted - Variable in enum org.htmlcleaner.CloseTag
- EndTagToken - Class in org.htmlcleaner
-
HTML tag end token.
- EndTagToken() - Constructor for class org.htmlcleaner.EndTagToken
- EndTagToken(String) - Constructor for class org.htmlcleaner.EndTagToken
- entities - Variable in class org.htmlcleaner.SpecialEntities
-
key is the
SpecialEntity.getKey()
( i.e. - entitiesByUnicodeCharcode - Variable in class org.htmlcleaner.SpecialEntities
-
Key is the Integer returned by
SpecialEntity.intValue()
- ErrorType - Enum in org.htmlcleaner.audit
-
Possible error codes (read messages) that cleaner uses to inform clients about reasons/actions that modification involves.
- ErrorType() - Constructor for enum org.htmlcleaner.audit.ErrorType
- escape - Variable in class org.htmlcleaner.SimpleHtmlSerializer
- escapedXmlString - Variable in class org.htmlcleaner.SpecialEntity
- escapeHtml(String, CleanerProperties) - Static method in class org.htmlcleaner.Utils
-
Escapes HTML string
- escapeText(String) - Method in class org.htmlcleaner.HtmlSerializer
- escapeXml - Variable in class org.htmlcleaner.DomBuilder
- escapeXml - Variable in class org.htmlcleaner.DomSerializer
-
Whether XML entities should be escaped or not.
- escapeXml - Variable in class org.htmlcleaner.JDomSerializer
- escapeXml - Variable in class org.htmlcleaner.TraversalDomSerializer
-
Whether XML entities should be escaped or not.
- escapeXml(String) - Method in class org.htmlcleaner.XmlSerializer
- escapeXml(String, boolean, boolean, boolean, boolean, boolean, boolean) - Static method in class org.htmlcleaner.Utils
-
change notes: 1) convert ascii characters encoded using x; format to the ascii characters -- may be an attempt to slip in malicious html 2) convert xx; format characters to " style representation if available for the character.
- escapeXml(String, boolean, boolean, boolean, boolean, boolean, boolean, boolean) - Static method in class org.htmlcleaner.Utils
-
change notes: 1) convert ascii characters encoded using x; format to the ascii characters -- may be an attempt to slip in malicious html 2) convert xx; format characters to " style representation if available for the character.
- escapeXml(String, CleanerProperties, boolean) - Static method in class org.htmlcleaner.Utils
-
Escapes XML string.
- evaluateAgainst(Collection, int, int, boolean, int, int, boolean, Collection) - Method in class org.htmlcleaner.XPather
- evaluateAgainstNode(TagNode) - Method in class org.htmlcleaner.XPather
-
Main public method for this class - a way to execute XPath expression against specified TagNode instance.
- evaluateFunction(Collection, int, int, int, int, boolean) - Method in class org.htmlcleaner.XPather
-
Evaluates specified function.
- evaluateLogic(Collection, Collection, String) - Method in class org.htmlcleaner.XPather
-
Evaluates logic operation on two collections.
- evaluateTemplate(String, Map<String, String>) - Method in class org.htmlcleaner.TagTransformation
-
Evaluates string template for specified map of variables.
- evaluateXPath(String) - Method in class org.htmlcleaner.TagNode
-
Evaluates XPath expression on give node.
- execute() - Method in class org.htmlcleaner.HtmlCleanerForAnt
-
Implementation of Ant task execution.
- extractCharCode(String, int, boolean, StringBuilder) - Static method in class org.htmlcleaner.Utils
-
(earlier code was failing on this) - A; is converted by FF to 3 characters: + 'A' + ';' x138A; is converted by FF to 6? 7? characters: 'x'+'1'+'3'+ '8' + 'A' + ';' #0 is displayed kind of weird ᎊ is a single character
F
- factory - Variable in class org.htmlcleaner.JDomSerializer
- FatalTagMissing - org.htmlcleaner.audit.ErrorType
-
Tag which existence is critical for the current is missing.
- fatalTags - Variable in class org.htmlcleaner.TagInfo
- filterByCondition(Collection, int, int) - Method in class org.htmlcleaner.XPather
-
Filter nodes satisfying the condition
- findClosingIndex(int, int) - Method in class org.htmlcleaner.XPather
- findElement(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
-
Finds first element in the tree that satisfy specified condition.
- findElementByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- findElementByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- findElementHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- findFirstTagPos() - Method in class org.htmlcleaner.OpenTags
- findMatchingTagNodes(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
-
Get all elements in the tree that satisfy specified condition.
- findTag(String, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
- findTagToPlaceRubbish() - Method in class org.htmlcleaner.OpenTags
- fireConditionModification(ITagNodeCondition, TagNode) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
-
Fired when cleaner modifies html due to
ITagNodeCondition
match. - fireConditionModification(ITagNodeCondition, TagNode) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
- fireConditionModification(ITagNodeCondition, TagNode) - Method in class org.htmlcleaner.CleanerProperties
- fireHtmlError(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
-
Fired when cleaner fixes some error in html syntax.
- fireHtmlError(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
- fireHtmlError(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
- fireUglyHtml(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
-
Fired when cleaner fixes ugly html -- when syntax was correct but task was implemented by weird code.
- fireUglyHtml(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
- fireUglyHtml(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
- fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in interface org.htmlcleaner.audit.HtmlModificationListener
-
Fired when cleaner modifies html due to user specified rules.
- fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.audit.HtmlModificationListenerLogger
- fireUserDefinedModification(boolean, TagNode, ErrorType) - Method in class org.htmlcleaner.CleanerProperties
- flatten(int, int) - Method in class org.htmlcleaner.XPather
- flattenNestedList(List) - Method in class org.htmlcleaner.HtmlCleaner
-
Flattens a list of tagnodes
- FLOW_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
-
Most elements that are used in the body of documents and applications are categorized as flow content.
- forbidden - org.htmlcleaner.CloseTag
-
is not permitted
- foreignMarkupFlagSet - Variable in class org.htmlcleaner.TagNode
-
This flag is set if foreignMarkup is set; if it is false it means that the tagnode tree has not been built and so it isn't known whether this node is a HTML node or foreign markup such as SVG.
- formattingElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- formElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- formTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 forms tags (15 total)
- fullUrl(String, String) - Static method in class org.htmlcleaner.Utils
-
Calculates full URL for specified page URL and link which could be full, absolute or relative like there can be found in A or IMG tags.
G
- generateCode - Variable in class org.htmlcleaner.ConfigFileTagProvider
- getAllChildren() - Method in class org.htmlcleaner.TagNode
- getAllElements(boolean) - Method in class org.htmlcleaner.TagNode
- getAllElementsList(boolean) - Method in class org.htmlcleaner.TagNode
- getAllowTags() - Method in class org.htmlcleaner.CleanerProperties
- getAllowTagSet() - Method in class org.htmlcleaner.CleanerProperties
- getAllowTagSet(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- getAllTags(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- getAmpNcr() - Static method in class org.htmlcleaner.Utils
- getArgValue(String[], String, String) - Static method in class org.htmlcleaner.CommandLine
- getAsString(String) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, boolean) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, String) - Method in class org.htmlcleaner.Serializer
- getAsString(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
- getAssumedNamespace() - Method in class org.htmlcleaner.TagInfo
- getAssumedNamespacePrefix() - Method in class org.htmlcleaner.TagInfo
- getAttributeByName(String) - Method in class org.htmlcleaner.TagNode
- getAttributes() - Method in class org.htmlcleaner.TagNode
-
Returns the attributes of the tagnode.
- getAttributesInLowerCase() - Method in class org.htmlcleaner.TagNode
-
Returns the attributes of the tagnode in lower case.
- getAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
- getBelongsTo() - Method in class org.htmlcleaner.TagInfo
- getBooleanAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
- getCharset() - Method in class org.htmlcleaner.CleanerProperties
- getChildBreaks() - Method in class org.htmlcleaner.NestingState
- getChildBreaks(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- getChildIndex(HtmlNode) - Method in class org.htmlcleaner.TagNode
- getChildren() - Method in class org.htmlcleaner.TagNode
-
Deprecated.use
TagNode.getChildTagList()
, will be refactored and possibly removed in future versions. TODO This method should be refactored because is does not properly match the commonly used Java's getter/setter strategy. - getChildTagList() - Method in class org.htmlcleaner.TagNode
- getChildTags() - Method in class org.htmlcleaner.TagInfo
- getChildTags() - Method in class org.htmlcleaner.TagNode
- getCleanerTransformations() - Method in class org.htmlcleaner.CleanerProperties
- getCol() - Method in interface org.htmlcleaner.BaseToken
- getCol() - Method in class org.htmlcleaner.BaseTokenImpl
- getCommentedContent() - Method in class org.htmlcleaner.CommentNode
- getContent() - Method in class org.htmlcleaner.CData
- getContent() - Method in class org.htmlcleaner.CommentNode
- getContent() - Method in class org.htmlcleaner.ContentNode
- getContent() - Method in class org.htmlcleaner.DoctypeToken
- getContent() - Method in class org.htmlcleaner.ProxyTagNode
- getContentType() - Method in class org.htmlcleaner.TagInfo
- getContentWithoutStartAndEndTokens() - Method in class org.htmlcleaner.CData
- getContentWithStartAndEndTokens() - Method in class org.htmlcleaner.CData
- getContinueAfterTags() - Method in class org.htmlcleaner.TagInfo
- getCopyTags() - Method in class org.htmlcleaner.TagInfo
- getDbCode() - Method in enum org.htmlcleaner.BelongsTo
- getDbCode() - Method in enum org.htmlcleaner.ContentType
- getDecimalNCR() - Method in class org.htmlcleaner.SpecialEntity
- getDestTag() - Method in class org.htmlcleaner.TagTransformation
- getDisplay() - Method in class org.htmlcleaner.TagInfo
- getDocType() - Method in class org.htmlcleaner.HtmlTokenizer
- getDocType() - Method in class org.htmlcleaner.TagNode
- getDocument() - Method in class org.htmlcleaner.DomBuilder
- getElementList(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
-
Get all elements in the tree that satisfy specified condition.
- getElementListByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- getElementListByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElementListHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElements(ITagNodeCondition, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByAttValue(String, String, boolean, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByName(String, boolean) - Method in class org.htmlcleaner.TagNode
- getElementsByName(Collection, int, int, boolean, boolean) - Method in class org.htmlcleaner.XPather
-
For the given source collection and specified name, returns collection of subnodes or attribute values.
- getElementsHavingAttribute(String, boolean) - Method in class org.htmlcleaner.TagNode
- getEscaped(boolean) - Method in class org.htmlcleaner.SpecialEntity
- getEscapedValue() - Method in class org.htmlcleaner.SpecialEntity
- getEscapedXmlString() - Method in class org.htmlcleaner.SpecialEntity
- getFatalTags() - Method in class org.htmlcleaner.TagInfo
- getHexNCR() - Method in class org.htmlcleaner.SpecialEntity
- getHigherTags() - Method in class org.htmlcleaner.TagInfo
- getHtmlString() - Method in class org.htmlcleaner.SpecialEntity
- getHtmlVersion() - Method in class org.htmlcleaner.CleanerProperties
-
Return the html version
- getHyphenReplacementInComment() - Method in class org.htmlcleaner.CleanerProperties
- getIndent(int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getIndent(int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getIndentedText(String, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getInnerHtml(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
-
For the specified node, returns it's content as string.
- getInvalidAttributeNamePrefix() - Method in class org.htmlcleaner.HtmlCleanerForAnt
- getInvalidXmlAttributeNamePrefix() - Method in class org.htmlcleaner.CleanerProperties
-
Get the prefix to use to try to make valid attribute names
- getItemsToMove() - Method in class org.htmlcleaner.TagNode
- getKey() - Method in class org.htmlcleaner.SpecialEntity
- getLastBreakingTag() - Method in class org.htmlcleaner.ChildBreaks
- getLastBreakingTagPosition() - Method in class org.htmlcleaner.ChildBreaks
- getLastTagPos() - Method in class org.htmlcleaner.OpenTags
- getMaxDepth() - Method in class org.htmlcleaner.CleanerProperties
- getMaxEntityLength() - Method in class org.htmlcleaner.SpecialEntities
- getMustCloseTags() - Method in class org.htmlcleaner.TagInfo
- getName() - Method in class org.htmlcleaner.DoctypeToken
- getName() - Method in class org.htmlcleaner.TagInfo
- getName() - Method in class org.htmlcleaner.TagNode
- getName() - Method in class org.htmlcleaner.TagToken
- getNamespaceDeclarations() - Method in class org.htmlcleaner.TagNode
- getNamespacePrefixes() - Method in class org.htmlcleaner.HtmlTokenizer
- getNamespaceURIOnPath(String) - Method in class org.htmlcleaner.TagNode
- getOpenTags() - Method in class org.htmlcleaner.NestingState
- getOpenTags(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- getParent() - Method in class org.htmlcleaner.BaseHtmlNode
- getParent() - Method in interface org.htmlcleaner.HtmlNode
- getParent() - Method in class org.htmlcleaner.ProxyTagNode
- getPart1() - Method in class org.htmlcleaner.DoctypeToken
- getPart2() - Method in class org.htmlcleaner.DoctypeToken
- getPart3() - Method in class org.htmlcleaner.DoctypeToken
-
Deprecated.
- getPart4() - Method in class org.htmlcleaner.DoctypeToken
-
Deprecated.
- getPermittedTags() - Method in class org.htmlcleaner.TagInfo
- getPreferredChildTag() - Method in class org.htmlcleaner.TagInfo
- getProperties() - Method in class org.htmlcleaner.HtmlCleaner
- getPruneTags() - Method in class org.htmlcleaner.CleanerProperties
- getPruneTagSet() - Method in class org.htmlcleaner.CleanerProperties
- getPruneTagSet(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- getPublicId() - Method in class org.htmlcleaner.DoctypeToken
-
This will retrieve the public ID of an externally referenced DTD, or an empty String if none is referenced.
- getRequiredParentTags() - Method in class org.htmlcleaner.TagInfo
- getRow() - Method in interface org.htmlcleaner.BaseToken
- getRow() - Method in class org.htmlcleaner.BaseTokenImpl
- getSiblings() - Method in class org.htmlcleaner.BaseHtmlNode
- getSiblings() - Method in interface org.htmlcleaner.HtmlNode
- getSingleLineOfChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- getSingleLineOfChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.PrettyXmlSerializer
- getSourceTag() - Method in class org.htmlcleaner.TagTransformation
- getSpecialEntity(String) - Method in class org.htmlcleaner.SpecialEntities
- getSpecialEntityByUnicode(int) - Method in class org.htmlcleaner.SpecialEntities
- getSwitchArgument(String[], String) - Static method in class org.htmlcleaner.CommandLine
-
If the specified argument name exists without a value, return true.
- getSystemId() - Method in class org.htmlcleaner.DoctypeToken
-
This will retrieve the system ID of an externally referenced DTD, or an empty String if none is referenced.
- getTagInfo(String) - Method in class org.htmlcleaner.ConfigFileTagProvider
- getTagInfo(String) - Method in class org.htmlcleaner.DefaultTagProvider
- getTagInfo(String) - Method in class org.htmlcleaner.Html4TagProvider
- getTagInfo(String) - Method in class org.htmlcleaner.Html5TagProvider
-
It returns the tag information.
- getTagInfo(String) - Method in interface org.htmlcleaner.ITagInfoProvider
- getTagInfo(String, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- getTagInfo(String, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Returns a TagInfo object for the specified tag name.
- getTagInfoProvider() - Method in class org.htmlcleaner.CleanerProperties
- getTagInfoProvider() - Method in class org.htmlcleaner.HtmlCleaner
- getTagName(String) - Method in class org.htmlcleaner.CleanerTransformations
- getTemplate() - Method in interface org.htmlcleaner.AttributeTransformation
- getTemplate() - Method in class org.htmlcleaner.AttributeTransformationPatternImpl
- getText() - Method in class org.htmlcleaner.TagNode
- getToken() - Method in class org.htmlcleaner.ProxyTagNode
- getTokenList() - Method in class org.htmlcleaner.HtmlTokenizer
- getTransformation(String) - Method in class org.htmlcleaner.CleanerTransformations
- getTransformations() - Method in class org.htmlcleaner.HtmlCleaner
- getType() - Method in class org.htmlcleaner.DoctypeToken
-
This will retrieve an integer representing the identified DocType
- getUseCdataFor() - Method in class org.htmlcleaner.CleanerProperties
- getXmlAsString(TagNode) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use getAsString() instead.
- getXmlAsString(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use getAsString() instead.
- getXmlName(String) - Static method in class org.htmlcleaner.Utils
- getXmlNSPrefix(String) - Static method in class org.htmlcleaner.Utils
- getXmlVersion() - Method in class org.htmlcleaner.DomSerializer
- globalTransformations - Variable in class org.htmlcleaner.CleanerTransformations
- go() - Method in class org.htmlcleaner.HtmlTokenizer
- go(int) - Method in class org.htmlcleaner.HtmlTokenizer
- greek - Variable in class org.htmlcleaner.SpecialEntities
- groupingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 grouping tags (14 total)
H
- handleEndTagToken(BaseToken, ListIterator<BaseToken>, List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Process rules for a new end tag token in the HTML tree.
- handleInterruption() - Method in class org.htmlcleaner.HtmlCleaner
-
Called whenever the thread is interrupted.
- handleInterruption() - Method in class org.htmlcleaner.HtmlTokenizer
-
Called whenver the thread is interrupted.
- handleInterruption() - Method in class org.htmlcleaner.TagNode
-
Called whenver the thread is interrupted.
- handleStartTagToken(BaseToken, ListIterator<BaseToken>, List, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Processes all the rules associated with a new opening tag in the HTML tree
- hasAttribute(String) - Method in class org.htmlcleaner.TagNode
-
Checks existence of specified attribute.
- hasAttributeTransformations() - Method in class org.htmlcleaner.TagTransformation
- hasChildren() - Method in class org.htmlcleaner.TagNode
- hasCopyTags() - Method in class org.htmlcleaner.TagInfo
- hasIdAttributeSet(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- hasPermittedTags() - Method in class org.htmlcleaner.TagInfo
- hasTransformationForTag(String) - Method in class org.htmlcleaner.CleanerTransformations
- head(HtmlNode, int) - Method in class org.htmlcleaner.DomBuilder
- head(HtmlNode, int) - Method in interface org.htmlcleaner.XmlVisitor
-
Callback for when a node is first visited.
- HEAD - org.htmlcleaner.BelongsTo
- HEAD_AND_BODY - org.htmlcleaner.BelongsTo
- HeadlessTagNode(TagNode) - Constructor for class org.htmlcleaner.Serializer.HeadlessTagNode
- headNode - Variable in class org.htmlcleaner.CleanTimeValues
- HEX_RELAXED - Static variable in class org.htmlcleaner.Utils
- HEX_STRICT - Static variable in class org.htmlcleaner.Utils
- higherTags - Variable in class org.htmlcleaner.TagInfo
- HTML_4 - Static variable in class org.htmlcleaner.HtmlCleaner
- HTML_5 - Static variable in class org.htmlcleaner.HtmlCleaner
- HTML4_0 - Static variable in class org.htmlcleaner.DoctypeToken
- HTML4_01 - Static variable in class org.htmlcleaner.DoctypeToken
- HTML4_01_FRAMESET - Static variable in class org.htmlcleaner.DoctypeToken
- HTML4_01_STRICT - Static variable in class org.htmlcleaner.DoctypeToken
- HTML4_01_TRANSITIONAL - Static variable in class org.htmlcleaner.DoctypeToken
- Html4TagProvider - Class in org.htmlcleaner
- Html4TagProvider() - Constructor for class org.htmlcleaner.Html4TagProvider
- HTML5 - Static variable in class org.htmlcleaner.DoctypeToken
- HTML5_LEGACY_TOOL_COMPATIBLE - Static variable in class org.htmlcleaner.DoctypeToken
- Html5TagProvider - Class in org.htmlcleaner
- Html5TagProvider() - Constructor for class org.htmlcleaner.Html5TagProvider
- htmlCleaner - Variable in class org.htmlcleaner.OpenTags
- HtmlCleaner - Class in org.htmlcleaner
-
Main HtmlCleaner class.
- HtmlCleaner() - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates cleaner instance with default tag info provider,default version and default properties.
- HtmlCleaner(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with default tag info provider and specified properties
- HtmlCleaner(ITagInfoProvider) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with specified tag info provider and default properties
- HtmlCleaner(ITagInfoProvider, CleanerProperties) - Constructor for class org.htmlcleaner.HtmlCleaner
-
Constructor - creates the instance with specified tag info provider and specified properties
- HtmlCleanerException - Exception in org.htmlcleaner
-
General HtmlCleaner runtime exception.
- HtmlCleanerException() - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(String) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(String, Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerException(Throwable) - Constructor for exception org.htmlcleaner.HtmlCleanerException
- HtmlCleanerForAnt - Class in org.htmlcleaner
-
Support for ANT.
- HtmlCleanerForAnt() - Constructor for class org.htmlcleaner.HtmlCleanerForAnt
- HtmlModificationListener - Interface in org.htmlcleaner.audit
-
Implementors can be registered on
CleanerProperties
to receive notifications about modifications made by html cleaner. - HtmlModificationListenerLogger - Class in org.htmlcleaner.audit
- HtmlModificationListenerLogger(Logger) - Constructor for class org.htmlcleaner.audit.HtmlModificationListenerLogger
- htmlModificationListeners - Variable in class org.htmlcleaner.CleanerProperties
- htmlNode - Variable in class org.htmlcleaner.CleanTimeValues
- HtmlNode - Interface in org.htmlcleaner
-
Marker interface denoting nodes of the document tree
- HtmlSerializer - Class in org.htmlcleaner
-
Abstract HTML serializer - contains common logic for descendants.
- HtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.HtmlSerializer
- htmlSpecialEntity - Variable in class org.htmlcleaner.SpecialEntity
- htmlString - Variable in class org.htmlcleaner.SpecialEntity
- HtmlTokenizer - Class in org.htmlcleaner
-
Main HTML tokenizer.
- HtmlTokenizer(HtmlCleaner, Reader, CleanTimeValues) - Constructor for class org.htmlcleaner.HtmlTokenizer
-
Constructor - creates instance of the parser with specified content.
- htmlVersion - Variable in class org.htmlcleaner.CleanerProperties
- hyphenreplacement - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- hyphenReplacementInComment - Variable in class org.htmlcleaner.CleanerProperties
I
- ID_ATTRIBUTE_NAME - Static variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- identifier(boolean) - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses an identifier from the current position.
- ignoreqe - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- ignoreQuestAndExclam - Variable in class org.htmlcleaner.CleanerProperties
- ignoreUntil(char) - Method in class org.htmlcleaner.HtmlTokenizer
- imgElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- incharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- indents - Variable in class org.htmlcleaner.PrettyHtmlSerializer
- indents - Variable in class org.htmlcleaner.PrettyXmlSerializer
- indentString - Variable in class org.htmlcleaner.PrettyHtmlSerializer
- indentString - Variable in class org.htmlcleaner.PrettyXmlSerializer
- info - Variable in class org.htmlcleaner.TagPos
- initCleanerTransformations(Map) - Method in class org.htmlcleaner.HtmlCleaner
- inline - org.htmlcleaner.Display
-
Inline elements typically may only contain text and other inline elements.
- insertChild(int, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node at specified position in array of children
- insertChildAfter(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node in the list of children after specified child
- insertChildBefore(HtmlNode, HtmlNode) - Method in class org.htmlcleaner.TagNode
-
Inserts specified node in the list of children before specified child
- INSTANCE - Static variable in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
- INSTANCE - Static variable in class org.htmlcleaner.DefaultTagProvider
- INSTANCE - Static variable in class org.htmlcleaner.Html4TagProvider
- INSTANCE - Static variable in class org.htmlcleaner.Html5TagProvider
- INSTANCE - Static variable in class org.htmlcleaner.SpecialEntities
- INSTANCE2 - Variable in class org.htmlcleaner.Html5TagProvider
- intCode - Variable in class org.htmlcleaner.SpecialEntity
- interactiveTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 Interactive tags (4 total)
- intValue() - Method in class org.htmlcleaner.SpecialEntity
- invalidAttributeNamePrefix - Variable in class org.htmlcleaner.CleanerProperties
- invalidAttributeNamePrefix - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- isAddNewlineToHeadAndBody() - Method in class org.htmlcleaner.CleanerProperties
- isAdvancedXmlEscape() - Method in class org.htmlcleaner.CleanerProperties
- isAfterTagLineBreakNeeded() - Method in enum org.htmlcleaner.Display
- isAllowedAsForeignMarkup(String, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Checks whether we can allow a tag as "foreign markup".
- isAllowedInLastOpenTag(BaseToken, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- isAllowHtmlInsideAttributes() - Method in class org.htmlcleaner.CleanerProperties
- isAllowInvalidAttributeNames() - Method in class org.htmlcleaner.CleanerProperties
-
If false, when outputting XML, if an attribute name is not valid, attempt to fix it by using a prefix and removing invalid characters.
- isAllowInvalidAttributeNames() - Method in class org.htmlcleaner.HtmlCleanerForAnt
- isAllowMultiWordAttributes() - Method in class org.htmlcleaner.CleanerProperties
- isAllRead() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if end of the content is reached.
- isAtt(String) - Method in class org.htmlcleaner.XPather
-
Checks if token is attribute (starts with @)
- isAutoGenerated() - Method in class org.htmlcleaner.TagNode
- isBlank() - Method in class org.htmlcleaner.ContentNode
- isBrNode(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
- isCaseSensitive - Variable in class org.htmlcleaner.conditional.TagNodeAttValueCondition
- isChar(char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position is equal to specified char.
- isChar(int, char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position is equal to specified char.
- isContentOrInline(Object) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
- isContinueAfter(String) - Method in class org.htmlcleaner.TagInfo
- isCopiedTokenEqualToNextThreeCopiedTokens(TagNode, ListIterator<BaseToken>) - Static method in class org.htmlcleaner.HtmlCleaner
-
Determines if a copied token is equal to the next 3 tokens in the iterator.
- isCopy - Variable in class org.htmlcleaner.TagNode
-
Indicates that the node is a copy of another node.
- isCopy() - Method in class org.htmlcleaner.TagNode
- isCopy(String) - Method in class org.htmlcleaner.TagInfo
- isCreatingHtmlDom() - Method in class org.htmlcleaner.XmlSerializer
- isDeprecated() - Method in class org.htmlcleaner.TagInfo
- isDeserializeEntities() - Method in class org.htmlcleaner.CleanerProperties
- isElementIdentifierStartChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position can be identifier start.
- isEmpty() - Method in class org.htmlcleaner.ChildBreaks
- isEmpty() - Method in class org.htmlcleaner.OpenTags
- isEmpty() - Method in class org.htmlcleaner.TagNode
- isEmptyString(Object) - Static method in class org.htmlcleaner.Utils
- isEmptyTag() - Method in class org.htmlcleaner.TagInfo
- isEndTagPermitted() - Method in enum org.htmlcleaner.CloseTag
- isFatalTag(String) - Method in class org.htmlcleaner.TagInfo
- isFatalTagSatisfied(TagInfo, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Checks if open fatal tag is missing if there is a fatal tag for the specified tag.
- isForbiddenAttribute(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
-
Override to add additional conditions.
- isForbiddenTag(TagNode) - Method in class org.htmlcleaner.XmlSerializer
- isForeignMarkup - Variable in class org.htmlcleaner.TagNode
-
This flag is set if we are using namespace aware setting, and the tagnode belongs to a non-HTML namespace.
- isForeignMarkup() - Method in class org.htmlcleaner.TagNode
- isFormed - Variable in class org.htmlcleaner.TagNode
- isFormed() - Method in class org.htmlcleaner.TagNode
- isFullUrl(String) - Static method in class org.htmlcleaner.Utils
-
Checks if specified link is full URL.
- isFunctionCall(int, int) - Method in class org.htmlcleaner.XPather
-
Checks if tokens in specified range represents valid function call.
- isHeadAndBodyTag() - Method in class org.htmlcleaner.TagInfo
- isHeadOrBody(String) - Method in class org.htmlcleaner.XmlSerializer
- isHeadTag() - Method in class org.htmlcleaner.TagInfo
- isHigher(String) - Method in class org.htmlcleaner.TagInfo
- isHtmlAttributeIdentifierChar() - Method in class org.htmlcleaner.HtmlTokenizer
- isHtmlAttributeIdentifierChar(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Check whether the character at the specified position in the stream is a valid character for part of an attribute identifier in HTML
- isHtmlAttributeIdentifierStartChar() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position can be identifier start.
- isHtmlElementIdentifier() - Method in class org.htmlcleaner.HtmlTokenizer
- isHtmlElementIdentifier(int) - Method in class org.htmlcleaner.HtmlTokenizer
- isHtmlSpecialEntity() - Method in class org.htmlcleaner.SpecialEntity
- isIdentifier(String) - Method in class org.htmlcleaner.XPather
-
Checks if given string is valid identifier.
- isIgnoreQuestAndExclam() - Method in class org.htmlcleaner.CleanerProperties
- isKeepWhitespaceAndCommentsInHead() - Method in class org.htmlcleaner.CleanerProperties
- isLeadingAndEndWhitespacesAllowed() - Method in enum org.htmlcleaner.Display
- isMinimizedTagPermitted() - Method in enum org.htmlcleaner.CloseTag
- isMinimizedTagPermitted() - Method in class org.htmlcleaner.TagInfo
- isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.HtmlSerializer
- isMinimizedTagSyntax(TagNode) - Method in class org.htmlcleaner.XmlSerializer
- isMustCloseTag(TagInfo) - Method in class org.htmlcleaner.TagInfo
- isNamespacesAware() - Method in class org.htmlcleaner.CleanerProperties
- isOmitCdataOutsideScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
- isOmitComments() - Method in class org.htmlcleaner.CleanerProperties
- isOmitDeprecatedTags() - Method in class org.htmlcleaner.CleanerProperties
- isOmitDoctypeDeclaration() - Method in class org.htmlcleaner.CleanerProperties
- isOmitHtmlEnvelope() - Method in class org.htmlcleaner.CleanerProperties
- isOmitUnknownTags() - Method in class org.htmlcleaner.CleanerProperties
- isOmitXmlDeclaration() - Method in class org.htmlcleaner.CleanerProperties
- isPreserveSourceAttributes() - Method in class org.htmlcleaner.TagTransformation
- isPruned() - Method in class org.htmlcleaner.TagNode
- isRecognizeUnicodeChars() - Method in class org.htmlcleaner.CleanerProperties
- isRemovingNodeReasonablySafe(TagNode) - Method in class org.htmlcleaner.HtmlCleaner
- isReservedTag(String) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if specified tag name is one of the reserved tags: HTML, HEAD or BODY
- isScriptOrStyle(TagNode) - Method in class org.htmlcleaner.Serializer
- isScriptOrStyle(Element) - Method in class org.htmlcleaner.DomSerializer
- isStartToken(Object) - Method in class org.htmlcleaner.HtmlCleaner
- isTagStartOrEnd() - Method in class org.htmlcleaner.HtmlTokenizer
-
Not all '<' (lt) symbols mean tag start or end.
- isToken(String, int) - Method in class org.htmlcleaner.XPather
- isTranslateSpecialEntities() - Method in class org.htmlcleaner.CleanerProperties
- isTransResCharsToNCR() - Method in class org.htmlcleaner.CleanerProperties
- isTransSpecialEntitiesToNCR() - Method in class org.htmlcleaner.CleanerProperties
- isTreatDeprecatedTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
- isTreatUnknownTagsAsContent() - Method in class org.htmlcleaner.CleanerProperties
- isTrimAttributeValues - Variable in class org.htmlcleaner.TagNode
-
This flag is set if attribute values should be trimmed.
- isTrimAttributeValues() - Method in class org.htmlcleaner.CleanerProperties
- isTrimAttributeValues() - Method in class org.htmlcleaner.TagNode
- isUnique() - Method in class org.htmlcleaner.TagInfo
- isUseCdataFor(String) - Method in class org.htmlcleaner.CleanerProperties
- isUseCdataForScriptAndStyle() - Method in class org.htmlcleaner.CleanerProperties
- isUseEmptyElementTags() - Method in class org.htmlcleaner.CleanerProperties
- isValid() - Method in class org.htmlcleaner.DoctypeToken
- isValidDouble(String) - Method in class org.htmlcleaner.XPather
- isValidHtmlAttributeName(String) - Static method in class org.htmlcleaner.Utils
- isValidInt(String, int) - Static method in class org.htmlcleaner.Utils
- isValidInteger(String) - Static method in class org.htmlcleaner.XPather
- isValidXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
-
Checks whether specified string can be valid tag name or attribute name in xml.
- isValidXmlIdentifierStartChar(String) - Static method in class org.htmlcleaner.Utils
-
Determines whether the initial character of an identifier is valid for XML
- isWhitespace() - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at current runtime position is whitespace.
- isWhitespace(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if character at specified position is whitespace.
- isWhitespaceString(Object) - Method in class org.htmlcleaner.CompactXmlSerializer
-
Checks whether specified object's string representation is empty string (containing of only whitespaces).
- isWhitespaceString(Object) - Static method in class org.htmlcleaner.Utils
-
Checks whether specified object's string representation is empty string (containing of only whitespaces).
- isXmlReservedCharacter(String) - Static method in class org.htmlcleaner.Utils
- ITagInfoProvider - Interface in org.htmlcleaner
-
Provides set of TagInfo instances.
- ITagNodeCondition - Interface in org.htmlcleaner.conditional
-
Used as base for different node checkers.
- itemsToMove - Variable in class org.htmlcleaner.TagNode
J
- JDomSerializer - Class in org.htmlcleaner
-
JDom serializer - creates xml JDom instance out of the TagNode.
- JDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.JDomSerializer
- JDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.JDomSerializer
K
- keepWhitespaceAndCommentsInHead - Variable in class org.htmlcleaner.CleanerProperties
-
Tries to keep inside head all whitespace and comments that were originally there
- key - Variable in class org.htmlcleaner.SpecialEntity
L
- last - Variable in class org.htmlcleaner.OpenTags
- layoutElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- lchomp(String) - Static method in class org.htmlcleaner.Utils
-
Removes the first newline (if present) of a string
- leadingAndEndWhitespacesAllowed - Variable in enum org.htmlcleaner.Display
- LINE_BREAK - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
- linkElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- list - Variable in class org.htmlcleaner.OpenTags
- listElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- log - Variable in class org.htmlcleaner.audit.HtmlModificationListenerLogger
- logger - Static variable in class org.htmlcleaner.CommandLine
- ltrim(String) - Method in class org.htmlcleaner.PrettyXmlSerializer
-
Trims specified string from left.
- ltrim(String) - Static method in class org.htmlcleaner.Utils
-
Trims specified string from left.
M
- main(String[]) - Static method in class org.htmlcleaner.CommandLine
- main(String[]) - Static method in class org.htmlcleaner.ConfigFileTagProvider
-
Generates code for tag provider class from specified configuration XML file.
- makeCopy() - Method in class org.htmlcleaner.TagNode
- makeTree(List, ListIterator<BaseToken>, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
This method generally mutates flattened list of tokens into tree structure.
- mappings - Variable in class org.htmlcleaner.CleanerTransformations
- MARKER_ATTRIBUTE - Static variable in class org.htmlcleaner.HtmlCleaner
-
Marker attribute added to aid with part of the cleaning process.
- markNodesToPrune(List, CleanTimeValues, int) - Method in class org.htmlcleaner.HtmlCleaner
- math - Variable in class org.htmlcleaner.SpecialEntities
- MathMLTagProvider - Class in org.htmlcleaner
-
It contains the MathML tags to use with Html5 tags
- MathMLTagProvider(TagInfo, ConcurrentMap<String, TagInfo>) - Constructor for class org.htmlcleaner.MathMLTagProvider
- maxDepth - Variable in class org.htmlcleaner.CleanerProperties
-
Provides an arbitrary recursion depth
- maxEntityLength - Variable in class org.htmlcleaner.SpecialEntities
- MEDIA_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
-
HTML5 Media Tags
- MEDIA_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
-
HTML5 Media Tags
- mediaTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
Html5 media-embedded tags (12 tags)
- metadataTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
HTML5 Document metadata tags
- minimizedTagPermitted - Variable in enum org.htmlcleaner.CloseTag
- mustAddRequiredParent(TagInfo, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
-
Check if specified tag requires parent tag, but that parent tag is missing in the appropriate context.
- mustCloseTags - Variable in class org.htmlcleaner.TagInfo
N
- name - Variable in class org.htmlcleaner.conditional.TagNodeNameCondition
- name - Variable in class org.htmlcleaner.TagInfo
- name - Variable in class org.htmlcleaner.TagPos
- name - Variable in class org.htmlcleaner.TagToken
- namespace - Variable in class org.htmlcleaner.CleanTimeValues
-
A stack of namespaces for currently open tags.
- namespaceMap - Variable in class org.htmlcleaner.CleanTimeValues
-
A map of all the namespace prefixes and URIs declared within the document.
- namespacesaware - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- namespacesAware - Variable in class org.htmlcleaner.CleanerProperties
- NestingState - Class in org.htmlcleaner
-
Nesting State Wrapper for a current HtmlCleaner cleaning state, keeping together the set of open tags and breaks in the current state.
- NestingState(OpenTags, ChildBreaks) - Constructor for class org.htmlcleaner.NestingState
- nestingStates - Variable in class org.htmlcleaner.CleanTimeValues
- NEW_LINE - Static variable in class org.htmlcleaner.DomSerializer
- NEW_LINE - Static variable in class org.htmlcleaner.JDomSerializer
- newTagNode(String) - Method in class org.htmlcleaner.HtmlCleaner
- nodebyxpath - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- NON_BREAKABLE_SPACE - Static variable in class org.htmlcleaner.SpecialEntities
- none - org.htmlcleaner.ContentType
-
elements that have no children or content ( for example
).
- none - org.htmlcleaner.Display
-
Elements that are not actually inline or block, usually such elements are not rendered at all.
- NotAllowedTag - org.htmlcleaner.audit.ErrorType
-
The tag wasn't found on list of allowed tags, thus it was removed.
- nsDeclarations - Variable in class org.htmlcleaner.TagNode
O
- olderElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- omit - org.htmlcleaner.OptionalOutput
-
Never outputed even if supplied in the source.
- omitCdataOutsideScriptAndStyle - Variable in class org.htmlcleaner.CleanerProperties
- omitcomments - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitComments - Variable in class org.htmlcleaner.CleanerProperties
- omitDeprecatedTags - Variable in class org.htmlcleaner.CleanerProperties
- omitdeprtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitdoctypedecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitDoctypeDeclaration - Variable in class org.htmlcleaner.CleanerProperties
- omithtmlenvelope - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitHtmlEnvelope - Variable in class org.htmlcleaner.CleanerProperties
- omitunknowntags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- omitUnknownTags - Variable in class org.htmlcleaner.CleanerProperties
- omitxmldecl - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- OMITXMLDECL - Static variable in class org.htmlcleaner.CommandLine
- omitXmlDeclaration - Variable in class org.htmlcleaner.CleanerProperties
- openPreTags - Variable in class org.htmlcleaner.CompactHtmlSerializer
- openTags - Variable in class org.htmlcleaner.NestingState
- OpenTags - Class in org.htmlcleaner
-
Class that contains information and methods for managing list of open, but unhandled tags.
- OpenTags(HtmlCleaner) - Constructor for class org.htmlcleaner.OpenTags
- optional - org.htmlcleaner.CloseTag
-
or is permitted
- OptionalOutput - Enum in org.htmlcleaner
- OptionalOutput() - Constructor for enum org.htmlcleaner.OptionalOutput
- org.htmlcleaner - package org.htmlcleaner
- org.htmlcleaner.audit - package org.htmlcleaner.audit
- org.htmlcleaner.conditional - package org.htmlcleaner.conditional
- outcharset - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- outputCData(CData) - Method in class org.htmlcleaner.DomSerializer
- outputtype - Variable in class org.htmlcleaner.HtmlCleanerForAnt
P
- parent - Variable in class org.htmlcleaner.BaseHtmlNode
- parse(InputSource) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- parserFactory - Static variable in class org.htmlcleaner.ConfigFileTagProvider
- part1 - Variable in class org.htmlcleaner.DoctypeToken
- part2 - Variable in class org.htmlcleaner.DoctypeToken
- part3 - Variable in class org.htmlcleaner.DoctypeToken
- part4 - Variable in class org.htmlcleaner.DoctypeToken
- permittedTags - Variable in class org.htmlcleaner.TagInfo
- PHRASING_TAGS - Static variable in class org.htmlcleaner.DefaultTagProvider
-
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content
- PHRASING_TAGS - Static variable in class org.htmlcleaner.Html4TagProvider
-
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content
- PHRASING_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
-
Phrasing tags are those that can make up paragraphs along with text to make Phrasing Content.
- phrasingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
Html5 phrasing tags --text level semantics (31 total) thelw data
- pop() - Method in class org.htmlcleaner.ChildBreaks
-
pops out latest broken tag position.
- popNesting(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- position - Variable in class org.htmlcleaner.TagPos
- PRE_TAG - Static variable in class org.htmlcleaner.BrowserCompactXmlSerializer
- preferredChildTag - Variable in class org.htmlcleaner.TagInfo
- presentationMarkup(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- preserve - org.htmlcleaner.OptionalOutput
-
outputed ONLY if supplied in the source.
- preserveSourceAttributes - Variable in class org.htmlcleaner.TagTransformation
- PrettyHtmlSerializer - Class in org.htmlcleaner
-
Pretty HTML serializer - creates resulting HTML with indenting lines.
- PrettyHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
- PrettyHtmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyHtmlSerializer
- PrettyXmlSerializer - Class in org.htmlcleaner
-
Pretty XML serializer - creates resulting XML with indenting lines.
- PrettyXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
- PrettyXmlSerializer(CleanerProperties, String) - Constructor for class org.htmlcleaner.PrettyXmlSerializer
- properties - Variable in class org.htmlcleaner.HtmlCleaner
- props - Variable in class org.htmlcleaner.DomBuilder
- props - Variable in class org.htmlcleaner.DomSerializer
-
The HTML Cleaner properties set by the user to control the HTML cleaning.
- props - Variable in class org.htmlcleaner.HtmlTokenizer
- props - Variable in class org.htmlcleaner.JDomSerializer
- props - Variable in class org.htmlcleaner.Serializer
- props - Variable in class org.htmlcleaner.TraversalDomSerializer
- ProxyTagNode - Class in org.htmlcleaner
-
A
TagNode
that only really holds whitespace or comments - allows usingContentNode
in places where aTagNode
is expected. - ProxyTagNode(CommentNode, TagNode) - Constructor for class org.htmlcleaner.ProxyTagNode
- ProxyTagNode(ContentNode, TagNode) - Constructor for class org.htmlcleaner.ProxyTagNode
- pruned - Variable in class org.htmlcleaner.TagNode
-
Indicates that the node was marked to be pruned out of the tree.
- pruneNodeSet - Variable in class org.htmlcleaner.CleanTimeValues
- prunetags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- pruneTags - Variable in class org.htmlcleaner.CleanerProperties
- pruneTagSet - Variable in class org.htmlcleaner.CleanerProperties
-
blacklist of tags
- pruneTagSet - Variable in class org.htmlcleaner.CleanTimeValues
- pushNesting(CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- put(String, TagInfo) - Method in class org.htmlcleaner.DefaultTagProvider
- put(String, TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- put(String, TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
It inserts the tag node into the tagInfoMap.
- put(String, TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- put(SpecialEntity) - Method in class org.htmlcleaner.SpecialEntities
R
- readIfNeeded(int) - Method in class org.htmlcleaner.HtmlTokenizer
- readUrl(URL, String) - Static method in class org.htmlcleaner.Utils
-
Deprecated.
- recognizeUnicodeChars - Variable in class org.htmlcleaner.CleanerProperties
- removeAllChildren() - Method in class org.htmlcleaner.TagNode
-
Removes all children (subelements and text content).
- removeAttribute(String) - Method in class org.htmlcleaner.TagNode
-
Removes specified attribute from this tag.
- removeChild(Object) - Method in class org.htmlcleaner.TagNode
-
Remove specified child element from this node.
- removeFromTree() - Method in class org.htmlcleaner.ProxyTagNode
- removeFromTree() - Method in class org.htmlcleaner.TagNode
-
Remove this node from the tree.
- removeTag(String) - Method in class org.htmlcleaner.OpenTags
- reopenBrokenNode(ListIterator<BaseToken>, TagNode, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- replaceAttributes(Map<String, String>) - Method in class org.htmlcleaner.TagNode
-
Clears existing attributes and puts replacement attributes
- replaceInvalidXmlIdentifierCharacters(String, String) - Static method in class org.htmlcleaner.Utils
-
Strips out invalid characters from names used for XML Elements and replaces them with the specified character.
- required - org.htmlcleaner.CloseTag
-
is required.
- RequiredParentMissing - org.htmlcleaner.audit.ErrorType
-
Missing parent tag was added for current (i.e.
- requiredParentTags - Variable in class org.htmlcleaner.TagInfo
- reset() - Method in class org.htmlcleaner.CleanerProperties
-
advancedXmlEscape = true; setUseCdataFor("script,style"); translateSpecialEntities = true; recognizeUnicodeChars = true; omitUnknownTags = false; treatUnknownTagsAsContent = false; omitDeprecatedTags = false; treatDeprecatedTagsAsContent = false; omitComments = false; omitXmlDeclaration = OptionalOutput.alwaysOutput; omitDoctypeDeclaration = OptionalOutput.alwaysOutput; omitHtmlEnvelope = OptionalOutput.alwaysOutput; useEmptyElementTags = true; allowMultiWordAttributes = true; allowHtmlInsideAttributes = false; ignoreQuestAndExclam = true; namespacesAware = true; keepHeadWhitespace = true; addNewlineToHeadAndBody = true; hyphenReplacementInComment = "="; pruneTags = null; allowTags = null; booleanAttributeValues = BOOL_ATT_SELF; collapseNullHtml = CollapseHtml.none charset = "UTF-8"; trimAttributeValues = true; tagInfoProvider = HTML5TagProvider.INSTANCE maxDepth = 1000
- resetPruneTagSet() - Method in class org.htmlcleaner.CleanerProperties
- rootNode - Variable in class org.htmlcleaner.CleanTimeValues
- row - Variable in class org.htmlcleaner.BaseTokenImpl
- rtrim(String) - Method in class org.htmlcleaner.PrettyXmlSerializer
-
Trims specified string from right.
- rtrim(String) - Static method in class org.htmlcleaner.Utils
-
Trims specified string from right.
S
- SAFE_BEGIN_CDATA - Static variable in class org.htmlcleaner.CData
- SAFE_BEGIN_CDATA_ALT - Static variable in class org.htmlcleaner.CData
- SAFE_END_CDATA - Static variable in class org.htmlcleaner.CData
- SAFE_END_CDATA_ALT - Static variable in class org.htmlcleaner.CData
- sanitizeHtmlAttributeName(String) - Static method in class org.htmlcleaner.Utils
- sanitizeXmlIdentifier(String) - Static method in class org.htmlcleaner.Utils
- sanitizeXmlIdentifier(String, String) - Static method in class org.htmlcleaner.Utils
- sanitizeXmlIdentifier(String, String, String) - Static method in class org.htmlcleaner.Utils
-
Attempts to replace invalid attribute names with valid ones.
- satisfy(String, String) - Method in interface org.htmlcleaner.AttributeTransformation
- satisfy(String, String) - Method in class org.htmlcleaner.AttributeTransformationPatternImpl
- satisfy(TagNode) - Method in interface org.htmlcleaner.conditional.ITagNodeCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagAllCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttExistsCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAttValueCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
- satisfy(TagNode) - Method in class org.htmlcleaner.conditional.TagNodeNameCondition
- satisfy(TagNode, boolean) - Method in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- save(char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves specified character to the temporary buffer.
- saveCurrent() - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves character at current runtime position to the temporary buffer.
- saveCurrent(int) - Method in class org.htmlcleaner.HtmlTokenizer
-
Saves specified number of characters at current runtime position to the temporary buffer.
- saveToLastOpenTag(List, Object, CleanTimeValues) - Method in class org.htmlcleaner.HtmlCleaner
- SCRIPT_SUPPORTING_TAGS - Static variable in class org.htmlcleaner.Html5TagProvider
- scriptElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- scriptElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- scriptingTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
HTML5 scripting tags
- semanticFlowTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 semantic flow tags-Sectioning tags (15 total)
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.BaseHtmlNode
- serialize(Serializer, Writer) - Method in interface org.htmlcleaner.BaseToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.CommentNode
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.ContentNode
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.DoctypeToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.EndTagToken
- serialize(Serializer, Writer) - Method in class org.htmlcleaner.TagNode
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.BrowserCompactXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.CompactXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.PrettyXmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.Serializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleHtmlSerializer
- serialize(TagNode, Writer) - Method in class org.htmlcleaner.SimpleXmlSerializer
- serializeAttribute(TagNode, Writer, String, String) - Method in class org.htmlcleaner.XmlSerializer
-
This allows overriding to eliminate forbidden attributes (for example javascript attributes onclick, onblur, etc.
- serializeCData(CData, TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
-
Serialize a CDATA section.
- serializeContentToken(ContentNode, TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
-
Serialize a content token, escaping where necessary.
- serializeEndTag(TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
- serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
- serializeEndTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
- serializeOpenTag(TagNode, Writer) - Method in class org.htmlcleaner.XmlSerializer
- serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.HtmlSerializer
- serializeOpenTag(TagNode, Writer, boolean) - Method in class org.htmlcleaner.XmlSerializer
- serializePrettyHtml(TagNode, Writer, int, boolean, boolean) - Method in class org.htmlcleaner.PrettyHtmlSerializer
- serializePrettyXml(TagNode, Writer, int) - Method in class org.htmlcleaner.PrettyXmlSerializer
- Serializer - Class in org.htmlcleaner
-
Basic abstract serializer - contains common logic for descendants (methods
writeXXX()
. - Serializer(CleanerProperties) - Constructor for class org.htmlcleaner.Serializer
- Serializer.HeadlessTagNode - Class in org.htmlcleaner
-
Used to implement serialization with missing envelope - omiting open and close tags, just serialize children.
- set - Variable in class org.htmlcleaner.OpenTags
- setAddNewlineToHeadAndBody(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAdvancedxmlescape(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAdvancedXmlEscape(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAllowhtmlinsideattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAllowHtmlInsideAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAllowInvalidAttributeNames(boolean) - Method in class org.htmlcleaner.CleanerProperties
-
Set whether to allow invalid attribute names, or to try to fix or omit them
- setAllowInvalidAttributeNames(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAllowmultiwordattributes(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setAllowMultiWordAttributes(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setAllowTags(String) - Method in class org.htmlcleaner.CleanerProperties
- setAllowTagSet(String) - Method in class org.htmlcleaner.CleanerProperties
- setAssumedNamespace(String) - Method in class org.htmlcleaner.TagInfo
- setAssumedNamespacePrefix(String) - Method in class org.htmlcleaner.TagInfo
- setAttributes(Map<String, String>) - Method in class org.htmlcleaner.TagNode
-
Replace the current set of attributes with a new set.
- setAttributes(TagNode, Element) - Method in class org.htmlcleaner.JDomSerializer
- setAutoGenerated(boolean) - Method in class org.htmlcleaner.TagNode
- setBelongsTo(BelongsTo) - Method in class org.htmlcleaner.TagInfo
- setBooleanAttributeValues(String) - Method in class org.htmlcleaner.CleanerProperties
- setBooleanatts(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setCharset(String) - Method in class org.htmlcleaner.CleanerProperties
- setChildren(List<? extends BaseToken>) - Method in class org.htmlcleaner.TagNode
- setChildTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setCleanerTransformations(CleanerTransformations) - Method in class org.htmlcleaner.CleanerProperties
- setCol(int) - Method in interface org.htmlcleaner.BaseToken
- setCol(int) - Method in class org.htmlcleaner.BaseTokenImpl
- setContinueAfterTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setCopyTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setCreatingHtmlDom(boolean) - Method in class org.htmlcleaner.XmlSerializer
- setDeprecated(boolean) - Method in class org.htmlcleaner.TagInfo
- setDeserializeEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setDest(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setDisplay(Display) - Method in class org.htmlcleaner.TagInfo
- setDocType(DoctypeToken) - Method in class org.htmlcleaner.TagNode
- setFatalTag(String) - Method in class org.htmlcleaner.TagInfo
- setForeignMarkup(boolean) - Method in class org.htmlcleaner.TagNode
- setFormed() - Method in class org.htmlcleaner.TagNode
- setFormed(boolean) - Method in class org.htmlcleaner.TagNode
- setHigherTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setHtmlVersion(int) - Method in class org.htmlcleaner.CleanerProperties
-
Sets the html version according to the parameter.Also,it sets the tag provider to the appropriate version.
- setHyphenreplacement(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setHyphenReplacementInComment(String) - Method in class org.htmlcleaner.CleanerProperties
- setIgnoreqe(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setIgnoreQuestAndExclam(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setIncharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setInnerHtml(TagNode, String) - Method in class org.htmlcleaner.HtmlCleaner
-
For the specified tag node, defines it's html content.
- setInvalidAttributeNamePrefix(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setInvalidXmlAttributeNamePrefix(String) - Method in class org.htmlcleaner.CleanerProperties
-
Sets the prefix to use for xml attributes that are invalid
- setItemsToMove(List<BaseToken>) - Method in class org.htmlcleaner.TagNode
- setKeepWhitespaceAndCommentsInHead(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setMaxDepth(int) - Method in class org.htmlcleaner.CleanerProperties
- setMustCloseTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setName(String) - Method in class org.htmlcleaner.TagInfo
- setNamespacesaware(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setNamespacesAware(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setNodebyxpath(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitCdataOutsideScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitcomments(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitComments(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitDeprecatedTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitdeprtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitdoctypedecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitDoctypeDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmithtmlenvelope(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitHtmlEnvelope(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitunknowntags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitUnknownTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOmitxmldecl(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOmitXmlDeclaration(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setOutcharset(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setOutputtype(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setParent(TagNode) - Method in class org.htmlcleaner.BaseHtmlNode
- setParent(TagNode) - Method in interface org.htmlcleaner.HtmlNode
- setPermittedTags(Set<String>) - Method in class org.htmlcleaner.TagInfo
- setPreferredChildTag(String) - Method in class org.htmlcleaner.TagInfo
- setPruned(boolean) - Method in class org.htmlcleaner.TagNode
- setPrunetags(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setPruneTags(String) - Method in class org.htmlcleaner.CleanerProperties
-
Resets prune tags set and adds tag name conditions to it.
- setRecognizeUnicodeChars(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setRequiredParent(String) - Method in class org.htmlcleaner.TagInfo
- setRow(int) - Method in interface org.htmlcleaner.BaseToken
- setRow(int) - Method in class org.htmlcleaner.BaseTokenImpl
- setSpecialentities(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setSrc(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTaginfofile(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTagInfoProvider(ITagInfoProvider) - Method in class org.htmlcleaner.CleanerProperties
- setText(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTransform(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTranslateSpecialEntities(boolean) - Method in class org.htmlcleaner.CleanerProperties
-
TODO : use
OptionalOutput
- setTransResCharsToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTransSpecialEntitiesToNCR(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTreatDeprecatedTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTreatdeprtagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTreatunknowntagsascontent(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setTreatUnknownTagsAsContent(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTrimAttributeValues(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setTrimAttributeValues(boolean) - Method in class org.htmlcleaner.TagNode
- setUnicodechars(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUnique(boolean) - Method in class org.htmlcleaner.TagInfo
- setUsecdata(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUsecdatafor(String) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUseCdataFor(String) - Method in class org.htmlcleaner.CleanerProperties
- setUseCdataForScriptAndStyle(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setUseemptyelementtags(boolean) - Method in class org.htmlcleaner.HtmlCleanerForAnt
- setUseEmptyElementTags(boolean) - Method in class org.htmlcleaner.CleanerProperties
- setXmlVersion(String) - Method in class org.htmlcleaner.DomSerializer
- shouldEscapeOrTranslateEntities() - Method in class org.htmlcleaner.DomBuilder
- SimpleHtmlSerializer - Class in org.htmlcleaner
-
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
- SimpleHtmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
- SimpleHtmlSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.SimpleHtmlSerializer
- SimpleXmlSerializer - Class in org.htmlcleaner
-
Simple XML serializer - creates resulting XML without indenting lines.
- SimpleXmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.SimpleXmlSerializer
- singleton(Object) - Method in class org.htmlcleaner.XPather
-
Creates one-element collection for the specified object.
- skipWhitespaces() - Method in class org.htmlcleaner.HtmlTokenizer
-
Skips whitespaces at current position and moves foreward until non-whitespace character is found or the end of content is reached.
- someAlreadyOpen(Set<String>) - Method in class org.htmlcleaner.OpenTags
-
Checks if any of tags specified in the set are already open.
- sourceTag - Variable in class org.htmlcleaner.TagTransformation
- specialentities - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- SpecialEntities - Class in org.htmlcleaner
-
This class contains map with special entities used in HTML and their unicodes.
- SpecialEntities(boolean, boolean) - Constructor for class org.htmlcleaner.SpecialEntities
- SpecialEntity - Class in org.htmlcleaner
- SpecialEntity(String, int, String, boolean) - Constructor for class org.htmlcleaner.SpecialEntity
- src - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- start() - Method in class org.htmlcleaner.HtmlTokenizer
-
Starts parsing HTML.
- startElement(String, String, String, Attributes) - Method in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- startsWith(String) - Method in class org.htmlcleaner.HtmlTokenizer
-
Checks if content starts with specified value at the current position.
- strictErrorChecking - Variable in class org.htmlcleaner.DomBuilder
- strictErrorChecking - Variable in class org.htmlcleaner.DomSerializer
- strictErrorChecking - Variable in class org.htmlcleaner.TraversalDomSerializer
- STRONG - Static variable in class org.htmlcleaner.DefaultTagProvider
- STRONG - Static variable in class org.htmlcleaner.Html4TagProvider
- STRONG - Static variable in class org.htmlcleaner.Html5TagProvider
- styleElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
T
- tableElements(TagInfo) - Method in class org.htmlcleaner.Html4TagProvider
- tableElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- tableTags(TagInfo) - Method in class org.htmlcleaner.Html5TagProvider
-
The HTML5 table tags (12 total)
- TagAllCondition - Class in org.htmlcleaner.conditional
-
All nodes.
- TagAllCondition() - Constructor for class org.htmlcleaner.conditional.TagAllCondition
- tagAttributes() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses list tag attributes from the current position.
- tagEncountered(String) - Method in class org.htmlcleaner.OpenTags
- tagEnd() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses end of the tag.
- tagExists(String, CleanTimeValues) - Method in class org.htmlcleaner.OpenTags
- tagInfo - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- TagInfo - Class in org.htmlcleaner
-
Class contains information about single HTML tag.
It also contains rules for tag balancing. - TagInfo(String, ContentType, BelongsTo, boolean, boolean, boolean, CloseTag, Display) - Constructor for class org.htmlcleaner.TagInfo
- taginfofile - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- tagInfoMap - Variable in class org.htmlcleaner.ConfigFileTagProvider.ConfigParser
- tagInfoMap - Variable in class org.htmlcleaner.DefaultTagProvider
- tagInfoMap - Variable in class org.htmlcleaner.Html4TagProvider
- tagInfoMap - Variable in class org.htmlcleaner.Html5TagProvider
- tagInfoProvider - Variable in class org.htmlcleaner.CleanerProperties
- tagInfoProvider - Variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- TagNode - Class in org.htmlcleaner
-
XML node tag - basic node of the cleaned HTML tree.
- TagNode(String) - Constructor for class org.htmlcleaner.TagNode
- TagNode(String, boolean) - Constructor for class org.htmlcleaner.TagNode
- TagNodeAttExistsCondition - Class in org.htmlcleaner.conditional
-
Checks if node contains specified attribute.
- TagNodeAttExistsCondition(String) - Constructor for class org.htmlcleaner.conditional.TagNodeAttExistsCondition
- TagNodeAttNameValueRegexCondition - Class in org.htmlcleaner.conditional
-
Checks if node has specified attribute with specified value.
- TagNodeAttNameValueRegexCondition(Pattern, Pattern) - Constructor for class org.htmlcleaner.conditional.TagNodeAttNameValueRegexCondition
- TagNodeAttValueCondition - Class in org.htmlcleaner.conditional
-
Checks if node has specified attribute with specified value.
- TagNodeAttValueCondition(String, String, boolean) - Constructor for class org.htmlcleaner.conditional.TagNodeAttValueCondition
- TagNodeAutoGeneratedCondition - Class in org.htmlcleaner.conditional
-
Remove empty autogenerated nodes.
- TagNodeAutoGeneratedCondition() - Constructor for class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
- TagNodeEmptyContentCondition - Class in org.htmlcleaner.conditional
-
Checks if node is an inline 0r block element and has empty contents or white/non-breakable spaces only.
- TagNodeEmptyContentCondition(ITagInfoProvider) - Constructor for class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
- TagNodeInsignificantBrCondition - Class in org.htmlcleaner.conditional
-
Checks if node is an insignificant br tag -- is placed at the end or at the start of a block.
- TagNodeInsignificantBrCondition() - Constructor for class org.htmlcleaner.conditional.TagNodeInsignificantBrCondition
- TagNodeNameCondition - Class in org.htmlcleaner.conditional
-
Checks if node has specified name.
- TagNodeNameCondition(String) - Constructor for class org.htmlcleaner.conditional.TagNodeNameCondition
- TagNodeVisitor - Interface in org.htmlcleaner
-
Defines action to be performed on TagNodes
- TagPos - Class in org.htmlcleaner
-
Contains information about a single open tag
- TagPos(int, String, TagInfo, CleanTimeValues) - Constructor for class org.htmlcleaner.TagPos
- tagStart() - Method in class org.htmlcleaner.HtmlTokenizer
-
Parses start of the tag.
- TagToken - Class in org.htmlcleaner
-
HTML tag token - descendants are start (TagNode) and end token (EndTagToken).
- TagToken() - Constructor for class org.htmlcleaner.TagToken
- TagToken(String) - Constructor for class org.htmlcleaner.TagToken
- TagTransformation - Class in org.htmlcleaner
-
Describes how specified tag is transformed to another one, or is ignored during parsing
- TagTransformation() - Constructor for class org.htmlcleaner.TagTransformation
- TagTransformation(String) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation in which specified tag will be skipped (ignored) during parsing process.
- TagTransformation(String, String) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation from source tag to target tag preserving all source tag attributes.
- TagTransformation(String, String, boolean) - Constructor for class org.htmlcleaner.TagTransformation
-
Creates new tag transformation from source tag to target tag specifying whether source tag attributes are preserved.
- tail(HtmlNode, int) - Method in class org.htmlcleaner.DomBuilder
- tail(HtmlNode, int) - Method in interface org.htmlcleaner.XmlVisitor
-
Callback for when a node is last visited, after all of its descendants have been visited.
- template - Variable in class org.htmlcleaner.AttributeTransformationPatternImpl
- text - org.htmlcleaner.ContentType
- text - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- throwStandardException() - Method in class org.htmlcleaner.XPather
- toBoolean(String) - Static method in class org.htmlcleaner.CommandLine
- token - Variable in class org.htmlcleaner.ProxyTagNode
- tokenArray - Variable in class org.htmlcleaner.XPather
- tokenElements(TagInfo, ConcurrentMap<String, TagInfo>) - Method in class org.htmlcleaner.MathMLTagProvider
- tokenize(String, String) - Static method in class org.htmlcleaner.Utils
- toString() - Method in class org.htmlcleaner.BaseTokenImpl
- toString() - Method in class org.htmlcleaner.CData
- toString() - Method in class org.htmlcleaner.CommentNode
- toString() - Method in class org.htmlcleaner.conditional.TagNodeAutoGeneratedCondition
- toString() - Method in class org.htmlcleaner.ContentNode
- toString() - Method in class org.htmlcleaner.DoctypeToken
- toString() - Method in class org.htmlcleaner.EndTagToken
- toString() - Method in class org.htmlcleaner.TagToken
- toString(Document) - Static method in class org.htmlcleaner.TraversalDomSerializer
- toText(Object) - Method in class org.htmlcleaner.XPather
- toValue(Object) - Static method in enum org.htmlcleaner.BelongsTo
- toValue(Object) - Static method in enum org.htmlcleaner.ContentType
- transform - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- transformations - Variable in class org.htmlcleaner.HtmlCleaner
- transformations - Variable in class org.htmlcleaner.HtmlTokenizer
- transformAttributes(String, Map<String, String>) - Method in class org.htmlcleaner.CleanerTransformations
- translateSpecialEntities - Variable in class org.htmlcleaner.CleanerProperties
- transResCharsToNCR - Variable in class org.htmlcleaner.CleanerProperties
- transSpecialEntitiesToNCR - Variable in class org.htmlcleaner.CleanerProperties
- TraversalDomSerializer - Class in org.htmlcleaner
-
A traversal-based serializer for DOM; used to avoid recursion and stack overflow for large HTML documents.
- TraversalDomSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.TraversalDomSerializer
- TraversalDomSerializer(CleanerProperties, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
- TraversalDomSerializer(CleanerProperties, boolean, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
- TraversalDomSerializer(CleanerProperties, boolean, boolean, boolean) - Constructor for class org.htmlcleaner.TraversalDomSerializer
- traverse(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
-
Traverses the tree and performs visitor's action on each node.
- traverse(XmlVisitor, HtmlNode) - Static method in class org.htmlcleaner.XmlTraversor
-
Start a depth-first traverse of the root and all of its descendants.
- traverseInternally(TagNodeVisitor) - Method in class org.htmlcleaner.TagNode
- treatDeprecatedTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
- treatdeprtagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- treatunknowntagsascontent - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- treatUnknownTagsAsContent - Variable in class org.htmlcleaner.CleanerProperties
- trimAttributeValues - Variable in class org.htmlcleaner.CleanerProperties
- type - Variable in class org.htmlcleaner.DoctypeToken
-
The identified DocType, if any
U
- UnclosedTag - org.htmlcleaner.audit.ErrorType
-
No matching close token was found for the open tag.
- unicodechars - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- unique - Variable in class org.htmlcleaner.TagInfo
- UniqueTagDuplicated - org.htmlcleaner.audit.ErrorType
-
Second instance of an unique tag was found, most likely it was removed.
- Unknown - org.htmlcleaner.audit.ErrorType
-
The tag is unknown and current cleaner mode doesn't allows this.
- UNKNOWN - Static variable in class org.htmlcleaner.DoctypeToken
- UnpermittedChild - org.htmlcleaner.audit.ErrorType
-
This tag have bad child that shouldn't be here.
- unsafeBlockElements - Static variable in class org.htmlcleaner.conditional.TagNodeEmptyContentCondition
-
Removal of element from this set can affect layout too hard.
- updateCoordinates(char) - Method in class org.htmlcleaner.HtmlTokenizer
-
Looks onto the char passed and updates current position coordinates.
- updateTagTransformations(String, String) - Method in class org.htmlcleaner.CleanerTransformations
- updateTagTransformations(Map) - Method in class org.htmlcleaner.CleanerTransformations
- usecdata - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- usecdatafor - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- useCdataFor - Variable in class org.htmlcleaner.CleanerProperties
- useCdataForList - Variable in class org.htmlcleaner.CleanerProperties
- useemptyelementtags - Variable in class org.htmlcleaner.HtmlCleanerForAnt
- useEmptyElementTags - Variable in class org.htmlcleaner.CleanerProperties
- Utils - Class in org.htmlcleaner
-
Common utilities.
- Utils() - Constructor for class org.htmlcleaner.Utils
V
- valid - Variable in class org.htmlcleaner.DoctypeToken
- VALID_XML_IDENTIFIER_CHAR_PATTERN - Static variable in class org.htmlcleaner.Utils
- VALID_XML_IDENTIFIER_CHAR_REGEX - Static variable in class org.htmlcleaner.Utils
- VALID_XML_IDENTIFIER_START_CHAR_PATTERN - Static variable in class org.htmlcleaner.Utils
- VALID_XML_IDENTIFIER_START_CHAR_REGEX - Static variable in class org.htmlcleaner.Utils
- validate() - Method in class org.htmlcleaner.DoctypeToken
-
Checks the doctype according to W3C parsing rules and tries to identify the type and validity See: http://www.w3.org/TR/html-markup/syntax.html#doctype-syntax http://dev.w3.org/html5/html-author/#doctype-declaration
- valueOf(String) - Static method in enum org.htmlcleaner.audit.ErrorType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.htmlcleaner.BelongsTo
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.htmlcleaner.CloseTag
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.htmlcleaner.ContentType
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.htmlcleaner.Display
-
Returns the enum constant of this type with the specified name.
- valueOf(String) - Static method in enum org.htmlcleaner.OptionalOutput
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum org.htmlcleaner.audit.ErrorType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.htmlcleaner.BelongsTo
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.htmlcleaner.CloseTag
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.htmlcleaner.ContentType
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.htmlcleaner.Display
-
Returns an array containing the constants of this enum type, in the order they are declared.
- values() - Static method in enum org.htmlcleaner.OptionalOutput
-
Returns an array containing the constants of this enum type, in the order they are declared.
- VAR_END - Static variable in class org.htmlcleaner.TagTransformation
- VAR_START - Static variable in class org.htmlcleaner.TagTransformation
- visit(TagNode, HtmlNode) - Method in interface org.htmlcleaner.TagNodeVisitor
-
Action to be performed on single node in the tree
- visitor - Variable in class org.htmlcleaner.XmlTraversor
W
- WORKING_BUFFER_SIZE - Static variable in class org.htmlcleaner.HtmlTokenizer
- write(TagNode, Writer, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified node using specified writer.
- write(TagNode, Writer, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified node using specified writer.
- writeToFile(TagNode, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using system default charset.
- writeToFile(TagNode, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
- writeToFile(TagNode, String, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using specified charset.
- writeToFile(TagNode, String, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the file, using specified charset and optionally omits node envelope (skips open and close tags of the node).
- writeToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using system default charset.
- writeToStream(TagNode, OutputStream, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using system default charset and optionally omits node envelope (skips open and close tags of the node).
- writeToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using specified charset.
- writeToStream(TagNode, OutputStream, String, boolean) - Method in class org.htmlcleaner.Serializer
-
Writes specified TagNode to the output stream, using specified charset and optionally omits node envelope (skips open and close tags of the node).
- writeXml(TagNode, Writer, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use write() instead.
- writeXmlToFile(TagNode, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use writeToFile() instead.
- writeXmlToFile(TagNode, String, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use writeToFile() instead.
- writeXmlToStream(TagNode, OutputStream) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use writeToStream() instead.
- writeXmlToStream(TagNode, OutputStream, String) - Method in class org.htmlcleaner.XmlSerializer
-
Deprecated.Use writeToStream() instead.
X
- XHTML1_0_FRAMESET - Static variable in class org.htmlcleaner.DoctypeToken
- XHTML1_0_STRICT - Static variable in class org.htmlcleaner.DoctypeToken
- XHTML1_0_TRANSITIONAL - Static variable in class org.htmlcleaner.DoctypeToken
- XHTML1_1 - Static variable in class org.htmlcleaner.DoctypeToken
- XHTML1_1_BASIC - Static variable in class org.htmlcleaner.DoctypeToken
- XML_10 - Static variable in class org.htmlcleaner.DomSerializer
- XML_11 - Static variable in class org.htmlcleaner.DomSerializer
- XMLNS_NAMESPACE - Static variable in class org.htmlcleaner.XmlSerializer
- XmlSerializer - Class in org.htmlcleaner
-
Abstract XML serializer - contains common logic for descendants.
- XmlSerializer(CleanerProperties) - Constructor for class org.htmlcleaner.XmlSerializer
- XmlTraversor - Class in org.htmlcleaner
-
Depth-first node traversor.
- XmlTraversor() - Constructor for class org.htmlcleaner.XmlTraversor
- xmlVersion - Variable in class org.htmlcleaner.DomSerializer
- XmlVisitor - Interface in org.htmlcleaner
-
Node visitor interface.
- XPather - Class in org.htmlcleaner
-
Utility for searching cleaned document tree with XPath expressions.
- XPather(String) - Constructor for class org.htmlcleaner.XPather
-
Constructor - creates XPather instance with specified XPath expression.
- XPatherException - Exception in org.htmlcleaner
-
Exception that could occure during XPather evaluation.
- XPatherException() - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(String) - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(String, Throwable) - Constructor for exception org.htmlcleaner.XPatherException
- XPatherException(Throwable) - Constructor for exception org.htmlcleaner.XPatherException
_
- _asExpected - Variable in class org.htmlcleaner.HtmlTokenizer
- _bodyOpened - Variable in class org.htmlcleaner.CleanTimeValues
- _col - Variable in class org.htmlcleaner.HtmlTokenizer
- _currentTagToken - Variable in class org.htmlcleaner.HtmlTokenizer
- _docType - Variable in class org.htmlcleaner.HtmlTokenizer
- _headOpened - Variable in class org.htmlcleaner.CleanTimeValues
- _headTags - Variable in class org.htmlcleaner.CleanTimeValues
- _isLateForDoctype - Variable in class org.htmlcleaner.HtmlTokenizer
- _isSpecialContext - Variable in class org.htmlcleaner.HtmlTokenizer
- _isSpecialContextName - Variable in class org.htmlcleaner.HtmlTokenizer
- _len - Variable in class org.htmlcleaner.HtmlTokenizer
- _namespacePrefixes - Variable in class org.htmlcleaner.HtmlTokenizer
- _pos - Variable in class org.htmlcleaner.HtmlTokenizer
- _put(SpecialEntity) - Method in class org.htmlcleaner.SpecialEntities
- _reader - Variable in class org.htmlcleaner.HtmlTokenizer
- _row - Variable in class org.htmlcleaner.HtmlTokenizer
- _saved - Variable in class org.htmlcleaner.HtmlTokenizer
- _tokenList - Variable in class org.htmlcleaner.HtmlTokenizer
- _working - Variable in class org.htmlcleaner.HtmlTokenizer
All Classes All Packages