Class StandardEncodingTranslator

java.lang.Object
org.htmlunit.cyberneko.xerces.util.StandardEncodingTranslator
All Implemented Interfaces:
EncodingTranslator

public final class StandardEncodingTranslator extends Object implements EncodingTranslator
  • Field Details

    • INSTANCE

      public static final StandardEncodingTranslator INSTANCE
      The singleton instance.
    • REPLACEMENT

      public static final String REPLACEMENT
      The replacement encoding; see https://encoding.spec.whatwg.org/#replacement. For the moment, this constant is used in the HTMLScanner to support this encoding. Another option would be to install a proper charset implementation in the JVM (such as https://github.com/lovasoa/replacement-charset-java), but this might introduce behavior changes for the whole VM, so the encoding is handled here instead.
    • ENCODING_FROM_LABEL

      public static final Map<String,String> ENCODING_FROM_LABEL
      Encoding names and labels. Made public so that others can use it as well (for example, the HtmlUnit TextDecoder).
    • ENCODING_TO_IANA_ENCODING

      public static final Map<String,String> ENCODING_TO_IANA_ENCODING
      Differences from IANA naming. Made public so that others can use it as well (for example, the HtmlUnit TextDecoder).
    • IANA_TO_JAVA_ENCODINGS

      public static final Map<String,String> IANA_TO_JAVA_ENCODINGS
  • Constructor Details

    • StandardEncodingTranslator

      private StandardEncodingTranslator()
  • Method Details

    • encodingNameFromLabel

      public String encodingNameFromLabel(String charsetLabel)
      Specified by:
      encodingNameFromLabel in interface EncodingTranslator
      Returns:
      the Java encoding name for the specified HTTP encoding name, or null if the name is unknown. The returned encoding name may or may not be supported by the running JVM; use Charset.isSupported(String) to check availability. Moreover, either of the special definitions REPLACEMENT or USER_DEFINED may be returned, depending on the input.
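
      The return contract above suggests that callers guard against three cases: a null result, the special REPLACEMENT value, and a name the running JVM does not support. The following is a minimal, self-contained sketch of that guard, not the library's implementation. The class EncodingLookupSketch, its small LABELS map, the resolve helper, and the string value assumed for REPLACEMENT are all hypothetical stand-ins; real code would presumably call StandardEncodingTranslator.INSTANCE.encodingNameFromLabel(String) instead of the local lookup.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

final class EncodingLookupSketch {

    // Assumption: stand-in for StandardEncodingTranslator.REPLACEMENT.
    // The actual constant value is not stated in this documentation.
    static final String REPLACEMENT = "replacement";

    // Hypothetical, tiny stand-in for a label-to-encoding-name map.
    static final Map<String, String> LABELS = new HashMap<>();
    static {
        LABELS.put("utf8", "UTF-8");
        LABELS.put("utf-8", "UTF-8");
        LABELS.put("iso-2022-kr", REPLACEMENT);
    }

    // Stand-in for encodingNameFromLabel: returns null for unknown labels.
    // Trimming and lower-casing the label is a choice made for this sketch.
    static String encodingNameFromLabel(String charsetLabel) {
        if (charsetLabel == null) {
            return null;
        }
        return LABELS.get(charsetLabel.trim().toLowerCase(Locale.ROOT));
    }

    // Resolves a label to a Charset, using the fallback when the label is
    // unknown, maps to the replacement encoding, or is unsupported by the JVM.
    static Charset resolve(String charsetLabel, Charset fallback) {
        String name = encodingNameFromLabel(charsetLabel);
        if (name == null || REPLACEMENT.equals(name)) {
            return fallback;
        }
        if (!Charset.isSupported(name)) {
            return fallback;
        }
        return Charset.forName(name);
    }

    public static void main(String[] args) {
        Charset fallback = StandardCharsets.ISO_8859_1;
        System.out.println(resolve(" UTF8 ", fallback));
        System.out.println(resolve("iso-2022-kr", fallback));
        System.out.println(resolve("no-such-label", fallback));
    }
}
```

      Falling back to a default charset is only one possible policy. A caller that needs to honor the replacement encoding would likely treat REPLACEMENT as its own case rather than folding it into the fallback.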