Class StandardEncodingTranslator

    • Field Detail

      • REPLACEMENT

        public static final java.lang.String REPLACEMENT
        https://encoding.spec.whatwg.org/#replacement For the moment we are using this constant in the HTMLScanner to support this encoding. Another option might be to install a proper charset impl in the jvm (like https://github.com/lovasoa/replacement-charset-java) but this might introduce some behavior changes for the whole vm and therefore we handle it here.
        See Also:
        Constant Field Values
      • ENCODING_FROM_LABEL

        public static final java.util.Map<java.lang.String,​java.lang.String> ENCODING_FROM_LABEL
        Encoding names and labels Made public to let other also use this (like HtmlUnit TextDecoder).
      • ENCODING_TO_IANA_ENCODING

        public static final java.util.Map<java.lang.String,​java.lang.String> ENCODING_TO_IANA_ENCODING
        Differences from iana naming Made public to let other also use this (like HtmlUnit TextDecoder).
      • IANA_TO_JAVA_ENCODINGS

        public static final java.util.Map<java.lang.String,​java.lang.String> IANA_TO_JAVA_ENCODINGS
    • Constructor Detail

      • StandardEncodingTranslator

        private StandardEncodingTranslator()
    • Method Detail

      • encodingNameFromLabel

        public java.lang.String encodingNameFromLabel​(java.lang.String charsetLabel)
        Specified by:
        encodingNameFromLabel in interface EncodingTranslator
        Returns:
        the Java encoding name for the specified HTTP encoding name, or null if the name is unknown. The returned encoding name may or may not be supported by the running JVM. Use Charset.isSupported(String) to check availability. Moreover, either of the special definitions REPLACEMENT or #USER_DEFINED may be returned depending on input.