Class CharNormalizer


  • public class CharNormalizer
    extends java.lang.Object
    Some character normalization (and exclusion) functionality. This functionality was previously embedded in the NGram class.
    • Field Summary

      Fields 
      Modifier and Type Field Description
      private static java.lang.String[] CJK_CLASS
      CJK Kanji Normalization Mapping
      private static java.util.Map<java.lang.Character,​java.lang.Character> cjk_map  
      private static java.lang.String LATIN1_EXCLUDED  
    • Constructor Summary

      Constructors 
      Constructor Description
      CharNormalizer()  
    • Method Summary

      All Methods Static Methods Concrete Methods 
      Modifier and Type Method Description
      static char normalize​(char ch)
      Character Normalization (and exclusion).
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • LATIN1_EXCLUDED

        private static final java.lang.String LATIN1_EXCLUDED
      • cjk_map

        private static final java.util.Map<java.lang.Character,​java.lang.Character> cjk_map
      • CJK_CLASS

        private static final java.lang.String[] CJK_CLASS
        CJK Kanji Normalization Mapping
    • Constructor Detail

      • CharNormalizer

        public CharNormalizer()
    • Method Detail

      • normalize

        public static char normalize​(char ch)
        Character Normalization (and exclusion).
        Returns:
        Normalized character, the space to exclude the character.