Class Normalizer

java.lang.Object
net.sf.saxon.serialize.codenorm.Normalizer

public class Normalizer extends Object
Implements Unicode Normalization Forms C, D, KC, KD. Copyright (c) 1991-2005 Unicode, Inc. For terms of use, see http://www.unicode.org/terms_of_use.html For documentation, see UAX#15.
The Unicode Consortium makes no expressed or implied warranty of any kind, and assumes no liability for errors or omissions. No liability is assumed for incidental and consequential damages in connection with or arising out of the use of the information here.
Author:
Mark Davis Updates for supplementary code points: Vladimir Weinstein & Markus Scherer Modified to remove dependency on ICU code: Michael Kay
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    static final int
    Normalization Form Selector
    (package private) static final int
    Masks for the form selector
    (package private) static final int
    Masks for the form selector
    static final int
    Normalization Form Selector
    static final int
    Normalization Form Selector
    static final int
    Normalization Form Selector
    static final int
    Normalization Form Selector
  • Method Summary

    Modifier and Type
    Method
    Description
    static Normalizer
    make(int form, Configuration config)
    Create a normalizer for a given form: static synchronized factory method
    Normalizes text according to the chosen form

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Field Details

  • Method Details

    • make

      public static Normalizer make(int form, Configuration config) throws XPathException
      Create a normalizer for a given form: static synchronized factory method
      Parameters:
      form - the normalization form required: for example C, D
      config - the Saxon configuration
      Throws:
      XPathException - if normalization fails
    • normalize

      public CharSequence normalize(CharSequence source)
      Normalizes text according to the chosen form
      Parameters:
      source - the original text, unnormalized
      Returns:
      target the resulting normalized text