Package net.sf.saxon.serialize.codenorm



This package contains the code to implement Unicode normalization.

Much of the code is derived directly from the reference implementation published by the Unicode Consortium. However, the code for constructing the data tables used as input to the algorithm has been redesigned and rewritten for efficiency.
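As an illustration of what Unicode normalization does, the following minimal sketch applies the four standard forms (C, D, KC, KD) to one short string. It deliberately uses the JDK's own java.text.Normalizer rather than the classes in this package, whose API is not shown here; the class name NormalizationFormsDemo and the helper codePoints are hypothetical names chosen for the example.

```java
import java.text.Normalizer;

// Illustrative only: uses the JDK's java.text.Normalizer, not Saxon's classes.
final class NormalizationFormsDemo {

    /** Renders each code point as U+XXXX so otherwise invisible differences show up. */
    static String codePoints(String s) {
        StringBuilder sb = new StringBuilder();
        s.codePoints().forEach(cp -> {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", cp));
        });
        return sb.toString();
    }

    public static void main(String[] args) {
        // U+00E9 (e with acute, precomposed) followed by U+FB01 (the "fi" ligature).
        String input = "\u00E9\uFB01";
        for (Normalizer.Form form : Normalizer.Form.values()) {
            String normalized = Normalizer.normalize(input, form);
            System.out.println(form + ": " + codePoints(normalized));
        }
    }
}
```

The canonical forms (C and D) only compose or decompose the accented letter and leave the ligature alone, while the compatibility forms (KC and KD) also replace the ligature with the two plain letters.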

  • Classes
    Class
    Description
    Implements Unicode Normalization Forms C, D, KC, KD.
    Accesses the Normalization Data used for Forms C and D.
    This class reads the Unicode character database, extracts the information needed to perform Unicode normalization, and writes this information out in the form of the Java source module UnicodeData.java.
    This class reads the data compiled into class UnicodeData, and builds hash tables that can be used by the Unicode normalization routines.
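The general idea of extracting normalization data from the Unicode character database and loading it into hash tables can be sketched as follows. This is not the code used in this package: the class UnicodeDataTablesSketch and its fields are hypothetical, and the sketch assumes input lines in the standard semicolon-separated UnicodeData.txt layout, where field 3 is the canonical combining class and field 5 is the decomposition mapping (prefixed with a tag such as <compat> for compatibility decompositions).

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: builds lookup tables from UnicodeData.txt-style lines.
final class UnicodeDataTablesSketch {
    /** Code point -> canonical combining class (only non-zero classes are stored). */
    final Map<Integer, Integer> combiningClass = new HashMap<>();
    /** Code point -> decomposition mapping, as a string of the target characters. */
    final Map<Integer, String> decomposition = new HashMap<>();
    /** Code points whose decomposition is compatibility-only (tagged, e.g. <compat>). */
    final Set<Integer> compatibilityOnly = new HashSet<>();

    /** Parses one UnicodeData.txt-style line and records the fields used for normalization. */
    void addLine(String line) {
        String[] fields = line.split(";", -1); // -1 keeps trailing empty fields
        int codePoint = Integer.parseInt(fields[0], 16);

        int ccc = Integer.parseInt(fields[3]);
        if (ccc != 0) {
            combiningClass.put(codePoint, ccc);
        }

        String mapping = fields[5];
        if (!mapping.isEmpty()) {
            boolean compat = mapping.startsWith("<");
            if (compat) {
                // Drop the leading tag, e.g. "<compat> 0066 0069" -> "0066 0069".
                mapping = mapping.substring(mapping.indexOf('>') + 1).trim();
                compatibilityOnly.add(codePoint);
            }
            StringBuilder target = new StringBuilder();
            for (String hex : mapping.split(" ")) {
                target.appendCodePoint(Integer.parseInt(hex, 16));
            }
            decomposition.put(codePoint, target.toString());
        }
    }

    public static void main(String[] args) {
        UnicodeDataTablesSketch tables = new UnicodeDataTablesSketch();
        tables.addLine("00C0;LATIN CAPITAL LETTER A WITH GRAVE;Lu;0;L;0041 0300;;;;N;LATIN CAPITAL LETTER A GRAVE;;;00E0;");
        tables.addLine("0300;COMBINING GRAVE ACCENT;Mn;230;NSM;;;;;N;NON-SPACING GRAVE;;;;");
        tables.addLine("FB01;LATIN SMALL LIGATURE FI;Ll;0;L;<compat> 0066 0069;;;;N;;;;;");
        System.out.println("Combining classes stored: " + tables.combiningClass.size());
        System.out.println("Decompositions stored: " + tables.decomposition.size());
    }
}
```

Storing only non-zero combining classes keeps the table small, since the great majority of characters have class 0; a lookup that finds no entry can treat the class as 0.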