Class ArabicLigaturizer

  • All Implemented Interfaces:
    LanguageProcessor

    public class ArabicLigaturizer
    extends Object
    implements LanguageProcessor
    Shape arabic characters. This code was inspired by an LGPL'ed C library: Pango ( see http://www.pango.com/ ). Note that the code of this class is the original work of Paulo Soares.
    Author:
    Paulo Soares
    • Field Detail

      • DIGITS_EN2AN

        public static final int DIGITS_EN2AN
        Digit shaping option: Replace European digits (U+0030...U+0039) by Arabic-Indic digits.
        See Also:
        Constant Field Values
      • DIGITS_AN2EN

        public static final int DIGITS_AN2EN
        Digit shaping option: Replace Arabic-Indic digits by European digits (U+0030...U+0039).
        See Also:
        Constant Field Values
      • DIGITS_EN2AN_INIT_LR

        public static final int DIGITS_EN2AN_INIT_LR
        Digit shaping option: Replace European digits (U+0030...U+0039) by Arabic-Indic digits if the most recent strongly directional character is an Arabic letter (its Bidi direction value is RIGHT_TO_LEFT_ARABIC). The initial state at the start of the text is assumed to be not an Arabic, letter, so European digits at the start of the text will not change. Compare to DIGITS_ALEN2AN_INIT_AL.
        See Also:
        Constant Field Values
      • DIGITS_EN2AN_INIT_AL

        public static final int DIGITS_EN2AN_INIT_AL
        Digit shaping option: Replace European digits (U+0030...U+0039) by Arabic-Indic digits if the most recent strongly directional character is an Arabic letter (its Bidi direction value is RIGHT_TO_LEFT_ARABIC). The initial state at the start of the text is assumed to be an Arabic, letter, so European digits at the start of the text will change. Compare to DIGITS_ALEN2AN_INT_LR.
        See Also:
        Constant Field Values
      • DIGITS_MASK

        public static final int DIGITS_MASK
        Bit mask for digit shaping options.
        See Also:
        Constant Field Values
      • DIGIT_TYPE_AN

        public static final int DIGIT_TYPE_AN
        Digit type option: Use Arabic-Indic digits (U+0660...U+0669).
        See Also:
        Constant Field Values
      • DIGIT_TYPE_AN_EXTENDED

        public static final int DIGIT_TYPE_AN_EXTENDED
        Digit type option: Use Eastern (Extended) Arabic-Indic digits (U+06f0...U+06f9).
        See Also:
        Constant Field Values
      • DIGIT_TYPE_MASK

        public static final int DIGIT_TYPE_MASK
        Bit mask for digit type options.
        See Also:
        Constant Field Values
      • options

        protected int options
      • runDirection

        protected int runDirection
    • Constructor Detail

      • ArabicLigaturizer

        public ArabicLigaturizer()
      • ArabicLigaturizer

        public ArabicLigaturizer​(int runDirection,
                                 int options)
    • Method Detail

      • arabic_shape

        public static int arabic_shape​(char[] src,
                                       int srcoffset,
                                       int srclength,
                                       char[] dest,
                                       int destoffset,
                                       int destlength,
                                       int level)
      • processNumbers

        public static void processNumbers​(char[] text,
                                          int offset,
                                          int length,
                                          int options)
      • getReverseMapping

        public static Character getReverseMapping​(char c)