Interface UProperty


public interface UProperty

Selection constants for Unicode properties.

These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.

The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).

For details about the properties see UAX #44: Unicode Character Database.

Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are unavailable or only partially available. Check UCharacter.getUnicodeVersion() to be sure.

Author:
Syn Wee Quek
  • Nested Class Summary

    Nested Classes
    Modifier and Type
    Interface
    Description
    static interface 
    Selector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName().
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    static final int
    String property Age.
    static final int
    Binary property Alphabetic.
    static final int
    Binary property ASCII_Hex_Digit (0-9 A-F a-f).
    static final int
    Binary property of strings Basic_Emoji.
    static final int
    Enumerated property Bidi_Class.
    static final int
    Binary property Bidi_Control.
    static final int
    Binary property Bidi_Mirrored.
    static final int
    String property Bidi_Mirroring_Glyph.
    static final int
    String property Bidi_Paired_Bracket (new in Unicode 6.3).
    static final int
    Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3).
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for binary Unicode properties.
    static final int
    Enumerated property Block.
    static final int
    Enumerated property Canonical_Combining_Class.
    static final int
    String property Case_Folding.
    static final int
    Binary property Case_Ignorable.
    static final int
    Binary property Case_Sensitive.
    static final int
    Binary property Cased.
    static final int
    Binary property Changes_When_Casefolded.
    static final int
    Binary property Changes_When_Casemapped.
    static final int
    Binary property Changes_When_Lowercased.
    static final int
    Binary property Changes_When_NFKC_Casefolded.
    static final int
    Binary property Changes_When_Titlecased.
    static final int
    Binary property Changes_When_Uppercased.
    static final int
    Binary property Dash.
    static final int
    Enumerated property Decomposition_Type.
    static final int
    Binary property Default_Ignorable_Code_Point (new).
    static final int
    Binary property Deprecated (new).
    static final int
    Binary property Diacritic.
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for double Unicode properties.
    static final int
    Enumerated property East_Asian_Width.
    static final int
    Binary property Emoji.
    static final int
    Binary property Emoji_Component.
    static final int
    Binary property of strings Emoji_Keycap_Sequence.
    static final int
    Binary property Emoji_Modifier.
    static final int
    Binary property Emoji_Modifier_Base.
    static final int
    Binary property Emoji_Presentation.
    static final int
    Binary property Extended_Pictographic.
    static final int
    Binary property Extender.
    static final int
    Binary property Full_Composition_Exclusion.
    static final int
    Enumerated property General_Category.
    static final int
    Bitmask property General_Category_Mask.
    static final int
    Binary property Grapheme_Base (new).
    static final int
    Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1).
    static final int
    Binary property Grapheme_Extend (new).
    static final int
    Binary property Grapheme_Link (new).
    static final int
    Enumerated property Hangul_Syllable_Type, new in Unicode 4.
    static final int
    Binary property Hex_Digit.
    static final int
    Binary property Hyphen.
    static final int
    Binary property ID_Continue.
    static final int
    Binary property ID_Start.
    static final int
    Binary property Ideographic.
    static final int
    Binary property IDS_Binary_Operator (new).
    static final int
    Binary property IDS_Trinary_Operator (new).
    static final int
    Enumerated property Indic_Positional_Category.
    static final int
    Enumerated property Indic_Syllabic_Category.
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for enumerated/integer Unicode properties.
    static final int
    Deprecated.
    ICU 49
    static final int
    Binary property Join_Control.
    static final int
    Enumerated property Joining_Group.
    static final int
    Enumerated property Joining_Type.
    static final int
    Enumerated property Lead_Canonical_Combining_Class.
    static final int
    Enumerated property Line_Break.
    static final int
    Binary property Logical_Order_Exception (new).
    static final int
    Binary property Lowercase.
    static final int
    String property Lowercase_Mapping.
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for bit-mask Unicode properties.
    static final int
    Binary property Math.
    static final int
    String property Name.
    static final int
    Binary property NFC_Inert.
    static final int
    Enumerated property NFC_Quick_Check.
    static final int
    Binary property NFD_Inert.
    static final int
    Enumerated property NFD_Quick_Check.
    static final int
    Binary property NFKC_Inert.
    static final int
    Enumerated property NFKC_Quick_Check.
    static final int
    Binary property NFKD_Inert.
    static final int
    Enumerated property NFKD_Quick_Check.
    static final int
    Binary property Noncharacter_Code_Point.
    static final int
    Enumerated property Numeric_Type.
    static final int
    Double property Numeric_Value.
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for Unicode properties with unusual value types.
    static final int
    Binary property Pattern_Syntax (new in Unicode 4.1).
    static final int
    Binary property Pattern_White_Space (new in Unicode 4.1).
    static final int
    Binary property alnum (a C/POSIX character class).
    static final int
    Binary property blank (a C/POSIX character class).
    static final int
    Binary property graph (a C/POSIX character class).
    static final int
    Binary property print (a C/POSIX character class).
    static final int
    Binary property xdigit (a C/POSIX character class).
    static final int
    Binary property Prepended_Concatenation_Mark.
    static final int
    Binary property Quotation_Mark.
    static final int
    Binary property Radical (new).
    static final int
    Binary property Regional_Indicator.
    static final int
    Binary property of strings RGI_Emoji.
    static final int
    Binary property of strings RGI_Emoji_Flag_Sequence.
    static final int
    Binary property of strings RGI_Emoji_Modifier_Sequence.
    static final int
    Binary property of strings RGI_Emoji_Tag_Sequence.
    static final int
    Binary property of strings RGI_Emoji_ZWJ_Sequence.
    static final int
    Binary property STerm (new in Unicode 4.0.1).
    static final int
    Enumerated property Script.
    static final int
    Miscellaneous property Script_Extensions (new in Unicode 6.0).
    static final int
    Binary property Segment_Starter.
    static final int
    Enumerated property Sentence_Break (new in Unicode 4.1).
    static final int
    String property Simple_Case_Folding.
    static final int
    String property Simple_Lowercase_Mapping.
    static final int
    String property Simple_Titlecase_Mapping.
    static final int
    String property Simple_Uppercase_Mapping.
    static final int
    Binary property Soft_Dotted (new).
    static final int
    Deprecated.
    ICU 58 The numeric value may change over time, see ICU ticket #12420.
    static final int
    First constant for string Unicode properties.
    static final int
    Binary property Terminal_Punctuation.
    static final int
    String property Titlecase_Mapping.
    static final int
    Enumerated property Trail_Canonical_Combining_Class.
    static final int
    Deprecated.
    This API is ICU internal only.
    static final int
    Deprecated.
    ICU 49
    static final int
    Binary property Unified_Ideograph (new).
    static final int
    Binary property Uppercase.
    static final int
    String property Uppercase_Mapping.
    static final int
    Binary property Variation_Selector (new in Unicode 4.0.1).
    static final int
    Enumerated property Vertical_Orientation.
    static final int
    Binary property White_Space.
    static final int
    Enumerated property Word_Break (new in Unicode 4.1).
    static final int
    Binary property XID_Continue.
    static final int
    Binary property XID_Start.
  • Field Details

    • UNDEFINED

      @Deprecated static final int UNDEFINED
      Deprecated.
      This API is ICU internal only.
      Special value indicating undefined property.
      See Also:
    • ALPHABETIC

      static final int ALPHABETIC

      Binary property Alphabetic.

      Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isLetter().

      Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.
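
The formula above can be illustrated with a minimal sketch. It uses java.lang.Character from the JDK rather than ICU4J, so the Unicode version behind it may differ from the one ICU was built with; the class and method names are hypothetical. It checks only the general-category part of the formula and shows that a character can be Alphabetic solely through Other_Alphabetic.

```java
// Sketch only: relies on the JDK's java.lang.Character, not on ICU4J.
final class AlphabeticSketch {
    /** True if the general category is one of Lu, Ll, Lt, Lm, Lo, Nl. */
    static boolean inCategoryUnion(int codePoint) {
        switch (Character.getType(codePoint)) {
            case Character.UPPERCASE_LETTER: // Lu
            case Character.LOWERCASE_LETTER: // Ll
            case Character.TITLECASE_LETTER: // Lt
            case Character.MODIFIER_LETTER:  // Lm
            case Character.OTHER_LETTER:     // Lo
            case Character.LETTER_NUMBER:    // Nl
                return true;
            default:
                return false;
        }
    }

    public static void main(String[] args) {
        // U+0345 COMBINING GREEK YPOGEGRAMMENI has category Mn but is
        // listed in Other_Alphabetic, so only the second check succeeds.
        int ypogegrammeni = 0x0345;
        System.out.println(inCategoryUnion(ypogegrammeni));
        System.out.println(Character.isAlphabetic(ypogegrammeni));
    }
}
```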

      See Also:
    • BINARY_START

      static final int BINARY_START
      First constant for binary Unicode properties.
      See Also:
    • ASCII_HEX_DIGIT

      static final int ASCII_HEX_DIGIT
      Binary property ASCII_Hex_Digit (0-9 A-F a-f).
      See Also:
    • BIDI_CONTROL

      static final int BIDI_CONTROL

      Binary property Bidi_Control.

      Format controls which have specific functions in the Bidi Algorithm.

      See Also:
    • BIDI_MIRRORED

      static final int BIDI_MIRRORED

      Binary property Bidi_Mirrored.

      Characters that may change display in RTL text.

      Property for UCharacter.isMirrored().

      See the Unicode Bidirectional Algorithm (UAX #9).

      See Also:
    • DASH

      static final int DASH

      Binary property Dash.

      Variations of dashes.

      See Also:
    • DEFAULT_IGNORABLE_CODE_POINT

      static final int DEFAULT_IGNORABLE_CODE_POINT

      Binary property Default_Ignorable_Code_Point (new).

      Indicates that a code point is ignorable in most processing.

      Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)

      See Also:
    • DEPRECATED

      static final int DEPRECATED

      Binary property Deprecated (new).

      The usage of deprecated characters is strongly discouraged.

      See Also:
    • DIACRITIC

      static final int DIACRITIC

      Binary property Diacritic.

      Characters that linguistically modify the meaning of another character to which they apply.

      See Also:
    • EXTENDER

      static final int EXTENDER

      Binary property Extender.

      Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.

      See Also:
    • FULL_COMPOSITION_EXCLUSION

      static final int FULL_COMPOSITION_EXCLUSION

      Binary property Full_Composition_Exclusion.

      CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.

      See Also:
    • GRAPHEME_BASE

      static final int GRAPHEME_BASE

      Binary property Grapheme_Base (new).

      For programmatic determination of grapheme cluster boundaries.

      [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ

      See Also:
    • GRAPHEME_EXTEND

      static final int GRAPHEME_EXTEND

      Binary property Grapheme_Extend (new).

      For programmatic determination of grapheme cluster boundaries.

      Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ

      See Also:
    • HEX_DIGIT

      static final int HEX_DIGIT

      Binary property Hex_Digit.

      Characters commonly used for hexadecimal numbers.

      See Also:
    • HYPHEN

      static final int HYPHEN

      Binary property Hyphen.

      Dashes used to mark connections between pieces of words, plus the Katakana middle dot.

      See Also:
    • ID_CONTINUE

      static final int ID_CONTINUE

      Binary property ID_Continue.

      Characters that can continue an identifier.

      ID_Start+Mn+Mc+Nd+Pc

      See Also:
    • ID_START

      static final int ID_START

      Binary property ID_Start.

      Characters that can start an identifier.

      Lu+Ll+Lt+Lm+Lo+Nl
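
The two identifier formulas on this page (ID_Start and ID_Continue) can be read as plain set unions over general categories. The following is a minimal sketch of exactly those unions, written against the JDK's java.lang.Character rather than ICU4J. The class and method names are hypothetical, and the full UCD derivation has additional terms that this page does not list, so treat it as an approximation.

```java
// Sketch only: implements the simplified formulas quoted on this page.
final class IdentifierSketch {
    /** Lu+Ll+Lt+Lm+Lo+Nl */
    static boolean idStart(int codePoint) {
        switch (Character.getType(codePoint)) {
            case Character.UPPERCASE_LETTER: // Lu
            case Character.LOWERCASE_LETTER: // Ll
            case Character.TITLECASE_LETTER: // Lt
            case Character.MODIFIER_LETTER:  // Lm
            case Character.OTHER_LETTER:     // Lo
            case Character.LETTER_NUMBER:    // Nl
                return true;
            default:
                return false;
        }
    }

    /** ID_Start+Mn+Mc+Nd+Pc */
    static boolean idContinue(int codePoint) {
        if (idStart(codePoint)) {
            return true;
        }
        switch (Character.getType(codePoint)) {
            case Character.NON_SPACING_MARK:       // Mn
            case Character.COMBINING_SPACING_MARK: // Mc
            case Character.DECIMAL_DIGIT_NUMBER:   // Nd
            case Character.CONNECTOR_PUNCTUATION:  // Pc
                return true;
            default:
                return false;
        }
    }
}
```

Under these formulas a digit or an underscore may continue an identifier but may not start one.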

      See Also:
    • IDEOGRAPHIC

      static final int IDEOGRAPHIC

      Binary property Ideographic.

      CJKV ideographs.

      See Also:
    • IDS_BINARY_OPERATOR

      static final int IDS_BINARY_OPERATOR

      Binary property IDS_Binary_Operator (new).

      For programmatic determination of Ideographic Description Sequences.

      See Also:
    • IDS_TRINARY_OPERATOR

      static final int IDS_TRINARY_OPERATOR

      Binary property IDS_Trinary_Operator (new).

      For programmatic determination of Ideographic Description Sequences.

      See Also:
    • JOIN_CONTROL

      static final int JOIN_CONTROL

      Binary property Join_Control.

      Format controls for cursive joining and ligation.

      See Also:
    • LOGICAL_ORDER_EXCEPTION

      static final int LOGICAL_ORDER_EXCEPTION

      Binary property Logical_Order_Exception (new).

      Characters that do not use logical order and require special handling in most processing.

      See Also:
    • LOWERCASE

      static final int LOWERCASE

      Binary property Lowercase.

      Same as UCharacter.isULowercase(), different from UCharacter.isLowerCase().

      Ll+Other_Lowercase

      See Also:
    • MATH

      static final int MATH

      Binary property Math.

      Sm+Other_Math

      See Also:
    • NONCHARACTER_CODE_POINT

      static final int NONCHARACTER_CODE_POINT

      Binary property Noncharacter_Code_Point.

      Code points that are explicitly defined as illegal for the encoding of characters.

      See Also:
    • QUOTATION_MARK

      static final int QUOTATION_MARK

      Binary property Quotation_Mark.

      See Also:
    • RADICAL

      static final int RADICAL

      Binary property Radical (new).

      For programmatic determination of Ideographic Description Sequences.

      See Also:
    • SOFT_DOTTED

      static final int SOFT_DOTTED

      Binary property Soft_Dotted (new).

      Characters with a "soft dot", like i or j.

      An accent placed on these characters causes the dot to disappear.

      See Also:
    • TERMINAL_PUNCTUATION

      static final int TERMINAL_PUNCTUATION

      Binary property Terminal_Punctuation.

      Punctuation characters that generally mark the end of textual units.

      See Also:
    • UNIFIED_IDEOGRAPH

      static final int UNIFIED_IDEOGRAPH

      Binary property Unified_Ideograph (new).

      For programmatic determination of Ideographic Description Sequences.

      See Also:
    • UPPERCASE

      static final int UPPERCASE

      Binary property Uppercase.

      Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().

      Lu+Other_Uppercase

      See Also:
    • WHITE_SPACE

      static final int WHITE_SPACE

      Binary property White_Space.

      Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace().

      Space characters+TAB+CR+LF-ZWSP-ZWNBSP
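
The difference between the White_Space property and a Java-style isWhitespace() test can be made concrete with a small sketch. It uses only the JDK: java.util.regex.Pattern's IsWhite_Space binary property stands in for the Unicode property, and java.lang.Character.isWhitespace stands in for the Java-style test. This is an assumption made for illustration; ICU4J results depend on its own Unicode data. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

// Sketch only: JDK stand-ins for the property and the Java-style test.
final class WhiteSpaceSketch {
    private static final Pattern WHITE_SPACE = Pattern.compile("\\p{IsWhite_Space}");

    /** True if the JDK regex engine reports White_Space for the code point. */
    static boolean hasWhiteSpaceProperty(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return WHITE_SPACE.matcher(s).matches();
    }

    public static void main(String[] args) {
        int nbsp = 0x00A0;          // NO-BREAK SPACE
        int zwsp = 0x200B;          // ZERO WIDTH SPACE
        int fileSeparator = 0x001C; // INFORMATION SEPARATOR FOUR
        for (int cp : new int[] {nbsp, zwsp, fileSeparator}) {
            System.out.printf("U+%04X property=%b isWhitespace=%b%n",
                    cp, hasWhiteSpaceProperty(cp), Character.isWhitespace(cp));
        }
    }
}
```

NO-BREAK SPACE has the property but fails the Java-style test, U+001C is the reverse, and ZERO WIDTH SPACE is excluded from the property, matching the "-ZWSP" term above.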

      See Also:
    • XID_CONTINUE

      static final int XID_CONTINUE

      Binary property XID_Continue.

      ID_Continue modified to allow closure under normalization forms NFKC and NFKD.

      See Also:
    • XID_START

      static final int XID_START

      Binary property XID_Start.

      ID_Start modified to allow closure under normalization forms NFKC and NFKD.

      See Also:
    • CASE_SENSITIVE

      static final int CASE_SENSITIVE

      Binary property Case_Sensitive.

      Characters that are either the source or the target of a case mapping. Not the same as the general category Cased_Letter.

      See Also:
    • S_TERM

      static final int S_TERM
      Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/).
      See Also:
    • VARIATION_SELECTOR

      static final int VARIATION_SELECTOR
      Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.
      See Also:
    • NFD_INERT

      static final int NFD_INERT
      Binary property NFD_Inert.

      ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used, for example, in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

      There is one such property per normalization form. These properties are computed as follows. An inert character is:

      a) unassigned, or ALL of the following:
      b) of combining class 0.
      c) not decomposed by this normalization form.
      AND if NFC or NFKC,
      d) can never compose with a previous character.
      e) can never compose with a following character.
      f) can never change if another character is added.

      Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve.

      See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h.
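
The a-breve example above can be reproduced with the JDK's java.text.Normalizer. This is a minimal sketch that does not use ICU4J; the class and method names are hypothetical. It shows why a-breve fails condition f under NFC: appending a combining ogonek changes the already-composed character.

```java
import java.text.Normalizer;

// Sketch only: uses the JDK normalizer to illustrate non-inert behavior.
final class InertSketch {
    static String nfc(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        String aBreve = "\u0103";             // LATIN SMALL LETTER A WITH BREVE
        String withOgonek = aBreve + "\u0328"; // append COMBINING OGONEK

        // On its own, a-breve is already in NFC and stays unchanged.
        System.out.println(nfc(aBreve).equals(aBreve));

        // With the ogonek appended, the result is a-ogonek (U+0105)
        // followed by COMBINING BREVE (U+0306).
        System.out.println(nfc(withOgonek).equals("\u0105\u0306"));
    }
}
```

The ogonek has a lower combining class than the breve, so canonical reordering moves it next to the base letter, where it composes first.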
      See Also:
    • NFKD_INERT

      static final int NFKD_INERT
      Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
      See Also:
    • NFC_INERT

      static final int NFC_INERT
      Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
      See Also:
    • NFKC_INERT

      static final int NFKC_INERT
      Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
      See Also:
    • SEGMENT_STARTER

      static final int SEGMENT_STARTER
      Binary property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like the combining diaeresis in NFD(a-umlaut) or a Jamo T in NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.
      See Also:
    • PATTERN_SYNTAX

      static final int PATTERN_SYNTAX
      Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/).
      See Also:
    • PATTERN_WHITE_SPACE

      static final int PATTERN_WHITE_SPACE
      Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/).
      See Also:
    • POSIX_ALNUM

      static final int POSIX_ALNUM
      Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
      See Also:
    • POSIX_BLANK

      static final int POSIX_BLANK
      Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
      See Also:
    • POSIX_GRAPH

      static final int POSIX_GRAPH
      Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
      See Also:
    • POSIX_PRINT

      static final int POSIX_PRINT
      Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
      See Also:
    • POSIX_XDIGIT

      static final int POSIX_XDIGIT
      Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
      See Also:
    • CASED

      static final int CASED
      Binary property Cased. For Lowercase, Uppercase and Titlecase characters.
      See Also:
    • CASE_IGNORABLE

      static final int CASE_IGNORABLE
      Binary property Case_Ignorable. Used in context-sensitive case mappings.
      See Also:
    • CHANGES_WHEN_LOWERCASED

      static final int CHANGES_WHEN_LOWERCASED
      Binary property Changes_When_Lowercased.
      See Also:
    • CHANGES_WHEN_UPPERCASED

      static final int CHANGES_WHEN_UPPERCASED
      Binary property Changes_When_Uppercased.
      See Also:
    • CHANGES_WHEN_TITLECASED

      static final int CHANGES_WHEN_TITLECASED
      Binary property Changes_When_Titlecased.
      See Also:
    • CHANGES_WHEN_CASEFOLDED

      static final int CHANGES_WHEN_CASEFOLDED
      Binary property Changes_When_Casefolded.
      See Also:
    • CHANGES_WHEN_CASEMAPPED

      static final int CHANGES_WHEN_CASEMAPPED
      Binary property Changes_When_Casemapped.
      See Also:
    • CHANGES_WHEN_NFKC_CASEFOLDED

      static final int CHANGES_WHEN_NFKC_CASEFOLDED
      Binary property Changes_When_NFKC_Casefolded.
      See Also:
    • EMOJI

      static final int EMOJI
      Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • EMOJI_PRESENTATION

      static final int EMOJI_PRESENTATION
      Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • EMOJI_MODIFIER

      static final int EMOJI_MODIFIER
      Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • EMOJI_MODIFIER_BASE

      static final int EMOJI_MODIFIER_BASE
      Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • EMOJI_COMPONENT

      static final int EMOJI_COMPONENT
      Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • REGIONAL_INDICATOR

      static final int REGIONAL_INDICATOR
      Binary property Regional_Indicator.
      See Also:
    • PREPENDED_CONCATENATION_MARK

      static final int PREPENDED_CONCATENATION_MARK
      Binary property Prepended_Concatenation_Mark.
      See Also:
    • EXTENDED_PICTOGRAPHIC

      static final int EXTENDED_PICTOGRAPHIC
      Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties
      See Also:
    • BASIC_EMOJI

      static final int BASIC_EMOJI
      Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • EMOJI_KEYCAP_SEQUENCE

      static final int EMOJI_KEYCAP_SEQUENCE
      Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • RGI_EMOJI_MODIFIER_SEQUENCE

      static final int RGI_EMOJI_MODIFIER_SEQUENCE
      Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • RGI_EMOJI_FLAG_SEQUENCE

      static final int RGI_EMOJI_FLAG_SEQUENCE
      Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • RGI_EMOJI_TAG_SEQUENCE

      static final int RGI_EMOJI_TAG_SEQUENCE
      Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • RGI_EMOJI_ZWJ_SEQUENCE

      static final int RGI_EMOJI_ZWJ_SEQUENCE
      Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • RGI_EMOJI

      static final int RGI_EMOJI
      Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
      See Also:
    • BINARY_LIMIT

      @Deprecated static final int BINARY_LIMIT
      Deprecated.
      ICU 58 The numeric value may change over time, see ICU ticket #12420.
      One more than the last constant for binary Unicode properties.
      See Also:
    • BIDI_CLASS

      static final int BIDI_CLASS
      Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.
      See Also:
    • INT_START

      static final int INT_START
      First constant for enumerated/integer Unicode properties.
      See Also:
    • BLOCK

      static final int BLOCK
      Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.
      See Also:
    • CANONICAL_COMBINING_CLASS

      static final int CANONICAL_COMBINING_CLASS
      Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.
      See Also:
    • DECOMPOSITION_TYPE

      static final int DECOMPOSITION_TYPE
      Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.
      See Also:
    • EAST_ASIAN_WIDTH

      static final int EAST_ASIAN_WIDTH
      Enumerated property East_Asian_Width. See UAX #11 (http://www.unicode.org/reports/tr11/). Returns UCharacter.EastAsianWidth values.
      See Also:
    • GENERAL_CATEGORY

      static final int GENERAL_CATEGORY
      Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.
      See Also:
    • JOINING_GROUP

      static final int JOINING_GROUP
      Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.
      See Also:
    • JOINING_TYPE

      static final int JOINING_TYPE
      Enumerated property Joining_Type. Returns UCharacter.JoiningType values.
      See Also:
    • LINE_BREAK

      static final int LINE_BREAK
      Enumerated property Line_Break. Returns UCharacter.LineBreak values.
      See Also:
    • NUMERIC_TYPE

      static final int NUMERIC_TYPE
      Enumerated property Numeric_Type. Returns UCharacter.NumericType values.
      See Also:
    • SCRIPT

      static final int SCRIPT
      Enumerated property Script. Same as UScript.getScript(int), returns UScript values.
      See Also:
    • HANGUL_SYLLABLE_TYPE

      static final int HANGUL_SYLLABLE_TYPE
      Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns UCharacter.HangulSyllableType values.
      See Also:
    • NFD_QUICK_CHECK

      static final int NFD_QUICK_CHECK
      Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
      See Also:
    • NFKD_QUICK_CHECK

      static final int NFKD_QUICK_CHECK
      Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
      See Also:
    • NFC_QUICK_CHECK

      static final int NFC_QUICK_CHECK
      Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
      See Also:
    • NFKC_QUICK_CHECK

      static final int NFKC_QUICK_CHECK
      Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
      See Also:
    • LEAD_CANONICAL_COMBINING_CLASS

      static final int LEAD_CANONICAL_COMBINING_CLASS
      Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
      See Also:
    • TRAIL_CANONICAL_COMBINING_CLASS

      static final int TRAIL_CANONICAL_COMBINING_CLASS
      Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or tccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
      See Also:
    • GRAPHEME_CLUSTER_BREAK

      static final int GRAPHEME_CLUSTER_BREAK
      Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.GraphemeClusterBreak values.
      See Also:
    • SENTENCE_BREAK

      static final int SENTENCE_BREAK
      Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.SentenceBreak values.
      See Also:
    • WORD_BREAK

      static final int WORD_BREAK
      Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/). Returns UCharacter.WordBreak values.
      See Also:
    • BIDI_PAIRED_BRACKET_TYPE

      static final int BIDI_PAIRED_BRACKET_TYPE
      Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/). Returns UCharacter.BidiPairedBracketType values.
      See Also:
    • INDIC_POSITIONAL_CATEGORY

      static final int INDIC_POSITIONAL_CATEGORY
      Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt
      See Also:
    • INDIC_SYLLABIC_CATEGORY

      static final int INDIC_SYLLABIC_CATEGORY
      Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt
      See Also:
    • VERTICAL_ORIENTATION

      static final int VERTICAL_ORIENTATION
      Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.
      See Also:
    • INT_LIMIT

      @Deprecated static final int INT_LIMIT
      Deprecated.
      ICU 58 The numeric value may change over time, see ICU ticket #12420.
      One more than the last constant for enumerated/integer Unicode properties.
      See Also:
    • GENERAL_CATEGORY_MASK

      static final int GENERAL_CATEGORY_MASK
      Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".
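
The single-bit and multi-bit masks described above can be sketched as follows. This example assumes that the mask for one category is 1 shifted left by the category's numeric value, and it takes those values from the JDK's java.lang.Character rather than from UCharacterCategory. The class, constant, and method names are hypothetical.

```java
// Sketch only: category values come from the JDK, not from ICU4J.
final class GcMaskSketch {
    /** Single-bit mask for the code point's general category. */
    static int categoryMask(int codePoint) {
        return 1 << Character.getType(codePoint);
    }

    /** Multi-bit mask covering the letter categories Lu, Ll, Lt, Lm, Lo. */
    static final int LETTER_MASK =
            (1 << Character.UPPERCASE_LETTER)
            | (1 << Character.LOWERCASE_LETTER)
            | (1 << Character.TITLECASE_LETTER)
            | (1 << Character.MODIFIER_LETTER)
            | (1 << Character.OTHER_LETTER);

    /** True if the code point's category bit is part of LETTER_MASK. */
    static boolean isLetter(int codePoint) {
        return (categoryMask(codePoint) & LETTER_MASK) != 0;
    }
}
```

A single AND against a precomputed mask tests membership in a whole set of categories, which is the point of exposing General_Category as a bit mask.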
      See Also:
    • MASK_START

      static final int MASK_START
      First constant for bit-mask Unicode properties.
      See Also:
    • MASK_LIMIT

      @Deprecated static final int MASK_LIMIT
      Deprecated.
      ICU 58. The numeric value may change over time; see ICU ticket #12420.
      One more than the last constant for bit-mask Unicode properties.
      See Also:
    • NUMERIC_VALUE

      static final int NUMERIC_VALUE
      Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).
      See Also:
    • DOUBLE_START

      static final int DOUBLE_START
      First constant for double Unicode properties.
      See Also:
    • DOUBLE_LIMIT

      @Deprecated static final int DOUBLE_LIMIT
      Deprecated.
      ICU 58. The numeric value may change over time; see ICU ticket #12420.
      One more than the last constant for double Unicode properties.
      See Also:
    • AGE

      static final int AGE
      String property Age. Corresponds to UCharacter.getAge(int).
      See Also:
    • STRING_START

      static final int STRING_START
      First constant for string Unicode properties.
      See Also:
    • BIDI_MIRRORING_GLYPH

      static final int BIDI_MIRRORING_GLYPH
      String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).
      See Also:
    • CASE_FOLDING

      static final int CASE_FOLDING
      String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).
      See Also:
    • ISO_COMMENT

      @Deprecated static final int ISO_COMMENT
      Deprecated.
      ICU 49
      Deprecated string property ISO_Comment. Corresponds to UCharacter.getISOComment(int).
      See Also:
    • LOWERCASE_MAPPING

      static final int LOWERCASE_MAPPING
      String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).
      See Also:
    • NAME

      static final int NAME
      String property Name. Corresponds to UCharacter.getName(int).
      See Also:
    • SIMPLE_CASE_FOLDING

      static final int SIMPLE_CASE_FOLDING
      String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).
      See Also:
    • SIMPLE_LOWERCASE_MAPPING

      static final int SIMPLE_LOWERCASE_MAPPING
      String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).
      See Also:
    • SIMPLE_TITLECASE_MAPPING

      static final int SIMPLE_TITLECASE_MAPPING
      String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).
      See Also:
    • SIMPLE_UPPERCASE_MAPPING

      static final int SIMPLE_UPPERCASE_MAPPING
      String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).
      See Also:
    • TITLECASE_MAPPING

      static final int TITLECASE_MAPPING
      String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).
      See Also:
    • UNICODE_1_NAME

      @Deprecated static final int UNICODE_1_NAME
      Deprecated.
      ICU 49
      String property Unicode_1_Name. This property is of little practical value. Beginning with ICU 49, ICU APIs return null or an empty string for this property. Corresponds to UCharacter.getName1_0(int).
      See Also:
    • UPPERCASE_MAPPING

      static final int UPPERCASE_MAPPING
      String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).
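
      The difference between the Simple_*_Mapping properties (one code point in, one code point out) and the full *_Mapping properties (string in, string out, possibly of different length) can be seen with the JDK's own case functions. This sketch assumes the JDK methods behave analogously to the UCharacter methods named above; the class and method names (CaseMappingSketch, simpleUpper, fullUpper) are hypothetical.

```java
import java.util.Locale;

// JDK-only sketch contrasting simple and full case mappings.
class CaseMappingSketch {

    // Simple mapping: a single code point maps to a single code point.
    static int simpleUpper(int codePoint) {
        return Character.toUpperCase(codePoint);
    }

    // Full mapping: works on strings and may change the length.
    static String fullUpper(String s) {
        return s.toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        int sharpS = 0x00DF; // LATIN SMALL LETTER SHARP S

        // No simple uppercase mapping exists, so the code point is unchanged.
        System.out.println(simpleUpper(sharpS) == sharpS); // true

        // The full mapping expands to two letters.
        System.out.println(fullUpper("\u00DF")); // SS

        // Simple titlecase of U+01C4 (DZ with caron digraph) is U+01C5.
        System.out.println(Integer.toHexString(Character.toTitleCase(0x01C4))); // 1c5
    }
}
```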
      See Also:
    • BIDI_PAIRED_BRACKET

      static final int BIDI_PAIRED_BRACKET
      String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket(int).
      See Also:
    • STRING_LIMIT

      @Deprecated static final int STRING_LIMIT
      Deprecated.
      ICU 58. The numeric value may change over time; see ICU ticket #12420.
      One more than the last constant for string Unicode properties.
      See Also:
    • SCRIPT_EXTENSIONS

      static final int SCRIPT_EXTENSIONS
      Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript(int, int) and UScript.getScriptExtensions(int, BitSet).
      See Also:
    • OTHER_PROPERTY_START

      static final int OTHER_PROPERTY_START
      First constant for Unicode properties with unusual value types.
      See Also:
    • OTHER_PROPERTY_LIMIT

      @Deprecated static final int OTHER_PROPERTY_LIMIT
      Deprecated.
      ICU 58. The numeric value may change over time; see ICU ticket #12420.
      One more than the last constant for Unicode properties with unusual value types.
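
      Because the *_LIMIT constants are deprecated, code that needs to tell property kinds apart can compare a selector against the *_START constants instead. The sketch below is one possible approach, not an ICU API. The numeric values are assumptions copied into local constants so the example compiles without ICU4J; real code should reference UProperty.INT_START and its siblings directly. The class and method names (PropertyRangeSketch, kindOf) are hypothetical, and the method only identifies the range a selector falls in; it does not check that the selector names a defined property.

```java
// JDK-only sketch: classify a property selector by the range it falls in.
class PropertyRangeSketch {

    // Assumed values mirroring the UProperty *_START constants.
    static final int BINARY_START = 0;
    static final int INT_START = 0x1000;
    static final int MASK_START = 0x2000;
    static final int DOUBLE_START = 0x3000;
    static final int STRING_START = 0x4000;
    static final int OTHER_PROPERTY_START = 0x7000;

    // Uses the START of the next range as an exclusive upper bound,
    // so no deprecated LIMIT constant is needed.
    static String kindOf(int selector) {
        if (selector < BINARY_START) return "invalid";
        if (selector < INT_START) return "binary";
        if (selector < MASK_START) return "int";
        if (selector < DOUBLE_START) return "mask";
        if (selector < STRING_START) return "double";
        if (selector < OTHER_PROPERTY_START) return "string";
        return "other";
    }

    public static void main(String[] args) {
        System.out.println(kindOf(MASK_START));           // mask
        System.out.println(kindOf(STRING_START));         // string
        System.out.println(kindOf(OTHER_PROPERTY_START)); // other
    }
}
```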
      See Also: