Package com.ibm.icu.text
package com.ibm.icu.text
Extensions and enhancements to java.text to support unicode transforms, UnicodeSet, surrogate char utilities, UCA collation, normalization, break iteration (rule and dictionary based), enhanced number format, international string searching, and arabic shaping.
- Unicode Transforms (Transliteration) convert between different representations of unicode text.
- UnicodeSet provides set operations on unicode characters and strings, and are representable as compact expressions.
- Surrogate character utilities (UTF16) provides indexing and substring operations on text containing surrogates.
- UCA collation implements the current Unicode Collation Algorithm. Data for many locale-specific collations is provided.
- Normalization supports standard Unicode normalization forms as well as FCD.
- Break iteration supports character, word, line, sentences, and title case breaks.
- Enhanced number format to 'spell out' numbers, padding and rounding control.
- Internationally sensitive string searching, uses collation.
- Arabic shaping converts between shaped and unshaped arabic characters and digits.
-
ClassDescriptionAlphabeticIndex supports the creation of a UI index appropriate for a given language.An index "bucket" with a label string and type.Type of the labelImmutable, thread-safe version of
AlphabeticIndex
.A (name, data) pair, to be sorted by name into one of the index buckets.Shape Arabic text on a character basis.Thrown by ArabicShaping when there is a shaping error.Bidi algorithm for ICUOverrides default Bidi class values with custom ones.A BidiRun represents a sequence of characters at the same embedding level.Bidi Layout Transformation Engine.
indicates whether or not characters with the "mirrored" property in RTL runs should be replaced with their mirror-image counterparts.BidiTransform.Mirroring
indicates the order of text.BidiTransform.Order
.This class allows one to iterate through all the strings that are canonically equivalent to a given string.Low-level case mapping options and methods.Case folding options and methods.Lowercasing options and methods.Titlecasing options and methods.Uppercasing options and methods.CharsetDetector
provides a facility for detecting the charset or encoding of character data in an unknown format.This class represents a charset that has been identified by a CharsetDetector as a possible encoding for a set of input data.Deprecated.ICU 50 Use SimpleDateFormat instead.Deprecated.ICU 50Deprecated.ICU 50CollationElementIterator
is an iterator created by a RuleBasedCollator to walk through a string.ACollationKey
represents aString
under the rules of a specificCollator
object.Options that used in the API CollationKey.getBound() for getting a CollationKey based on the bound mode requested..A factory used with registerFactory to register multiple collators and provide display names for them.Reordering codes for non-script groups that can be reordered under collation.Formats numbers in compact (abbreviated) notation, like "1.2K" instead of "1200".Style parameter for CompactDecimalFormat.Deprecated.ICU 2.2Represents a span of a string containing a given field.Returns currency names localized for a locale.Provides information about currencies that is not specific to a locale.Represents the raw information about fraction digits and rounding increment.A filter used to select which currency info is returned.Represents a complete currency info record listing the region, currency, from and to dates, and priority.This class represents the information needed by DecimalFormat to format currency plural, such as "3.00 US dollars" or "1.00 US dollar"..boolean attributesThe instances of this inner class are used as attribute keys and values in AttributedCharacterIterator that DateFormat.formatToCharacterIterator() method returns.Hour Cycle.DateIntervalFormat is a class for formatting and parsing date intervals in a language-independent manner.An immutable class containing the result of a date interval formatting operation.Class for span fields in FormattedDateInterval.DateIntervalInfo is a public class for encapsulating localizable date time interval patterns.PatternInfo class saves the first and second part of interval pattern, and whether the interval pattern is earlier date first.This class provides flexible generation of date format patterns, like "yy-MM-dd".Field display name width constants for getFieldDisplayNameDeprecated.This API is ICU internal only.PatternInfo supplies output parameters for addPattern(...).Deprecated.This API is ICU internal only..Deprecated.This API is ICU internal only.. This class represents the set of symbols (such as the decimal separator, the grouping separator, and so on) needed byDecimalFormat
to format numbers.Display context settings.Type values for DisplayContextRepresents all the display options that are supported by CLDR such as grammatical case, noun class, ... etc.Responsible for buildingDisplayOptions
.Represents all the capitalization options.Represents all the display lengths.Represents all the grammatical cases that are supported by CLDR.Represents all the name styles.Represents all the grammatical noun classes that are supported by CLDR.Standard CLDR plural category constants.Represents all the substitute handlings.Deprecated.ICU 56 Use MeasureFormat or RelativeDateTimeFormatter instead.Records lengths of string edits but not replacement text.Access to the list of edits.The BreakIteratorFilter is used to modify the behavior of a BreakIterator by constructing a new BreakIterator which suppresses certain segment boundaries.Normalization filtered by a UnicodeSet.An abstract formatted value: a string with associated field attributes.Abstract base class for IDNA processing.IDNA error bit set values.Output container for IDNA processing errors.Immutable class for formatting a list, using data from CLDR (or supplied separately).Field selectors for format fields defined by ListFormatter.An immutable class containing the result of a list formatting operation.Class for span fields in FormattedList.Type of meaning expressed by the list.Verbosity level of the list patterns.Returns display names of ULocales and components of ULocales.Enum used inLocaleDisplayNames.getInstance(ULocale, DialectHandling)
.Struct-like class used to return information for constructing a UI list, each corresponding to a locale.A formatter for Measure objects.Formatting width enum..Defines constants that are used as attribute keys in theAttributedCharacterIterator
returned fromMessageFormat.formatToCharacterIterator
.Parses and represents ICU MessageFormat patterns.Mode for when an apostrophe starts quoted literal text for MessageFormat output.Argument type constants.A message pattern "part", representing a pattern parsing event.Part type constants.Utilities for working with a MessagePattern.A piece of MessageNode contents representing a message argument and its details.A Node representing details of the argument style of a complex argument.A piece of MessageNode contents.The type of a piece of MessageNode contents.A Node representing a parsed MessageFormat pattern string.Common base class for all elements in a tree of nodes returned byMessagePatternUtil.buildMessageNode(MessagePattern)
.Literal text, a piece of MessageNode contents.A Node representing a nested message (nested inside an argument) with its selection criterion.Old Unicode normalization API.Deprecated.ICU 56 UseNormalizer2
instead.Result values for quickCheck().Unicode normalization functionality for standard Unicode normalization or for using custom mapping tables.Constants for normalization modes..The instances of this inner class are used as attribute keys and values in AttributedCharacterIterator that NumberFormat.formatToCharacterIterator() method returns.A NumberFormatFactory is used to register new number formats.A NumberFormatFactory that supports a single locale.NumberingSystem
is the base class for all number systems.An object used to provide name data to the PersonNameFormatter for formatting.Identifiers for the name field modifiers supported by the PersonName and PersonNameFormatter objects.Identifiers for the name fields supported by the PersonName object.An enum to specify the preferred field order for the name.A class for formatting names of people.A utility class that can be used to construct a PersonNameFormatter.An enum indicating the desired display order for a formatted name.Specifies the intended formality of the formatted name.Specifies the desired length of the formatted name.Specifies the intended usage of the formatted name.PluralFormat
supports the creation of internationalized messages with plural inflection.Defines rules for mapping non-negative numeric values onto a small set of keywords.Deprecated.This API is ICU internal only.Deprecated.This API is ICU internal only.Deprecated.This API is ICU internal only.Deprecated.This API is ICU internal only.Deprecated.This API is ICU internal only.Status of the keyword for the rules, given a set of explicit values.Deprecated.This API is ICU internal only.Type of plurals and PluralRules.Deprecated.This API is ICU internal only.Simple class wrapper to store the internal byte representation of a CollationKey.Deprecated.ICU 54Deprecated.ICU 54Formats simple relative dates.Represents an absolute unit.Represents a direction for an absolute unit e.g "Next Tuesday" or "Last Tuesday"Field constants used when accessing field information for relative datetime strings in FormattedValue.Represents the result of a formatting operation of a relative datetime.Represents the unit for formatting a relative date. e.g "in 5 days" or "next year"Represents the unit for formatting a relative date. e.g "in 5 days" or "in 3 months"The formatting styleReplaceable
is an interface representing a string of characters that supports the replacement of a range of itself with a new string of characters.ReplaceableString
is an adapter class that implements theReplaceable
API around an ordinaryStringBuffer
.Rule Based Break Iterator This is a port of the C++ class RuleBasedBreakIterator from ICU4C.RuleBasedCollator is a concrete subclass of Collator.A class that formats numbers according to a set of rules.Deprecated.This API is ICU internal only.A formatter that formats numbers in user-friendly scientific notation.SearchIterator is an abstract base class that provides methods to search for a pattern within a text string.Option to control how collation elements are compared.SelectFormat
supports the creation of internationalized messages by selecting phrases based on keywords..Formats simple patterns like "{1} was born in {0}".A concrete implementation of PersonNameFormatter.PersonName that simply stores the field values in a Map.A utility class for constructing a SimplePersonName.This class, based on Unicode Technical Report #36 and Unicode Technical Standard #39, has two main functions: Checking whether two strings are visually confusable with each other, such as "desparejado" and "ԁеѕрагејаԁо". Checking whether an individual string is likely to be an attempt at confusing the reader (spoof detection), such as "pаypаl" spelled with Cyrillic 'а' characters.SpoofChecker Builder.A struct-like class to hold the results of a Spoof Check operation.Constants from UTS 39 for use in setRestrictionLevel.Deprecated.ICU 2.4.StringPrep API implements the StingPrep framework as described by RFC 3454.Exception that signals an error has occurred while parsing the input to StringPrep or IDNA.StringSearch is aSearchIterator
that provides language-sensitive text searching based on the comparison rules defined in aRuleBasedCollator
object.Provide a base class for Transforms that focuses just on the transformation of the text.An interface that defines both lookup protocol and parsing of symbolic names.Deprecated.ICU 53 useMeasureFormat
instead.TimeZoneFormat
supports time zone display name formatting and parsing.Offset pattern type enum.Parse option enum, used for specifying optional parse behavior.Time zone display format style enum used by format/parse APIs inTimeZoneFormat
.Time type enum used for receiving time type (standard time, daylight time or unknown) inTimeZoneFormat
APIs.TimeZoneNames
is an abstract class representing the time zone display name data model defined by UTS#35 Unicode Locale Data Markup Language (LDML).Deprecated.This API is ICU internal only.AMatchInfo
represents a time zone name match used byTimeZoneNames.find(CharSequence, int, EnumSet)
.Time zone display name typesTransform<S,D> Provide an interface for Transforms that focuses just on the transformation of the text.Transliterator
is an abstract class that transliterates text from one format to another.The factory interface for transliterators.Position structure for incremental transliteration.Abstract class that defines an API for iteration on text objects.This is an interface for forward and backward iteration and random access into a text object.Deprecated.This API is ICU internal only.An abstract class that extendsFormat
to provide additional ICU protocol, specifically, the getLocale() API.A field that represents a span of text that may be composed with other fields.Interface that defines an API for forward-only iteration on text objects.A compression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.A decompression engine implementing the Standard Compression Scheme for Unicode (SCSU) as outlined in Unicode Technical Report #6.UnicodeFilter
defines a protocol for selecting a subset of the full range (U+0000 to U+FFFF) of Unicode characters.UnicodeMatcher
defines a protocol for objects that can match a range of characters in a Replaceable string.A mutable set of Unicode characters and multicharacter strings.Comparison style enums used byUnicodeSet.compareTo(UnicodeSet, ComparisonStyle)
.A struct-like class used for iteration through ranges, for faster iteration than by String.Argument values for whether span() and similar functions continue while the current character is contained vs.Internal class for customizing UnicodeSet parsing of properties.UnicodeSetIterator iterates over the contents of a UnicodeSet.A helper class used to count, replace, and trim CharSequences based on UnicodeSet matches.Options for replaceFrom and countIn to control how to treat each matched span.Options for the trim() methodStandalone utility class providing UTF16 character conversions and indexing conversions.UTF16 string comparator class.