Interface UProperty
-
public interface UPropertySelection constants for Unicode properties.
These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.
The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).
For details about the properties see UAX #44: Unicode Character Database.
Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are not or not fully available. Check UCharacter.getUnicodeVersion() to be sure.
- Author:
- Syn Wee Quek
- See Also:
UCharacter- Status:
- Stable ICU 2.6.
-
-
Nested Class Summary
Nested Classes Modifier and Type Interface Description static interfaceUProperty.NameChoiceSelector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName().
-
Field Summary
Fields Modifier and Type Field Description static intAGEString property Age.static intALPHABETICBinary property Alphabetic.static intASCII_HEX_DIGITBinary property ASCII_Hex_Digit (0-9 A-F a-f).static intBASIC_EMOJIBinary property of strings Basic_Emoji.static intBIDI_CLASSEnumerated property Bidi_Class.static intBIDI_CONTROLBinary property Bidi_Control.static intBIDI_MIRROREDBinary property Bidi_Mirrored.static intBIDI_MIRRORING_GLYPHString property Bidi_Mirroring_Glyph.static intBIDI_PAIRED_BRACKETString property Bidi_Paired_Bracket (new in Unicode 6.3).static intBIDI_PAIRED_BRACKET_TYPEEnumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3).static intBINARY_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intBINARY_STARTFirst constant for binary Unicode properties.static intBLOCKEnumerated property Block.static intCANONICAL_COMBINING_CLASSEnumerated property Canonical_Combining_Class.static intCASE_FOLDINGString property Case_Folding.static intCASE_IGNORABLEBinary property Case_Ignorable.static intCASE_SENSITIVEBinary property Case_Sensitive.static intCASEDBinary property Cased.static intCHANGES_WHEN_CASEFOLDEDBinary property Changes_When_Casefolded.static intCHANGES_WHEN_CASEMAPPEDBinary property Changes_When_Casemapped.static intCHANGES_WHEN_LOWERCASEDBinary property Changes_When_Lowercased.static intCHANGES_WHEN_NFKC_CASEFOLDEDBinary property Changes_When_NFKC_Casefolded.static intCHANGES_WHEN_TITLECASEDBinary property Changes_When_Titlecased.static intCHANGES_WHEN_UPPERCASEDBinary property Changes_When_Uppercased.static intDASHBinary property Dash.static intDECOMPOSITION_TYPEEnumerated property Decomposition_Type.static intDEFAULT_IGNORABLE_CODE_POINTBinary property Default_Ignorable_Code_Point (new).static intDEPRECATEDBinary property Deprecated (new).static intDIACRITICBinary property Diacritic.static intDOUBLE_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intDOUBLE_STARTFirst constant for double Unicode properties.static intEAST_ASIAN_WIDTHEnumerated property East_Asian_Width.static intEMOJIBinary property Emoji.static intEMOJI_COMPONENTBinary property Emoji_Component.static intEMOJI_KEYCAP_SEQUENCEBinary property of strings Emoji_Keycap_Sequence.static intEMOJI_MODIFIERBinary property Emoji_Modifier.static intEMOJI_MODIFIER_BASEBinary property Emoji_Modifier_Base.static intEMOJI_PRESENTATIONBinary property Emoji_Presentation.static intEXTENDED_PICTOGRAPHICBinary property Extended_Pictographic.static intEXTENDERBinary property Extender.static intFULL_COMPOSITION_EXCLUSIONBinary property Full_Composition_Exclusion.static intGENERAL_CATEGORYEnumerated property General_Category.static intGENERAL_CATEGORY_MASKBitmask property General_Category_Mask.static intGRAPHEME_BASEBinary property Grapheme_Base (new).static intGRAPHEME_CLUSTER_BREAKEnumerated property Grapheme_Cluster_Break (new in Unicode 4.1).static intGRAPHEME_EXTENDBinary property Grapheme_Extend (new).static intGRAPHEME_LINKBinary property Grapheme_Link (new).static intHANGUL_SYLLABLE_TYPEEnumerated property Hangul_Syllable_Type, new in Unicode 4.static intHEX_DIGITBinary property Hex_Digit.static intHYPHENBinary property Hyphen.static intID_COMPAT_MATH_CONTINUEBinary property ID_Compat_Math_Continue.static intID_COMPAT_MATH_STARTBinary property ID_Compat_Math_Start.static intID_CONTINUEBinary property ID_Continue.static intID_STARTBinary property ID_Start.static intIDENTIFIER_STATUSEnumerated property Identifier_Status.static intIDENTIFIER_TYPEMiscellaneous property Identifier_Type.static intIDEOGRAPHICBinary property Ideographic.static intIDS_BINARY_OPERATORBinary property IDS_Binary_Operator (new).static intIDS_TRINARY_OPERATORBinary property IDS_Trinary_Operator (new).static intIDS_UNARY_OPERATORBinary property IDS_Unary_Operator.static intINDIC_CONJUNCT_BREAKEnumerated property Indic_Conjunct_Break.static intINDIC_POSITIONAL_CATEGORYEnumerated property Indic_Positional_Category.static intINDIC_SYLLABIC_CATEGORYEnumerated property Indic_Syllabic_Category.static intINT_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intINT_STARTFirst constant for enumerated/integer Unicode properties.static intISO_COMMENTDeprecated.ICU 49static intJOIN_CONTROLBinary property Join_Control.static intJOINING_GROUPEnumerated property Joining_Group.static intJOINING_TYPEEnumerated property Joining_Type.static intLEAD_CANONICAL_COMBINING_CLASSEnumerated property Lead_Canonical_Combining_Class.static intLINE_BREAKEnumerated property Line_Break.static intLOGICAL_ORDER_EXCEPTIONBinary property Logical_Order_Exception (new).static intLOWERCASEBinary property Lowercase.static intLOWERCASE_MAPPINGString property Lowercase_Mapping.static intMASK_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intMASK_STARTFirst constant for bit-mask Unicode properties.static intMATHBinary property Math.static intMODIFIER_COMBINING_MARKBinary property Modifier_Combining_Mark.static intNAMEString property Name.static intNFC_INERTBinary property NFC_Inert.static intNFC_QUICK_CHECKEnumerated property NFC_Quick_Check.static intNFD_INERTBinary property NFD_Inert.static intNFD_QUICK_CHECKEnumerated property NFD_Quick_Check.static intNFKC_INERTBinary property NFKC_Inert.static intNFKC_QUICK_CHECKEnumerated property NFKC_Quick_Check.static intNFKD_INERTBinary property NFKD_Inert.static intNFKD_QUICK_CHECKEnumerated property NFKD_Quick_Check.static intNONCHARACTER_CODE_POINTBinary property Noncharacter_Code_Point.static intNUMERIC_TYPEEnumerated property Numeric_Type.static intNUMERIC_VALUEDouble property Numeric_Value.static intOTHER_PROPERTY_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intOTHER_PROPERTY_STARTFirst constant for Unicode properties with unusual value types.static intPATTERN_SYNTAXBinary property Pattern_Syntax (new in Unicode 4.1).static intPATTERN_WHITE_SPACEBinary property Pattern_White_Space (new in Unicode 4.1).static intPOSIX_ALNUMBinary property alnum (a C/POSIX character class).static intPOSIX_BLANKBinary property blank (a C/POSIX character class).static intPOSIX_GRAPHBinary property graph (a C/POSIX character class).static intPOSIX_PRINTBinary property print (a C/POSIX character class).static intPOSIX_XDIGITBinary property xdigit (a C/POSIX character class).static intPREPENDED_CONCATENATION_MARKBinary property Prepended_Concatenation_Mark.static intQUOTATION_MARKBinary property Quotation_Mark.static intRADICALBinary property Radical (new).static intREGIONAL_INDICATORBinary property Regional_Indicator.static intRGI_EMOJIBinary property of strings RGI_Emoji.static intRGI_EMOJI_FLAG_SEQUENCEBinary property of strings RGI_Emoji_Flag_Sequence.static intRGI_EMOJI_MODIFIER_SEQUENCEBinary property of strings RGI_Emoji_Modifier_Sequence.static intRGI_EMOJI_TAG_SEQUENCEBinary property of strings RGI_Emoji_Tag_Sequence.static intRGI_EMOJI_ZWJ_SEQUENCEBinary property of strings RGI_Emoji_ZWJ_Sequence.static intS_TERMBinary property STerm (new in Unicode 4.0.1).static intSCRIPTEnumerated property Script.static intSCRIPT_EXTENSIONSMiscellaneous property Script_Extensions (new in Unicode 6.0).static intSEGMENT_STARTERBinary Property Segment_Starter.static intSENTENCE_BREAKEnumerated property Sentence_Break (new in Unicode 4.1).static intSIMPLE_CASE_FOLDINGString property Simple_Case_Folding.static intSIMPLE_LOWERCASE_MAPPINGString property Simple_Lowercase_Mapping.static intSIMPLE_TITLECASE_MAPPINGString property Simple_Titlecase_Mapping.static intSIMPLE_UPPERCASE_MAPPINGString property Simple_Uppercase_Mapping.static intSOFT_DOTTEDBinary property Soft_Dotted (new).static intSTRING_LIMITDeprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.static intSTRING_STARTFirst constant for string Unicode properties.static intTERMINAL_PUNCTUATIONBinary property Terminal_Punctuation.static intTITLECASE_MAPPINGString property Titlecase_Mapping.static intTRAIL_CANONICAL_COMBINING_CLASSEnumerated property Trail_Canonical_Combining_Class.static intUNDEFINEDDeprecated.This API is ICU internal only.static intUNICODE_1_NAMEDeprecated.ICU 49static intUNIFIED_IDEOGRAPHBinary property Unified_Ideograph (new).static intUPPERCASEBinary property Uppercase.static intUPPERCASE_MAPPINGString property Uppercase_Mapping.static intVARIATION_SELECTORBinary property Variation_Selector (new in Unicode 4.0.1).static intVERTICAL_ORIENTATIONEnumerated property Vertical_Orientation.static intWHITE_SPACEBinary property White_Space.static intWORD_BREAKEnumerated property Word_Break (new in Unicode 4.1).static intXID_CONTINUEBinary property XID_Continue.static intXID_STARTBinary property XID_Start.
-
-
-
Field Detail
-
UNDEFINED
@Deprecated static final int UNDEFINED
Deprecated.This API is ICU internal only.Special value indicating undefined property.- See Also:
- Constant Field Values
- Status:
- Internal. This API is ICU internal only.
-
ALPHABETIC
static final int ALPHABETIC
Binary property Alphabetic.
Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().
Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
BINARY_START
static final int BINARY_START
First constant for binary Unicode properties.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
ASCII_HEX_DIGIT
static final int ASCII_HEX_DIGIT
Binary property ASCII_Hex_Digit (0-9 A-F a-f).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
BIDI_CONTROL
static final int BIDI_CONTROL
Binary property Bidi_Control.
Format controls which have specific functions in the Bidi Algorithm.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
BIDI_MIRRORED
static final int BIDI_MIRRORED
Binary property Bidi_Mirrored.
Characters that may change display in RTL text.
Property for UCharacter.isMirrored().
See Bidi Algorithm; UTR 9.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
DASH
static final int DASH
Binary property Dash.
Variations of dashes.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
DEFAULT_IGNORABLE_CODE_POINT
static final int DEFAULT_IGNORABLE_CODE_POINT
Binary property Default_Ignorable_Code_Point (new).
Property that indicates codepoint is ignorable in most processing.
Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
DEPRECATED
static final int DEPRECATED
Binary property Deprecated (new).
The usage of deprecated characters is strongly discouraged.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
DIACRITIC
static final int DIACRITIC
Binary property Diacritic.
Characters that linguistically modify the meaning of another character to which they apply.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
EXTENDER
static final int EXTENDER
Binary property Extender.
Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
FULL_COMPOSITION_EXCLUSION
static final int FULL_COMPOSITION_EXCLUSION
Binary property Full_Composition_Exclusion.
CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
GRAPHEME_BASE
static final int GRAPHEME_BASE
Binary property Grapheme_Base (new).
For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
GRAPHEME_EXTEND
static final int GRAPHEME_EXTEND
Binary property Grapheme_Extend (new).
For programmatic determination of grapheme cluster boundaries.
Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
GRAPHEME_LINK
static final int GRAPHEME_LINK
Binary property Grapheme_Link (new).
For programmatic determination of grapheme cluster boundaries.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
HEX_DIGIT
static final int HEX_DIGIT
Binary property Hex_Digit.
Characters commonly used for hexadecimal numbers.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
HYPHEN
static final int HYPHEN
Binary property Hyphen.
Dashes used to mark connections between pieces of words, plus the Katakana middle dot.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
ID_CONTINUE
static final int ID_CONTINUE
Binary property ID_Continue.
Characters that can continue an identifier.
ID_Start+Mn+Mc+Nd+Pc
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
ID_START
static final int ID_START
Binary property ID_Start.
Characters that can start an identifier.
Lu+Ll+Lt+Lm+Lo+Nl
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
IDEOGRAPHIC
static final int IDEOGRAPHIC
Binary property Ideographic.
CJKV ideographs.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
IDS_BINARY_OPERATOR
static final int IDS_BINARY_OPERATOR
Binary property IDS_Binary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
IDS_TRINARY_OPERATOR
static final int IDS_TRINARY_OPERATOR
Binary property IDS_Trinary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
JOIN_CONTROL
static final int JOIN_CONTROL
Binary property Join_Control.
Format controls for cursive joining and ligation.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
LOGICAL_ORDER_EXCEPTION
static final int LOGICAL_ORDER_EXCEPTION
Binary property Logical_Order_Exception (new).
Characters that do not use logical order and require special handling in most processing.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
LOWERCASE
static final int LOWERCASE
Binary property Lowercase.
Same as UCharacter.isULowercase(), different from UCharacter.islower().
Ll+Other_Lowercase
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
MATH
static final int MATH
Binary property Math.
Sm+Other_Math
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
NONCHARACTER_CODE_POINT
static final int NONCHARACTER_CODE_POINT
Binary property Noncharacter_Code_Point.
Code points that are explicitly defined as illegal for the encoding of characters.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
QUOTATION_MARK
static final int QUOTATION_MARK
Binary property Quotation_Mark.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
RADICAL
static final int RADICAL
Binary property Radical (new).
For programmatic determination of Ideographic Description Sequences.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
SOFT_DOTTED
static final int SOFT_DOTTED
Binary property Soft_Dotted (new).
Characters with a "soft dot", like i or j.
An accent placed on these characters causes the dot to disappear.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
TERMINAL_PUNCTUATION
static final int TERMINAL_PUNCTUATION
Binary property Terminal_Punctuation.
Punctuation characters that generally mark the end of textual units.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
UNIFIED_IDEOGRAPH
static final int UNIFIED_IDEOGRAPH
Binary property Unified_Ideograph (new).
For programmatic determination of Ideographic Description Sequences.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
UPPERCASE
static final int UPPERCASE
Binary property Uppercase.
Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().
Lu+Other_Uppercase
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
WHITE_SPACE
static final int WHITE_SPACE
Binary property White_Space.
Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace(). Space characters+TAB+CR+LF-ZWSP-ZWNBSP
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
XID_CONTINUE
static final int XID_CONTINUE
Binary property XID_Continue.
ID_Continue modified to allow closure under normalization forms NFKC and NFKD.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
XID_START
static final int XID_START
Binary property XID_Start.
ID_Start modified to allow closure under normalization forms NFKC and NFKD.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
CASE_SENSITIVE
static final int CASE_SENSITIVE
Binary property Case_Sensitive.
Either the source of a case mapping or in the target of a case mapping. Not the same as the general category Cased_Letter.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
S_TERM
static final int S_TERM
Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/)- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
VARIATION_SELECTOR
static final int VARIATION_SELECTOR
Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
NFD_INERT
static final int NFD_INERT
Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions. There is one such property per normalization form. These properties are computed as follows - an inert character is: a) unassigned, or ALL of the following: b) of combining class 0. c) not decomposed by this normalization form. AND if NFC or NFKC, d) can never compose with a previous character. e) can never compose with a following character. f) can never change if another character is added. Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h .- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
NFKD_INERT
static final int NFKD_INERT
Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.- See Also:
NFD_INERT, Constant Field Values- Status:
- Stable ICU 3.0.
-
NFC_INERT
static final int NFC_INERT
Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.- See Also:
NFD_INERT, Constant Field Values- Status:
- Stable ICU 3.0.
-
NFKC_INERT
static final int NFKC_INERT
Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.- See Also:
NFD_INERT, Constant Field Values- Status:
- Stable ICU 3.0.
-
SEGMENT_STARTER
static final int SEGMENT_STARTER
Binary Property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like " in NFD(a-umlaut) and a Jamo T in an NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
PATTERN_SYNTAX
static final int PATTERN_SYNTAX
Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
PATTERN_WHITE_SPACE
static final int PATTERN_WHITE_SPACE
Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
POSIX_ALNUM
static final int POSIX_ALNUM
Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
POSIX_BLANK
static final int POSIX_BLANK
Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
POSIX_GRAPH
static final int POSIX_GRAPH
Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
POSIX_PRINT
static final int POSIX_PRINT
Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
POSIX_XDIGIT
static final int POSIX_XDIGIT
Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
CASED
static final int CASED
Binary property Cased. For Lowercase, Uppercase and Titlecase characters.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CASE_IGNORABLE
static final int CASE_IGNORABLE
Binary property Case_Ignorable. Used in context-sensitive case mappings.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_LOWERCASED
static final int CHANGES_WHEN_LOWERCASED
Binary property Changes_When_Lowercased.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_UPPERCASED
static final int CHANGES_WHEN_UPPERCASED
Binary property Changes_When_Uppercased.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_TITLECASED
static final int CHANGES_WHEN_TITLECASED
Binary property Changes_When_Titlecased.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_CASEFOLDED
static final int CHANGES_WHEN_CASEFOLDED
Binary property Changes_When_Casefolded.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_CASEMAPPED
static final int CHANGES_WHEN_CASEMAPPED
Binary property Changes_When_Casemapped.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
CHANGES_WHEN_NFKC_CASEFOLDED
static final int CHANGES_WHEN_NFKC_CASEFOLDED
Binary property Changes_When_NFKC_Casefolded.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.4.
-
EMOJI
static final int EMOJI
Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 57.
-
EMOJI_PRESENTATION
static final int EMOJI_PRESENTATION
Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 57.
-
EMOJI_MODIFIER
static final int EMOJI_MODIFIER
Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 57.
-
EMOJI_MODIFIER_BASE
static final int EMOJI_MODIFIER_BASE
Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 57.
-
EMOJI_COMPONENT
static final int EMOJI_COMPONENT
Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 60.
-
REGIONAL_INDICATOR
static final int REGIONAL_INDICATOR
Binary property Regional_Indicator.- See Also:
- Constant Field Values
- Status:
- Stable ICU 60.
-
PREPENDED_CONCATENATION_MARK
static final int PREPENDED_CONCATENATION_MARK
Binary property Prepended_Concatenation_Mark.- See Also:
- Constant Field Values
- Status:
- Stable ICU 60.
-
EXTENDED_PICTOGRAPHIC
static final int EXTENDED_PICTOGRAPHIC
Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties- See Also:
- Constant Field Values
- Status:
- Stable ICU 62.
-
BASIC_EMOJI
static final int BASIC_EMOJI
Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
EMOJI_KEYCAP_SEQUENCE
static final int EMOJI_KEYCAP_SEQUENCE
Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
RGI_EMOJI_MODIFIER_SEQUENCE
static final int RGI_EMOJI_MODIFIER_SEQUENCE
Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
RGI_EMOJI_FLAG_SEQUENCE
static final int RGI_EMOJI_FLAG_SEQUENCE
Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
RGI_EMOJI_TAG_SEQUENCE
static final int RGI_EMOJI_TAG_SEQUENCE
Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
RGI_EMOJI_ZWJ_SEQUENCE
static final int RGI_EMOJI_ZWJ_SEQUENCE
Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
RGI_EMOJI
static final int RGI_EMOJI
Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets- See Also:
- Constant Field Values
- Status:
- Stable ICU 70.
-
IDS_UNARY_OPERATOR
static final int IDS_UNARY_OPERATOR
Binary property IDS_Unary_Operator. For programmatic determination of Ideographic Description Sequences.- See Also:
- Constant Field Values
- Status:
- Stable ICU 74.
-
ID_COMPAT_MATH_START
static final int ID_COMPAT_MATH_START
Binary property ID_Compat_Math_Start.Used in mathematical identifier profile in UAX #31.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 74.
-
ID_COMPAT_MATH_CONTINUE
static final int ID_COMPAT_MATH_CONTINUE
Binary property ID_Compat_Math_Continue.Used in mathematical identifier profile in UAX #31.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 74.
-
MODIFIER_COMBINING_MARK
static final int MODIFIER_COMBINING_MARK
Binary property Modifier_Combining_Mark.Used by the AMTRA algorithm in UAX #53.
- See Also:
- Constant Field Values
- Status:
- Stable ICU 76.
-
BINARY_LIMIT
@Deprecated static final int BINARY_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for binary Unicode properties.- See Also:
- Constant Field Values
-
BIDI_CLASS
static final int BIDI_CLASS
Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
INT_START
static final int INT_START
First constant for enumerated/integer Unicode properties.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
BLOCK
static final int BLOCK
Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
CANONICAL_COMBINING_CLASS
static final int CANONICAL_COMBINING_CLASS
Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
DECOMPOSITION_TYPE
static final int DECOMPOSITION_TYPE
Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
EAST_ASIAN_WIDTH
static final int EAST_ASIAN_WIDTH
Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/ Returns UCharacter.EastAsianWidth values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
GENERAL_CATEGORY
static final int GENERAL_CATEGORY
Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
JOINING_GROUP
static final int JOINING_GROUP
Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
JOINING_TYPE
static final int JOINING_TYPE
Enumerated property Joining_Type. Returns UCharacter.JoiningType values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
LINE_BREAK
static final int LINE_BREAK
Enumerated property Line_Break. Returns UCharacter.LineBreak values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
NUMERIC_TYPE
static final int NUMERIC_TYPE
Enumerated property Numeric_Type. Returns UCharacter.NumericType values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
SCRIPT
static final int SCRIPT
Enumerated property Script. Same as UScript.getScript(int), returns UScript values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
HANGUL_SYLLABLE_TYPE
static final int HANGUL_SYLLABLE_TYPE
Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns UCharacter.HangulSyllableType values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.6.
-
NFD_QUICK_CHECK
static final int NFD_QUICK_CHECK
Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
NFKD_QUICK_CHECK
static final int NFKD_QUICK_CHECK
Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
NFC_QUICK_CHECK
static final int NFC_QUICK_CHECK
Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
NFKC_QUICK_CHECK
static final int NFKC_QUICK_CHECK
Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
LEAD_CANONICAL_COMBINING_CLASS
static final int LEAD_CANONICAL_COMBINING_CLASS
Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
TRAIL_CANONICAL_COMBINING_CLASS
static final int TRAIL_CANONICAL_COMBINING_CLASS
Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or lccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.0.
-
GRAPHEME_CLUSTER_BREAK
static final int GRAPHEME_CLUSTER_BREAK
Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.GraphemeClusterBreak values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
SENTENCE_BREAK
static final int SENTENCE_BREAK
Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.SentenceBreak values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
WORD_BREAK
static final int WORD_BREAK
Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.WordBreak values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 3.4.
-
BIDI_PAIRED_BRACKET_TYPE
static final int BIDI_PAIRED_BRACKET_TYPE
Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/) Returns UCharacter.BidiPairedBracketType values.- See Also:
- Constant Field Values
- Status:
- Stable ICU 52.
-
INDIC_POSITIONAL_CATEGORY
static final int INDIC_POSITIONAL_CATEGORY
Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt- See Also:
- Constant Field Values
- Status:
- Stable ICU 63.
-
INDIC_SYLLABIC_CATEGORY
static final int INDIC_SYLLABIC_CATEGORY
Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt- See Also:
- Constant Field Values
- Status:
- Stable ICU 63.
-
VERTICAL_ORIENTATION
static final int VERTICAL_ORIENTATION
Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.- See Also:
- Constant Field Values
- Status:
- Stable ICU 63.
-
IDENTIFIER_STATUS
static final int IDENTIFIER_STATUS
Enumerated property Identifier_Status. Used for UTS #39 General Security Profile for Identifiers (https://www.unicode.org/reports/tr39/#General_Security_Profile).- See Also:
- Constant Field Values
- Status:
- Stable ICU 75.
-
INDIC_CONJUNCT_BREAK
static final int INDIC_CONJUNCT_BREAK
Enumerated property Indic_Conjunct_Break. Used in the grapheme cluster break algorithm in UAX #29.- See Also:
- Constant Field Values
- Status:
- Stable ICU 76.
-
INT_LIMIT
@Deprecated static final int INT_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for enumerated/integer Unicode properties.- See Also:
- Constant Field Values
-
GENERAL_CATEGORY_MASK
static final int GENERAL_CATEGORY_MASK
Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
MASK_START
static final int MASK_START
First constant for bit-mask Unicode properties.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
MASK_LIMIT
@Deprecated static final int MASK_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for bit-mask Unicode properties.- See Also:
- Constant Field Values
-
NUMERIC_VALUE
static final int NUMERIC_VALUE
Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
DOUBLE_START
static final int DOUBLE_START
First constant for double Unicode properties.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
DOUBLE_LIMIT
@Deprecated static final int DOUBLE_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for double Unicode properties.- See Also:
- Constant Field Values
-
AGE
static final int AGE
String property Age. Corresponds to UCharacter.getAge(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
STRING_START
static final int STRING_START
First constant for string Unicode properties.- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
BIDI_MIRRORING_GLYPH
static final int BIDI_MIRRORING_GLYPH
String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
CASE_FOLDING
static final int CASE_FOLDING
String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
ISO_COMMENT
@Deprecated static final int ISO_COMMENT
Deprecated.ICU 49Deprecated string property ISO_Comment. Corresponds to UCharacter.getISOComment(int).- See Also:
- Constant Field Values
-
LOWERCASE_MAPPING
static final int LOWERCASE_MAPPING
String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
NAME
static final int NAME
String property Name. Corresponds to UCharacter.getName(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
SIMPLE_CASE_FOLDING
static final int SIMPLE_CASE_FOLDING
String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
SIMPLE_LOWERCASE_MAPPING
static final int SIMPLE_LOWERCASE_MAPPING
String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
SIMPLE_TITLECASE_MAPPING
static final int SIMPLE_TITLECASE_MAPPING
String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
SIMPLE_UPPERCASE_MAPPING
static final int SIMPLE_UPPERCASE_MAPPING
String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
TITLECASE_MAPPING
static final int TITLECASE_MAPPING
String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
UNICODE_1_NAME
@Deprecated static final int UNICODE_1_NAME
Deprecated.ICU 49String property Unicode_1_Name. This property is of little practical value. Beginning with ICU 49, ICU APIs return null or an empty string for this property. Corresponds to UCharacter.getName1_0(int).- See Also:
- Constant Field Values
-
UPPERCASE_MAPPING
static final int UPPERCASE_MAPPING
String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).- See Also:
- Constant Field Values
- Status:
- Stable ICU 2.4.
-
BIDI_PAIRED_BRACKET
static final int BIDI_PAIRED_BRACKET
String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket.- See Also:
- Constant Field Values
- Status:
- Stable ICU 52.
-
STRING_LIMIT
@Deprecated static final int STRING_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for string Unicode properties.- See Also:
- Constant Field Values
-
SCRIPT_EXTENSIONS
static final int SCRIPT_EXTENSIONS
Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript and UScript.getScriptExtensions.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.6.
-
OTHER_PROPERTY_START
static final int OTHER_PROPERTY_START
First constant for Unicode properties with unusual value types.- See Also:
- Constant Field Values
- Status:
- Stable ICU 4.6.
-
IDENTIFIER_TYPE
static final int IDENTIFIER_TYPE
Miscellaneous property Identifier_Type. Used for UTS #39 General Security Profile for Identifiers (https://www.unicode.org/reports/tr39/#General_Security_Profile).Corresponds to
UCharacter.hasIdentifierType(int, UCharacter.IdentifierType)andUCharacter.getIdentifierTypes(int, java.util.EnumSet).Each code point maps to a set of IdentifierType values.
- See Also:
UCharacter.hasIdentifierType(int, UCharacter.IdentifierType),UCharacter.getIdentifierTypes(int, java.util.EnumSet), Constant Field Values- Status:
- Stable ICU 75.
-
OTHER_PROPERTY_LIMIT
@Deprecated static final int OTHER_PROPERTY_LIMIT
Deprecated.ICU 58 The numeric value may change over time, see ICU ticket #12420.One more than the last constant for Unicode properties with unusual value types.- See Also:
- Constant Field Values
-
-