public final class UScript extends Object
The current set of script code constants supports at least all scripts that are encoded in the version of Unicode which ICU currently supports. The names of the constants are usually derived from the Unicode script property value aliases. See UAX #24 Unicode Script Property (http://www.unicode.org/reports/tr24/) and http://www.unicode.org/Public/UCD/latest/ucd/PropertyValueAliases.txt .
In addition, constants for many ISO 15924 script codes are included, for use with language tags, CLDR data, and similar. Some of those codes are not used in the Unicode Character Database (UCD). For example, there are no characters that have a UCD script property value of Hans or Hant. All Han ideographs have the Hani script property value in Unicode.
Private-use codes Qaaa..Qabx are not included, except as used in the UCD or in CLDR.
Starting with ICU 55, script codes are only added when their scripts have been or will certainly be encoded in Unicode, and have been assigned Unicode script property value aliases, to ensure that their script names are stable and match the names of the constants. Script codes like Latf and Aran that are not subject to separate encoding may be added at any time.
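The distinction between UCD script property values and ISO 15924-only codes (Hani versus Hans and Hant) can be observed directly. The following is a minimal sketch, assuming ICU4J is on the classpath; the class name `HanScriptCodes` and the helper `scriptOf` are illustrative and not part of ICU.

```java
import com.ibm.icu.lang.UScript;

public class HanScriptCodes {
    // Returns the UCD script property value of a code point as a UScript constant.
    static int scriptOf(int codePoint) {
        return UScript.getScript(codePoint);
    }

    public static void main(String[] args) {
        int ideograph = 0x4E2D; // U+4E2D, a CJK unified ideograph

        // Han ideographs carry the Hani script property value (UScript.HAN).
        System.out.println(scriptOf(ideograph) == UScript.HAN);

        // SIMPLIFIED_HAN is an ISO 15924 code that is not a UCD script property value,
        // so getScript is not expected to return it for any character.
        System.out.println(scriptOf(ideograph) == UScript.SIMPLIFIED_HAN);

        // The constant still has a 4-letter ISO 15924 code for use in language tags.
        System.out.println(UScript.getShortName(UScript.SIMPLIFIED_HAN));
    }
}
```

Constants such as SIMPLIFIED_HAN and TRADITIONAL_HAN are therefore mainly useful with language tags and CLDR data rather than for classifying characters.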
Modifier and Type | Class and Description |
---|---|
static class | UScript.ScriptUsage: Script usage constants. |
Modifier and Type | Field and Description |
---|---|
static int | ADLAM: ISO 15924 script code |
static int | AFAKA: ISO 15924 script code |
static int | AHOM: ISO 15924 script code |
static int | ANATOLIAN_HIEROGLYPHS: ISO 15924 script code |
static int | ARABIC: Arabic |
static int | ARABIC_NASTALIQ |
static int | ARMENIAN: Armenian |
static int | AVESTAN: ISO 15924 script code |
static int | BALINESE: ISO 15924 script code |
static int | BAMUM: ISO 15924 script code |
static int | BASSA_VAH: ISO 15924 script code |
static int | BATAK: ISO 15924 script code |
static int | BENGALI: Bengali |
static int | BHAIKSUKI: ISO 15924 script code |
static int | BLISSYMBOLS: ISO 15924 script code |
static int | BOOK_PAHLAVI: ISO 15924 script code |
static int | BOPOMOFO: Bopomofo |
static int | BRAHMI: ISO 15924 script code |
static int | BRAILLE: Braille. Script in Unicode 4 |
static int | BUGINESE: Script in Unicode 4.1 |
static int | BUHID: Buhid |
static int | CANADIAN_ABORIGINAL: Unified Canadian Aboriginal Syllabics |
static int | CARIAN: ISO 15924 script code |
static int | CAUCASIAN_ALBANIAN: ISO 15924 script code |
static int | CHAKMA: ISO 15924 script code |
static int | CHAM: ISO 15924 script code |
static int | CHEROKEE: Cherokee |
static int | CHORASMIAN |
static int | CIRTH: ISO 15924 script code |
static int | CODE_LIMIT: Deprecated. ICU 58 The numeric value may change over time, see ICU ticket #12420. |
static int | COMMON: Common |
static int | COPTIC: Coptic |
static int | CUNEIFORM: ISO 15924 script code |
static int | CYPRIOT: Cypriot. Script in Unicode 4 |
static int | CYPRO_MINOAN |
static int | CYRILLIC: Cyrillic |
static int | DEMOTIC_EGYPTIAN: ISO 15924 script code |
static int | DESERET: Deseret |
static int | DEVANAGARI: Devanagari |
static int | DIVES_AKURU |
static int | DOGRA: ISO 15924 script code |
static int | DUPLOYAN: ISO 15924 script code |
static int | DUPLOYAN_SHORTAND: Deprecated. ICU 54 |
static int | EASTERN_SYRIAC: ISO 15924 script code |
static int | EGYPTIAN_HIEROGLYPHS: ISO 15924 script code |
static int | ELBASAN: ISO 15924 script code |
static int | ELYMAIC |
static int | ESTRANGELO_SYRIAC: ISO 15924 script code |
static int | ETHIOPIC: Ethiopic |
static int | GARAY |
static int | GEORGIAN: Georgian |
static int | GLAGOLITIC: Script in Unicode 4.1 |
static int | GOTHIC: Gothic |
static int | GRANTHA: ISO 15924 script code |
static int | GREEK: Greek |
static int | GUJARATI: Gujarati |
static int | GUNJALA_GONDI |
static int | GURMUKHI: Gurmukhi |
static int | GURUNG_KHEMA |
static int | HAN: Han |
static int | HAN_WITH_BOPOMOFO: ISO 15924 script code |
static int | HANGUL: Hangul |
static int | HANIFI_ROHINGYA |
static int | HANUNOO: Hanunoo |
static int | HARAPPAN_INDUS: ISO 15924 script code |
static int | HATRAN: ISO 15924 script code |
static int | HEBREW: Hebrew |
static int | HIERATIC_EGYPTIAN: ISO 15924 script code |
static int | HIRAGANA: Hiragana |
static int | IMPERIAL_ARAMAIC: ISO 15924 script code |
static int | INHERITED: Inherited |
static int | INSCRIPTIONAL_PAHLAVI: ISO 15924 script code |
static int | INSCRIPTIONAL_PARTHIAN: ISO 15924 script code |
static int | INVALID_CODE: Invalid code |
static int | JAMO: ISO 15924 script code |
static int | JAPANESE: ISO 15924 script code |
static int | JAVANESE: ISO 15924 script code |
static int | JURCHEN: ISO 15924 script code |
static int | KAITHI: ISO 15924 script code |
static int | KANNADA: Kannada |
static int | KATAKANA: Katakana |
static int | KATAKANA_OR_HIRAGANA: Script in Unicode 4.0.1 |
static int | KAWI |
static int | KAYAH_LI: ISO 15924 script code |
static int | KHAROSHTHI: Script in Unicode 4.1 |
static int | KHITAN_SMALL_SCRIPT |
static int | KHMER: Khmer |
static int | KHOJKI: ISO 15924 script code |
static int | KHUDAWADI: ISO 15924 script code |
static int | KHUTSURI: ISO 15924 script code |
static int | KIRAT_RAI |
static int | KOREAN: ISO 15924 script code |
static int | KPELLE: ISO 15924 script code |
static int | LANNA: ISO 15924 script code |
static int | LAO: Lao |
static int | LATIN: Latin |
static int | LATIN_FRAKTUR: ISO 15924 script code |
static int | LATIN_GAELIC: ISO 15924 script code |
static int | LEPCHA: ISO 15924 script code |
static int | LIMBU: Limbu. Script in Unicode 4 |
static int | LINEAR_A: ISO 15924 script code |
static int | LINEAR_B: Linear B. Script in Unicode 4 |
static int | LISU: ISO 15924 script code |
static int | LOMA: ISO 15924 script code |
static int | LYCIAN: ISO 15924 script code |
static int | LYDIAN: ISO 15924 script code |
static int | MAHAJANI: ISO 15924 script code |
static int | MAKASAR |
static int | MALAYALAM: Malayalam |
static int | MANDAEAN: ISO 15924 script code |
static int | MANDAIC: ISO 15924 script code |
static int | MANICHAEAN: ISO 15924 script code |
static int | MARCHEN: ISO 15924 script code |
static int | MASARAM_GONDI: ISO 15924 script code |
static int | MATHEMATICAL_NOTATION: ISO 15924 script code |
static int | MAYAN_HIEROGLYPHS: ISO 15924 script code |
static int | MEDEFAIDRIN |
static int | MEITEI_MAYEK: ISO 15924 script code |
static int | MENDE: Mende Kikakui. ISO 15924 script code |
static int | MEROITIC: ISO 15924 script code |
static int | MEROITIC_CURSIVE: ISO 15924 script code |
static int | MEROITIC_HIEROGLYPHS: ISO 15924 script code |
static int | MIAO: ISO 15924 script code |
static int | MODI: ISO 15924 script code |
static int | MONGOLIAN: Mongolian |
static int | MOON: ISO 15924 script code |
static int | MRO: ISO 15924 script code |
static int | MULTANI: ISO 15924 script code |
static int | MYANMAR: Myanmar |
static int | NABATAEAN: ISO 15924 script code |
static int | NAG_MUNDARI |
static int | NAKHI_GEBA: ISO 15924 script code |
static int | NANDINAGARI |
static int | NEW_TAI_LUE: Script in Unicode 4.1 |
static int | NEWA: ISO 15924 script code |
static int | NKO: ISO 15924 script code |
static int | NUSHU: ISO 15924 script code |
static int | NYIAKENG_PUACHUE_HMONG |
static int | OGHAM: Ogham |
static int | OL_CHIKI: ISO 15924 script code |
static int | OL_ONAL |
static int | OLD_CHURCH_SLAVONIC_CYRILLIC: ISO 15924 script code |
static int | OLD_HUNGARIAN: ISO 15924 script code |
static int | OLD_ITALIC: Old Italic |
static int | OLD_NORTH_ARABIAN: ISO 15924 script code |
static int | OLD_PERMIC: ISO 15924 script code |
static int | OLD_PERSIAN: Script in Unicode 4.1 |
static int | OLD_SOGDIAN |
static int | OLD_SOUTH_ARABIAN: ISO 15924 script code |
static int | OLD_UYGHUR |
static int | ORIYA: Oriya |
static int | ORKHON: ISO 15924 script code |
static int | OSAGE: ISO 15924 script code |
static int | OSMANYA: Osmanya. Script in Unicode 4 |
static int | PAHAWH_HMONG: ISO 15924 script code |
static int | PALMYRENE: ISO 15924 script code |
static int | PAU_CIN_HAU: ISO 15924 script code |
static int | PHAGS_PA: ISO 15924 script code |
static int | PHOENICIAN: ISO 15924 script code |
static int | PHONETIC_POLLARD: ISO 15924 script code |
static int | PSALTER_PAHLAVI: ISO 15924 script code |
static int | REJANG: ISO 15924 script code |
static int | RONGORONGO: ISO 15924 script code |
static int | RUNIC: Runic |
static int | SAMARITAN: ISO 15924 script code |
static int | SARATI: ISO 15924 script code |
static int | SAURASHTRA: ISO 15924 script code |
static int | SHARADA: ISO 15924 script code |
static int | SHAVIAN: Shavian. Script in Unicode 4 |
static int | SIDDHAM: ISO 15924 script code |
static int | SIGN_WRITING: ISO 15924 script code for Sutton SignWriting |
static int | SIMPLIFIED_HAN: ISO 15924 script code |
static int | SINDHI: ISO 15924 script code |
static int | SINHALA: Sinhala |
static int | SOGDIAN |
static int | SORA_SOMPENG: ISO 15924 script code |
static int | SOYOMBO: ISO 15924 script code |
static int | SUNDANESE: ISO 15924 script code |
static int | SUNUWAR |
static int | SYLOTI_NAGRI: Script in Unicode 4.1 |
static int | SYMBOLS: ISO 15924 script code |
static int | SYMBOLS_EMOJI: ISO 15924 script code |
static int | SYRIAC: Syriac |
static int | TAGALOG: Tagalog |
static int | TAGBANWA: Tagbanwa |
static int | TAI_LE: Tai Le. Script in Unicode 4 |
static int | TAI_VIET: ISO 15924 script code |
static int | TAKRI: ISO 15924 script code |
static int | TAMIL: Tamil |
static int | TANGSA |
static int | TANGUT: ISO 15924 script code |
static int | TELUGU: Telugu |
static int | TENGWAR: ISO 15924 script code |
static int | THAANA: Thaana |
static int | THAI: Thai |
static int | TIBETAN: Tibetan |
static int | TIFINAGH: Script in Unicode 4.1 |
static int | TIRHUTA: ISO 15924 script code |
static int | TODHRI |
static int | TOTO |
static int | TRADITIONAL_HAN: ISO 15924 script code |
static int | TULU_TIGALARI |
static int | UCAS: Unified Canadian Aboriginal Syllabics (alias) |
static int | UGARITIC: Ugaritic. Script in Unicode 4 |
static int | UNKNOWN: ISO 15924 script code |
static int | UNWRITTEN_LANGUAGES: ISO 15924 script code |
static int | VAI: ISO 15924 script code |
static int | VISIBLE_SPEECH: ISO 15924 script code |
static int | VITHKUQI |
static int | WANCHO |
static int | WARANG_CITI: ISO 15924 script code |
static int | WESTERN_SYRIAC: ISO 15924 script code |
static int | WOLEAI: ISO 15924 script code |
static int | YEZIDI |
static int | YI: Yi syllables |
static int | ZANABAZAR_SQUARE: ISO 15924 script code |
Modifier and Type | Method and Description |
---|---|
static boolean | breaksBetweenLetters(int script): Returns true if the script allows line breaks between letters (excluding hyphenation). |
static int[] | getCode(Locale locale): Gets the script codes associated with the given locale. |
static int[] | getCode(String nameOrAbbrOrLocale): Gets the script codes associated with the given locale or ISO 15924 abbreviation or name. |
static int[] | getCode(ULocale locale): Gets the script codes associated with the given locale. |
static int | getCodeFromName(String nameOrAbbr): Returns the script code associated with the given Unicode script property alias (name or abbreviation). |
static String | getName(int scriptCode): Returns the long Unicode script name, if there is one. |
static String | getSampleString(int script): Returns the script sample character string. |
static int | getScript(int codepoint): Gets the script code associated with the given code point. |
static int | getScriptExtensions(int c, BitSet set): Sets code point c's Script_Extensions as script code integers into the output BitSet. |
static String | getShortName(int scriptCode): Returns the 4-letter ISO 15924 script code, which is the same as the short Unicode script name if Unicode has names for the script. |
static UScript.ScriptUsage | getUsage(int script): Returns the script usage according to UAX #31 Unicode Identifier and Pattern Syntax. |
static boolean | hasScript(int c, int sc): Do the Script_Extensions of code point c contain script sc? |
static boolean | isCased(int script): Returns true if case distinctions are customary in modern (or most recent) usage of the script. |
static boolean | isRightToLeft(int script): Returns true if the script is written right-to-left. |
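The lookup methods summarized above can be combined as in the following minimal sketch. It assumes ICU4J is on the classpath; the class name `UScriptTour` and the helpers `describe` and `extensionsOf` are illustrative and not part of ICU.

```java
import com.ibm.icu.lang.UScript;
import java.util.BitSet;

public class UScriptTour {
    // Formats the script of one code point as "LongName (Abbr)",
    // using getScript, getName and getShortName.
    static String describe(int codePoint) {
        int script = UScript.getScript(codePoint);
        return UScript.getName(script) + " (" + UScript.getShortName(script) + ")";
    }

    // Collects the script codes that getScriptExtensions reports for a code point.
    // The method clears the BitSet before setting bits, so a fresh set is enough.
    static BitSet extensionsOf(int codePoint) {
        BitSet set = new BitSet();
        UScript.getScriptExtensions(codePoint, set);
        return set;
    }

    public static void main(String[] args) {
        System.out.println(describe('A'));     // Latin capital letter A
        System.out.println(describe(0x0416));  // Cyrillic capital letter Zhe

        // Alias lookup by 4-letter abbreviation, without locale data.
        System.out.println(UScript.getCodeFromName("Arab") == UScript.ARABIC);

        // Script-level metadata.
        System.out.println(UScript.isRightToLeft(UScript.HEBREW));

        // U+30FC (Katakana-Hiragana prolonged sound mark) is shared between scripts,
        // so its Script_Extensions are more informative than its single script code.
        BitSet ext = extensionsOf(0x30FC);
        for (int code = ext.nextSetBit(0); code >= 0; code = ext.nextSetBit(code + 1)) {
            System.out.println(UScript.getShortName(code));
        }
        System.out.println(UScript.hasScript(0x30FC, UScript.HIRAGANA));
    }
}
```

For characters used in several scripts, hasScript and getScriptExtensions are usually the better choice, because getScript alone reports only the single Script property value.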
public static final int INVALID_CODE
public static final int COMMON
public static final int INHERITED
public static final int ARABIC
public static final int ARMENIAN
public static final int BENGALI
public static final int BOPOMOFO
public static final int CHEROKEE
public static final int COPTIC
public static final int CYRILLIC
public static final int DESERET
public static final int DEVANAGARI
public static final int ETHIOPIC
public static final int GEORGIAN
public static final int GOTHIC
public static final int GREEK
public static final int GUJARATI
public static final int GURMUKHI
public static final int HAN
public static final int HANGUL
public static final int HEBREW
public static final int HIRAGANA
public static final int KANNADA
public static final int KATAKANA
public static final int KHMER
public static final int LAO
public static final int LATIN
public static final int MALAYALAM
public static final int MONGOLIAN
public static final int MYANMAR
public static final int OGHAM
public static final int OLD_ITALIC
public static final int ORIYA
public static final int RUNIC
public static final int SINHALA
public static final int SYRIAC
public static final int TAMIL
public static final int TELUGU
public static final int THAANA
public static final int THAI
public static final int TIBETAN
public static final int CANADIAN_ABORIGINAL
public static final int UCAS
public static final int YI
public static final int TAGALOG
public static final int HANUNOO
public static final int BUHID
public static final int TAGBANWA
public static final int BRAILLE
public static final int CYPRIOT
public static final int LIMBU
public static final int LINEAR_B
public static final int OSMANYA
public static final int SHAVIAN
public static final int TAI_LE
public static final int UGARITIC
public static final int KATAKANA_OR_HIRAGANA
public static final int BUGINESE
public static final int GLAGOLITIC
public static final int KHAROSHTHI
public static final int SYLOTI_NAGRI
public static final int NEW_TAI_LUE
public static final int TIFINAGH
public static final int OLD_PERSIAN
public static final int BALINESE
public static final int BATAK
public static final int BLISSYMBOLS
public static final int BRAHMI
public static final int CHAM
public static final int CIRTH
public static final int OLD_CHURCH_SLAVONIC_CYRILLIC
public static final int DEMOTIC_EGYPTIAN
public static final int HIERATIC_EGYPTIAN
public static final int EGYPTIAN_HIEROGLYPHS
public static final int KHUTSURI
public static final int SIMPLIFIED_HAN
public static final int TRADITIONAL_HAN
public static final int PAHAWH_HMONG
public static final int OLD_HUNGARIAN
public static final int HARAPPAN_INDUS
public static final int JAVANESE
public static final int KAYAH_LI
public static final int LATIN_FRAKTUR
public static final int LATIN_GAELIC
public static final int LEPCHA
public static final int LINEAR_A
public static final int MANDAIC
public static final int MANDAEAN
public static final int MAYAN_HIEROGLYPHS
public static final int MEROITIC_HIEROGLYPHS
public static final int MEROITIC
public static final int NKO
public static final int ORKHON
public static final int OLD_PERMIC
public static final int PHAGS_PA
public static final int PHOENICIAN
public static final int MIAO
public static final int PHONETIC_POLLARD
public static final int RONGORONGO
public static final int SARATI
public static final int ESTRANGELO_SYRIAC
public static final int WESTERN_SYRIAC
public static final int EASTERN_SYRIAC
public static final int TENGWAR
public static final int VAI
public static final int VISIBLE_SPEECH
public static final int CUNEIFORM
public static final int UNWRITTEN_LANGUAGES
public static final int UNKNOWN
public static final int CARIAN
public static final int JAPANESE
public static final int LANNA
public static final int LYCIAN
public static final int LYDIAN
public static final int OL_CHIKI
public static final int REJANG
public static final int SAURASHTRA
public static final int SIGN_WRITING
public static final int SUNDANESE
public static final int MOON
public static final int MEITEI_MAYEK
public static final int IMPERIAL_ARAMAIC
public static final int AVESTAN
public static final int CHAKMA
public static final int KOREAN
public static final int KAITHI
public static final int MANICHAEAN
public static final int INSCRIPTIONAL_PAHLAVI
public static final int PSALTER_PAHLAVI
public static final int BOOK_PAHLAVI
public static final int INSCRIPTIONAL_PARTHIAN
public static final int SAMARITAN
public static final int TAI_VIET
public static final int MATHEMATICAL_NOTATION
public static final int SYMBOLS
public static final int BAMUM
public static final int LISU
public static final int NAKHI_GEBA
public static final int OLD_SOUTH_ARABIAN
public static final int BASSA_VAH
public static final int DUPLOYAN
@Deprecated public static final int DUPLOYAN_SHORTAND
public static final int ELBASAN
public static final int GRANTHA
public static final int KPELLE
public static final int LOMA
public static final int MENDE
public static final int MEROITIC_CURSIVE
public static final int OLD_NORTH_ARABIAN
public static final int NABATAEAN
public static final int PALMYRENE
public static final int KHUDAWADI
public static final int SINDHI
public static final int WARANG_CITI
public static final int AFAKA
public static final int JURCHEN
public static final int MRO
public static final int NUSHU
public static final int SHARADA
public static final int SORA_SOMPENG
public static final int TAKRI
public static final int TANGUT
public static final int WOLEAI
public static final int ANATOLIAN_HIEROGLYPHS
public static final int KHOJKI
public static final int TIRHUTA
public static final int CAUCASIAN_ALBANIAN
public static final int MAHAJANI
public static final int AHOM
public static final int HATRAN
public static final int MODI
public static final int MULTANI
public static final int PAU_CIN_HAU
public static final int SIDDHAM
public static final int ADLAM
public static final int BHAIKSUKI
public static final int MARCHEN
public static final int NEWA
public static final int OSAGE
public static final int HAN_WITH_BOPOMOFO
public static final int JAMO
public static final int SYMBOLS_EMOJI
public static final int MASARAM_GONDI
public static final int SOYOMBO
public static final int ZANABAZAR_SQUARE
public static final int DOGRA
public static final int GUNJALA_GONDI
public static final int MAKASAR
public static final int MEDEFAIDRIN
public static final int HANIFI_ROHINGYA
public static final int SOGDIAN
public static final int OLD_SOGDIAN
public static final int ELYMAIC
public static final int NYIAKENG_PUACHUE_HMONG
public static final int NANDINAGARI
public static final int WANCHO
public static final int CHORASMIAN
public static final int DIVES_AKURU
public static final int KHITAN_SMALL_SCRIPT
public static final int YEZIDI
public static final int CYPRO_MINOAN
public static final int OLD_UYGHUR
public static final int TANGSA
public static final int TOTO
public static final int VITHKUQI
public static final int KAWI
public static final int NAG_MUNDARI
public static final int ARABIC_NASTALIQ
public static final int GARAY
public static final int GURUNG_KHEMA
public static final int KIRAT_RAI
public static final int OL_ONAL
public static final int SUNUWAR
public static final int TODHRI
public static final int TULU_TIGALARI
@Deprecated public static final int CODE_LIMIT
public static final int[] getCode(Locale locale)
locale - Locale
public static final int[] getCode(ULocale locale)
locale - ULocale
public static final int[] getCode(String nameOrAbbrOrLocale)
Note: To search by short or long script alias only, use getCodeFromName(String) instead. That does a fast lookup with no access of the locale data.
nameOrAbbrOrLocale - name of the script or ISO 15924 code or locale
public static final int getCodeFromName(String nameOrAbbr)
nameOrAbbr - name of the script or ISO 15924 code
public static final int getScript(int codepoint)
codepoint - UChar32 codepoint
public static final boolean hasScript(int c, int sc)
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
c - code point
sc - script code
public static final int getScriptExtensions(int c, BitSet set)
If c is not a valid code point, the UNKNOWN code is put into the set and also returned.
Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/.
c - code point
set - set of script code integers; will be cleared, then bits are set corresponding to c's Script_Extensions
public static final String getName(int scriptCode)
scriptCode - int script code
IllegalArgumentException - if the script code is not valid
public static final String getShortName(int scriptCode)
scriptCode - int script code
IllegalArgumentException - if the script code is not valid
public static final String getSampleString(int script)
script - script code
public static final UScript.ScriptUsage getUsage(int script)
Returns UScript.ScriptUsage.NOT_ENCODED if the script is not encoded in Unicode.
script - script code
UScript.ScriptUsage
public static final boolean isRightToLeft(int script)
script - script code
public static final boolean breaksBetweenLetters(int script)
script - script code
public static final boolean isCased(int script)
script - script code
Copyright © 2016 Unicode, Inc. and others.