UnicodeCharacterClass enumeration

The major class of a Unicode character’s general category.

public enum UnicodeCharacterClass

Values

Name Value Description
Letter 0 Indicates that the character is a letter; this class includes the Unicode categories “Lu”, “Ll”, “Lt”, “Lm”, “Lo”.
Mark 1 Indicates that the character is a mark; this class includes the Unicode categories “Mn”, “Mc”, “Me”.
Number 2 Indicates that the character is a number; this class includes the Unicode categories “Nd”, “Nl”, “No”.
Punctuation 3 Indicates that the character is punctuation; this class includes the Unicode categories “Pc”, “Pd”, “Ps”, “Pe”, “Pi”, “Pf”, “Po”.
Symbol 4 Indicates that the character is a symbol; this class includes the Unicode categories “Sm”, “Sc”, “Sk”, “So”.
Separator 5 Indicates that the character is a separator; this class includes the Unicode categories “Zs”, “Zl”, “Zp”.
Other 6 Indicates that the character is a control, format, surrogate, private-use, or unassigned code point; this class includes the Unicode categories “Cc”, “Cf”, “Cs”, “Co”, “Cn”.
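
Example

The grouping in the table can be illustrated by mapping the standard .NET System.Globalization.UnicodeCategory values onto these classes. The following is a minimal sketch, not the library's own implementation: the enum is redeclared locally with the values from the table because its namespace is not stated here, and the helper class UnicodeCharacterClassExample with its methods FromCategory and FromChar is hypothetical. The two-letter comments are the Unicode general category abbreviations.

```csharp
using System;
using System.Globalization;

// Local stand-in for the documented enum (assumption: declared here only so
// the sketch compiles on its own; values are copied from the table).
public enum UnicodeCharacterClass
{
    Letter = 0,
    Mark = 1,
    Number = 2,
    Punctuation = 3,
    Symbol = 4,
    Separator = 5,
    Other = 6
}

// Hypothetical helper, not part of the documented API.
public static class UnicodeCharacterClassExample
{
    // Maps a .NET UnicodeCategory to its major class.
    public static UnicodeCharacterClass FromCategory(UnicodeCategory category)
    {
        switch (category)
        {
            case UnicodeCategory.UppercaseLetter:          // Lu
            case UnicodeCategory.LowercaseLetter:          // Ll
            case UnicodeCategory.TitlecaseLetter:          // Lt
            case UnicodeCategory.ModifierLetter:           // Lm
            case UnicodeCategory.OtherLetter:              // Lo
                return UnicodeCharacterClass.Letter;

            case UnicodeCategory.NonSpacingMark:           // Mn
            case UnicodeCategory.SpacingCombiningMark:     // Mc
            case UnicodeCategory.EnclosingMark:            // Me
                return UnicodeCharacterClass.Mark;

            case UnicodeCategory.DecimalDigitNumber:       // Nd
            case UnicodeCategory.LetterNumber:             // Nl
            case UnicodeCategory.OtherNumber:              // No
                return UnicodeCharacterClass.Number;

            case UnicodeCategory.ConnectorPunctuation:     // Pc
            case UnicodeCategory.DashPunctuation:          // Pd
            case UnicodeCategory.OpenPunctuation:          // Ps
            case UnicodeCategory.ClosePunctuation:         // Pe
            case UnicodeCategory.InitialQuotePunctuation:  // Pi
            case UnicodeCategory.FinalQuotePunctuation:    // Pf
            case UnicodeCategory.OtherPunctuation:         // Po
                return UnicodeCharacterClass.Punctuation;

            case UnicodeCategory.MathSymbol:               // Sm
            case UnicodeCategory.CurrencySymbol:           // Sc
            case UnicodeCategory.ModifierSymbol:           // Sk
            case UnicodeCategory.OtherSymbol:              // So
                return UnicodeCharacterClass.Symbol;

            case UnicodeCategory.SpaceSeparator:           // Zs
            case UnicodeCategory.LineSeparator:            // Zl
            case UnicodeCategory.ParagraphSeparator:       // Zp
                return UnicodeCharacterClass.Separator;

            default:                                       // Cc, Cf, Cs, Co, Cn
                return UnicodeCharacterClass.Other;
        }
    }

    // Classifies a single UTF-16 code unit. A lone surrogate half is
    // reported as Other (Cs).
    public static UnicodeCharacterClass FromChar(char c)
    {
        return FromCategory(CharUnicodeInfo.GetUnicodeCategory(c));
    }
}
```

With this sketch, FromChar('A') yields Letter, FromChar('5') yields Number, FromChar('(') yields Punctuation, FromChar('+') yields Symbol, FromChar(' ') yields Separator, and FromChar('\n') yields Other. For characters outside the Basic Multilingual Plane, the CharUnicodeInfo.GetUnicodeCategory(string, int) overload would be the starting point instead of the char overload.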

See Also