ICU Collation Demo

User GuideLDML CollationICU4C DemosICU4J Demos


Append rules: (For examples see the CLDR tailoring data)

Settings (see LDML Table / Common combinations)
normalization: default off on
numeric: default off on
strength: default primary secondary
        tertiary quaternary identical
backwards
secondary:
default off on
case level: default off on
case first: default off lower upper
alternate: default non-ignorable shifted
max variable: default space punct symbol currency
& show diff strengths input line numbers sort keys raw collation elements

Input

Escape syntax: \uHHHH or \U00HHHHHH. (See UnicodeString::unescape() documentation.)

Output (Legend)

Legend

Difference strengths: = equal <1 primary <2 secondary <c case level <3 tertiary <4 quaternary <i identical level

Sort keys: , = 01 level separator   . = 00 terminator

CEs (Collation Elements): [] for completely ignorable or [primary,secondary,case+tertiary,optional quaternary]
case: _ = uncased/lowercase   m = mixed case   u = uppercase
  q1/q2/q3 = quaternary weight