International Components for Unicode

CESU-8


List of Converter Aliases

Internal Converter Name: CESU-8
IANA: CESU-8
MIME: (none listed)
All Aliases: CESU-8, ibm-9400


Codepage Layout

Currently showing the codepage starting with the bytes E8 93. Rows give the high nibble and columns the low nibble of the third byte; each cell is the mapped Unicode code point in hexadecimal. Rows 00-70 and C0-F0 have no entries.

    00   01   02   03   04   05   06   07   08   09   0A   0B   0C   0D   0E   0F
80  84C0 84C1 84C2 84C3 84C4 84C5 84C6 84C7 84C8 84C9 84CA 84CB 84CC 84CD 84CE 84CF
90  84D0 84D1 84D2 84D3 84D4 84D5 84D6 84D7 84D8 84D9 84DA 84DB 84DC 84DD 84DE 84DF
A0  84E0 84E1 84E2 84E3 84E4 84E5 84E6 84E7 84E8 84E9 84EA 84EB 84EC 84ED 84EE 84EF
B0  84F0 84F1 84F2 84F3 84F4 84F5 84F6 84F7 84F8 84F9 84FA 84FB 84FC 84FD 84FE 84FF
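
The code points in the layout above follow from the ordinary three-byte UTF-8 bit pattern (1110xxxx 10xxxxxx 10xxxxxx), which CESU-8 shares for characters in the Basic Multilingual Plane. The following is a minimal Python sketch of that arithmetic, not ICU's implementation; `decode_three_byte` is a hypothetical helper name, and it checks only the marker bits (it does not reject overlong forms).

```python
def decode_three_byte(b0: int, b1: int, b2: int) -> int:
    """Decode one sequence of the form 1110xxxx 10xxxxxx 10xxxxxx.

    Hypothetical helper for illustration; only the marker bits are
    validated, so overlong encodings are not rejected.
    """
    if b0 & 0xF0 != 0xE0 or b1 & 0xC0 != 0x80 or b2 & 0xC0 != 0x80:
        raise ValueError("not a well-formed three-byte sequence")
    # 4 payload bits from the lead byte, 6 from each trail byte.
    return ((b0 & 0x0F) << 12) | ((b1 & 0x3F) << 6) | (b2 & 0x3F)


# Rebuild the populated rows (third byte 0x80..0xBF) for lead bytes E8 93.
layout = {b2: decode_three_byte(0xE8, 0x93, b2) for b2 in range(0x80, 0xC0)}

for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{layout[row + col]:04X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

Third bytes outside 80-BF are not valid trail bytes, which is why the remaining rows of the grid are empty.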

Information About This Converter

Type of converter: UCNV_CESU8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
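
The byte counts above are per UChar, that is, per UTF-16 code unit: CESU-8 encodes each code unit on its own, so a supplementary character becomes a surrogate pair of two three-byte sequences (six bytes in total). A minimal Python sketch of that scheme follows. It is an illustration rather than ICU's converter, `cesu8_encode` is a hypothetical name, and it assumes no dedicated CESU-8 codec is available, so it goes through UTF-16 and the `surrogatepass` error handler instead.

```python
def cesu8_encode(text: str) -> bytes:
    """Encode text as CESU-8 (hypothetical helper, not an ICU API)."""
    out = bytearray()
    # Split into UTF-16 code units; supplementary characters become pairs.
    utf16 = text.encode("utf-16-be", "surrogatepass")
    for i in range(0, len(utf16), 2):
        unit = int.from_bytes(utf16[i:i + 2], "big")
        # Encode each code unit separately with the UTF-8 bit layout;
        # "surrogatepass" lets lone surrogates through as 3-byte sequences.
        out += chr(unit).encode("utf-8", "surrogatepass")
    return bytes(out)


print(cesu8_encode("A").hex(" "))           # 41
print(cesu8_encode("\u84c0").hex(" "))      # e8 93 80
print(cesu8_encode("\U0001F600").hex(" "))  # ed a0 bd ed b8 80
print(cesu8_encode("\ufffd").hex(" "))      # ef bf bd
```

The last line reproduces the substitution character listed above, which is U+FFFD in its three-byte form.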


Set of Unicode Characters Representable By This Codepage

[\u0000-\U0010FFFF]