International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes EC97

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
C5C0
C5C1
C5C2
C5C3
C5C4
C5C5
C5C6
C5C7
C5C8
C5C9
C5CA
C5CB
C5CC
C5CD
C5CE
C5CF
80
90
C5D0
C5D1
C5D2
C5D3
C5D4
C5D5
C5D6
C5D7
C5D8
C5D9
C5DA
C5DB
C5DC
C5DD
C5DE
C5DF
90
A0
C5E0
C5E1
C5E2
C5E3
C5E4
C5E5
C5E6
C5E7
C5E8
C5E9
C5EA
C5EB
C5EC
C5ED
C5EE
C5EF
A0
B0
C5F0
C5F1
C5F2
C5F3
C5F4
C5F5
C5F6
C5F7
C5F8
C5F9
C5FA
C5FB
C5FC
C5FD
C5FE
C5FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]