International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes EF93

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
F4C0
F4C1
F4C2
F4C3
F4C4
F4C5
F4C6
F4C7
F4C8
F4C9
F4CA
F4CB
F4CC
F4CD
F4CE
F4CF
80
90
F4D0
F4D1
F4D2
F4D3
F4D4
F4D5
F4D6
F4D7
F4D8
F4D9
F4DA
F4DB
F4DC
F4DD
F4DE
F4DF
90
A0
F4E0
F4E1
F4E2
F4E3
F4E4
F4E5
F4E6
F4E7
F4E8
F4E9
F4EA
F4EB
F4EC
F4ED
F4EE
F4EF
A0
B0
F4F0
F4F1
F4F2
F4F3
F4F4
F4F5
F4F6
F4F7
F4F8
F4F9
F4FA
F4FB
F4FC
F4FD
F4FE
F4FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]