International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes EF9B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
F6C0
F6C1
F6C2
F6C3
F6C4
F6C5
F6C6
F6C7
F6C8
F6C9
F6CA
F6CB
F6CC
F6CD
F6CE
F6CF
80
90
F6D0
F6D1
F6D2
F6D3
F6D4
F6D5
F6D6
F6D7
F6D8
F6D9
F6DA
F6DB
F6DC
F6DD
F6DE
F6DF
90
A0
F6E0
F6E1
F6E2
F6E3
F6E4
F6E5
F6E6
F6E7
F6E8
F6E9
F6EA
F6EB
F6EC
F6ED
F6EE
F6EF
A0
B0
F6F0
F6F1
F6F2
F6F3
F6F4
F6F5
F6F6
F6F7
F6F8
F6F9
F6FA
F6FB
F6FC
F6FD
F6FE
F6FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]