International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes E8AF

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
8BC0
8BC1
8BC2
8BC3
8BC4
8BC5
8BC6
8BC7
8BC8
8BC9
8BCA
8BCB
8BCC
8BCD
8BCE
8BCF
80
90
8BD0
8BD1
8BD2
8BD3
8BD4
8BD5
8BD6
8BD7
8BD8
8BD9
8BDA
8BDB
8BDC
8BDD
8BDE
8BDF
90
A0
8BE0
8BE1
8BE2
8BE3
8BE4
8BE5
8BE6
8BE7
8BE8
8BE9
8BEA
8BEB
8BEC
8BED
8BEE
8BEF
A0
B0
8BF0
8BF1
8BF2
8BF3
8BF4
8BF5
8BF6
8BF7
8BF8
8BF9
8BFA
8BFB
8BFC
8BFD
8BFE
诿
8BFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]