International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes E0BA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
0E80
0E81
0E82
0E83
0E84
0E85
0E86
0E87
0E88
0E89
0E8A
0E8B
0E8C
0E8D
0E8E
0E8F
80
90
0E90
0E91
0E92
0E93
0E94
0E95
0E96
0E97
0E98
0E99
0E9A
0E9B
0E9C
0E9D
0E9E
0E9F
90
A0
0EA0
0EA1
0EA2
0EA3
0EA4
0EA5
0EA6
0EA7
0EA8
0EA9
0EAA
0EAB
0EAC
0EAD
0EAE
0EAF
A0
B0
0EB0
 ັ
0EB1
0EB2
0EB3
 ິ
0EB4
 ີ
0EB5
 ຶ
0EB6
 ື
0EB7
 ຸ
0EB8
 ູ
0EB9
 ຺
0EBA
 ົ
0EBB
 ຼ
0EBC
0EBD
0EBE
຿
0EBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]