International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A081

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𠁀
20040
𠁁
20041
𠁂
20042
𠁃
20043
𠁄
20044
𠁅
20045
𠁆
20046
𠁇
20047
𠁈
20048
𠁉
20049
𠁊
2004A
𠁋
2004B
𠁌
2004C
𠁍
2004D
𠁎
2004E
𠁏
2004F
80
90
𠁐
20050
𠁑
20051
𠁒
20052
𠁓
20053
𠁔
20054
𠁕
20055
𠁖
20056
𠁗
20057
𠁘
20058
𠁙
20059
𠁚
2005A
𠁛
2005B
𠁜
2005C
𠁝
2005D
𠁞
2005E
𠁟
2005F
90
A0
𠁠
20060
𠁡
20061
𠁢
20062
𠁣
20063
𠁤
20064
𠁥
20065
𠁦
20066
𠁧
20067
𠁨
20068
𠁩
20069
𠁪
2006A
𠁫
2006B
𠁬
2006C
𠁭
2006D
𠁮
2006E
𠁯
2006F
A0
B0
𠁰
20070
𠁱
20071
𠁲
20072
𠁳
20073
𠁴
20074
𠁵
20075
𠁶
20076
𠁷
20077
𠁸
20078
𠁹
20079
𠁺
2007A
𠁻
2007B
𠁼
2007C
𠁽
2007D
𠁾
2007E
𠁿
2007F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]