International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
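
As an illustration of how an alias list like this can be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for this sketch, and `normalize` and `resolve` are hypothetical helper names, not part of ICU.

```python
import re

# Aliases copied from the list above; all of them name the same converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Fold case and drop hyphens, underscores, and spaces (assumed rule)."""
    return re.sub(r"[-_ ]", "", name).lower()

# Lookup table from normalized alias to the internal converter name.
ALIAS_TO_INTERNAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_INTERNAL.get(normalize(name))

if __name__ == "__main__":
    for candidate in ("IBM-1208", "x-utf_8j", "latin1"):
        print(candidate, "->", resolve(candidate))
```

Normalizing both the stored aliases and the query keeps the lookup a single dictionary access, at the cost of treating names that differ only in punctuation as identical.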


Codepage Layout

Currently showing the code page for byte sequences that start with the bytes F3 90 BA

Rows give the high nibble and columns the low nibble of the final byte; each cell is the resulting code point in hex.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     D0E80 D0E81 D0E82 D0E83 D0E84 D0E85 D0E86 D0E87 D0E88 D0E89 D0E8A D0E8B D0E8C D0E8D D0E8E D0E8F
90     D0E90 D0E91 D0E92 D0E93 D0E94 D0E95 D0E96 D0E97 D0E98 D0E99 D0E9A D0E9B D0E9C D0E9D D0E9E D0E9F
A0     D0EA0 D0EA1 D0EA2 D0EA3 D0EA4 D0EA5 D0EA6 D0EA7 D0EA8 D0EA9 D0EAA D0EAB D0EAC D0EAD D0EAE D0EAF
B0     D0EB0 D0EB1 D0EB2 D0EB3 D0EB4 D0EB5 D0EB6 D0EB7 D0EB8 D0EB9 D0EBA D0EBB D0EBC D0EBD D0EBE D0EBF
C0-F0  (empty)
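
The code points in this layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below recomputes them from the lead bytes F3 90 BA and a final byte, then cross-checks the result against Python's built-in UTF-8 codec. `code_point_for` is a hypothetical helper name used only for this illustration.

```python
# The three lead bytes of the page shown above.
LEAD = bytes([0xF3, 0x90, 0xBA])

def code_point_for(trail: int) -> int:
    """Combine the lead bytes with one final byte into a code point."""
    if not 0x80 <= trail <= 0xBF:
        # Only 80..BF are valid UTF-8 continuation bytes, which is why the
        # other rows of the layout are empty.
        raise ValueError("not a UTF-8 continuation byte: %#04x" % trail)
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each following byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )

if __name__ == "__main__":
    for trail in (0x80, 0x9A, 0xBF):
        manual = code_point_for(trail)
        # Cross-check against the built-in decoder.
        decoded = ord((LEAD + bytes([trail])).decode("utf-8"))
        print("%02X -> U+%05X (codec agrees: %s)" % (trail, manual, manual == decoded))
```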

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
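
The byte counts and the substitution character above can be checked with a short Python sketch. It assumes that a UChar is a 16-bit UTF-16 code unit, so a supplementary character counts as two UChars, and it uses Python's built-in codecs rather than ICU. `utf8_bytes_per_uchar` is a hypothetical helper name.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code units."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units

# The substitution character bytes are the UTF-8 form of U+FFFD.
SUBSTITUTION_BYTES = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    print(utf8_bytes_per_uchar("A"))           # ASCII letter
    print(utf8_bytes_per_uchar("\u20ac"))      # a BMP character (euro sign)
    print(utf8_bytes_per_uchar("\U000D0E80"))  # supplementary: 4 bytes, 2 UChars
    print(SUBSTITUTION_BYTES)
    # Python's decoder also falls back to U+FFFD for malformed input.
    print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```

Under that assumption, a four-byte UTF-8 sequence spans two UChars, which is consistent with a reported maximum of 3 bytes per UChar.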

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
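
The set above covers every code point except the surrogates U+D800 to U+DFFF. The following sketch checks this against Python's strict UTF-8 encoder, which also refuses lone surrogates. It illustrates the set only and does not call ICU; both function names are hypothetical.

```python
def in_representable_set(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not 0xD800 <= cp <= 0xDFFF

def encodes_as_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

if __name__ == "__main__":
    # Boundary cases around the surrogate range, plus the last code point.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print("U+%04X" % cp, in_representable_set(cp), encodes_as_utf8(cp))
```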