International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48C95

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􌕀
10C540
􌕁
10C541
􌕂
10C542
􌕃
10C543
􌕄
10C544
􌕅
10C545
􌕆
10C546
􌕇
10C547
􌕈
10C548
􌕉
10C549
􌕊
10C54A
􌕋
10C54B
􌕌
10C54C
􌕍
10C54D
􌕎
10C54E
􌕏
10C54F
80
90
􌕐
10C550
􌕑
10C551
􌕒
10C552
􌕓
10C553
􌕔
10C554
􌕕
10C555
􌕖
10C556
􌕗
10C557
􌕘
10C558
􌕙
10C559
􌕚
10C55A
􌕛
10C55B
􌕜
10C55C
􌕝
10C55D
􌕞
10C55E
􌕟
10C55F
90
A0
􌕠
10C560
􌕡
10C561
􌕢
10C562
􌕣
10C563
􌕤
10C564
􌕥
10C565
􌕦
10C566
􌕧
10C567
􌕨
10C568
􌕩
10C569
􌕪
10C56A
􌕫
10C56B
􌕬
10C56C
􌕭
10C56D
􌕮
10C56E
􌕯
10C56F
A0
B0
􌕰
10C570
􌕱
10C571
􌕲
10C572
􌕳
10C573
􌕴
10C574
􌕵
10C575
􌕶
10C576
􌕷
10C577
􌕸
10C578
􌕹
10C579
􌕺
10C57A
􌕻
10C57B
􌕼
10C57C
􌕽
10C57D
􌕾
10C57E
􌕿
10C57F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]