International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09A95

Final byte  Character  Code point
00-7F       (none)
80          𚕀          U+1A540
81          𚕁          U+1A541
82          𚕂          U+1A542
83          𚕃          U+1A543
84          𚕄          U+1A544
85          𚕅          U+1A545
86          𚕆          U+1A546
87          𚕇          U+1A547
88          𚕈          U+1A548
89          𚕉          U+1A549
8A          𚕊          U+1A54A
8B          𚕋          U+1A54B
8C          𚕌          U+1A54C
8D          𚕍          U+1A54D
8E          𚕎          U+1A54E
8F          𚕏          U+1A54F
90          𚕐          U+1A550
91          𚕑          U+1A551
92          𚕒          U+1A552
93          𚕓          U+1A553
94          𚕔          U+1A554
95          𚕕          U+1A555
96          𚕖          U+1A556
97          𚕗          U+1A557
98          𚕘          U+1A558
99          𚕙          U+1A559
9A          𚕚          U+1A55A
9B          𚕛          U+1A55B
9C          𚕜          U+1A55C
9D          𚕝          U+1A55D
9E          𚕞          U+1A55E
9F          𚕟          U+1A55F
A0          𚕠          U+1A560
A1          𚕡          U+1A561
A2          𚕢          U+1A562
A3          𚕣          U+1A563
A4          𚕤          U+1A564
A5          𚕥          U+1A565
A6          𚕦          U+1A566
A7          𚕧          U+1A567
A8          𚕨          U+1A568
A9          𚕩          U+1A569
AA          𚕪          U+1A56A
AB          𚕫          U+1A56B
AC          𚕬          U+1A56C
AD          𚕭          U+1A56D
AE          𚕮          U+1A56E
AF          𚕯          U+1A56F
B0          𚕰          U+1A570
B1          𚕱          U+1A571
B2          𚕲          U+1A572
B3          𚕳          U+1A573
B4          𚕴          U+1A574
B5          𚕵          U+1A575
B6          𚕶          U+1A576
B7          𚕷          U+1A577
B8          𚕸          U+1A578
B9          𚕹          U+1A579
BA          𚕺          U+1A57A
BB          𚕻          U+1A57B
BC          𚕼          U+1A57C
BD          𚕽          U+1A57D
BE          𚕾          U+1A57E
BF          𚕿          U+1A57F
C0-FF       (none)
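
The relationship between the byte prefix F09A95 and the code points in the layout can be checked by hand. The following is a minimal Python sketch, not part of the Converter Explorer itself; the helper name `decode_utf8_4byte` is hypothetical, and it only unpacks the payload bits of a 4-byte sequence without checking for overlong forms or values above U+10FFFF.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Return the code point encoded by a 4-byte UTF-8 sequence (sketch only)."""
    # The lead byte must look like 11110xxx.
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 sequence")
    # Each following byte must look like 10xxxxxx.
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("invalid continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F09A95")
first = decode_utf8_4byte(prefix + b"\x80")  # lowest valid final byte
last = decode_utf8_4byte(prefix + b"\xBF")   # highest valid final byte
print(f"{first:X} {last:X}")  # 1A540 1A57F
```

Only final bytes 80 through BF are continuation bytes, which is consistent with the layout showing no entries for the other final-byte values.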

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
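
Several of these properties can be reproduced with Python's built-in UTF-8 codec. This is a rough sketch rather than ICU code: it assumes a UChar is one 16-bit UTF-16 code unit, the helper name `bytes_per_uchar` is hypothetical, and the sample characters are arbitrary.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII letter, Latin-1 letter, BMP symbol, supplementary-plane code point.
samples = ["A", "\u00e9", "\u20ac", "\U0001A540"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# U+FFFD encodes to the bytes listed as the substitution character.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'

# Every byte in 0x20-0x7E decodes to the code point with the same value.
ascii_compatible = all(
    bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F)
)
print(ascii_compatible)  # True
```

A supplementary-plane character takes four bytes but occupies two UTF-16 code units, so the per-UChar maximum of 3 comes from BMP characters such as U+20AC.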

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
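
The set above excludes only the surrogate range. As an illustration, Python's strict UTF-8 codec applies the same restriction, so it can serve as a quick membership check; the helper name `is_representable` is hypothetical, and this sketch tests Python's codec, not the ICU converter.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates U+D800..U+DFFF.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x1A540, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```

The code points on either side of the surrogate block (U+D7FF and U+E000) are accepted, while both ends of the block itself are rejected.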