International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
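
The alias list can be read as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: `normalize`, `ALIAS_TABLE`, and `resolve` are hypothetical names, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption.

```python
# Aliases copied from the list above; all of them name the same converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose key: lowercase, then drop '-', '_' and spaces."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Every normalized alias points at the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM-1208"))
print(resolve("utf_8"))
print(resolve("latin-1"))
```

The first two calls resolve to `UTF-8`; the last returns `None` because `latin-1` is not among the aliases listed here.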


Codepage Layout

Currently showing the codepage starting with the bytes F0 B7 95

Rows give the high nibble of the fourth byte, columns the low nibble. Each cell shows the character followed by its code point (hexadecimal).

    00        01        02        03        04        05        06        07        08        09        0A        0B        0C        0D        0E        0F
80  𷕀 37540  𷕁 37541  𷕂 37542  𷕃 37543  𷕄 37544  𷕅 37545  𷕆 37546  𷕇 37547  𷕈 37548  𷕉 37549  𷕊 3754A  𷕋 3754B  𷕌 3754C  𷕍 3754D  𷕎 3754E  𷕏 3754F
90  𷕐 37550  𷕑 37551  𷕒 37552  𷕓 37553  𷕔 37554  𷕕 37555  𷕖 37556  𷕗 37557  𷕘 37558  𷕙 37559  𷕚 3755A  𷕛 3755B  𷕜 3755C  𷕝 3755D  𷕞 3755E  𷕟 3755F
A0  𷕠 37560  𷕡 37561  𷕢 37562  𷕣 37563  𷕤 37564  𷕥 37565  𷕦 37566  𷕧 37567  𷕨 37568  𷕩 37569  𷕪 3756A  𷕫 3756B  𷕬 3756C  𷕭 3756D  𷕮 3756E  𷕯 3756F
B0  𷕰 37570  𷕱 37571  𷕲 37572  𷕳 37573  𷕴 37574  𷕵 37575  𷕶 37576  𷕷 37577  𷕸 37578  𷕹 37579  𷕺 3757A  𷕻 3757B  𷕼 3757C  𷕽 3757D  𷕾 3757E  𷕿 3757F

Rows 00 through 70 and C0 through F0 are empty.
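
The layout follows from how a four-byte UTF-8 sequence packs a code point: 3 bits from the first byte and 6 bits from each of the three following bytes. As an illustrative sketch (using Python's built-in UTF-8 codec rather than ICU, with the hypothetical helpers `decode_manually` and `layout_rows`), the populated rows can be recomputed from the prefix F0 B7 95:

```python
LEAD = bytes([0xF0, 0xB7, 0x95])  # the three leading bytes shown above


def decode_manually(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_rows() -> dict:
    """Map each populated row (high nibble of byte 4) to its 16 code points."""
    rows = {}
    for high in (0x80, 0x90, 0xA0, 0xB0):
        rows[high] = [
            ord((LEAD + bytes([high | low])).decode("utf-8"))
            for low in range(16)
        ]
    return rows


rows = layout_rows()
for high, code_points in rows.items():
    print(f"{high:02X}", " ".join(f"{cp:05X}" for cp in code_points))
```

Only fourth bytes in the range 80 to BF are valid continuation bytes, which is why the remaining rows of the layout stay empty.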

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
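
The byte counts above are stated per UChar, that is, per UTF-16 code unit, not per code point. A small sketch using Python's standard codecs (not ICU; `bytes_per_uchar` and `SUBSTITUTION` are hypothetical names) shows how the figures of 1 and 3 arise and that the substitution bytes are the UTF-8 form of U+FFFD:

```python
def bytes_per_uchar(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len, uchars


# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
SUBSTITUTION = "\ufffd".encode("utf-8")

for ch in ("A", "\u20ac", "\U00037540"):
    utf8_len, uchars = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {utf8_len} bytes / {uchars} UChar(s)")

print(SUBSTITUTION.hex(" ").upper())
```

An ASCII letter takes 1 byte for 1 UChar and a BMP character such as U+20AC takes 3 bytes for 1 UChar. A supplementary character such as U+37540 takes 4 bytes but occupies 2 UChars (a surrogate pair), so it averages 2 bytes per UChar and stays within the stated maximum.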

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
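
The set pattern excludes only the surrogate range U+D800 to U+DFFF; every other code point is representable. As a hedged illustration, the check below (with the hypothetical helpers `is_representable` and `python_can_encode`) compares that rule against Python's strict UTF-8 encoder, which is a separate implementation from ICU but rejects the same range:

```python
def is_representable(cp: int) -> bool:
    """True if cp is a valid code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases around the excluded range, plus one supplementary code point.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x37540):
    print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```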