International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39095

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐕀
D0540
󐕁
D0541
󐕂
D0542
󐕃
D0543
󐕄
D0544
󐕅
D0545
󐕆
D0546
󐕇
D0547
󐕈
D0548
󐕉
D0549
󐕊
D054A
󐕋
D054B
󐕌
D054C
󐕍
D054D
󐕎
D054E
󐕏
D054F
80
90
󐕐
D0550
󐕑
D0551
󐕒
D0552
󐕓
D0553
󐕔
D0554
󐕕
D0555
󐕖
D0556
󐕗
D0557
󐕘
D0558
󐕙
D0559
󐕚
D055A
󐕛
D055B
󐕜
D055C
󐕝
D055D
󐕞
D055E
󐕟
D055F
90
A0
󐕠
D0560
󐕡
D0561
󐕢
D0562
󐕣
D0563
󐕤
D0564
󐕥
D0565
󐕦
D0566
󐕧
D0567
󐕨
D0568
󐕩
D0569
󐕪
D056A
󐕫
D056B
󐕬
D056C
󐕭
D056D
󐕮
D056E
󐕯
D056F
A0
B0
󐕰
D0570
󐕱
D0571
󐕲
D0572
󐕳
D0573
󐕴
D0574
󐕵
D0575
󐕶
D0576
󐕷
D0577
󐕸
D0578
󐕹
D0579
󐕺
D057A
󐕻
D057B
󐕼
D057C
󐕽
D057D
󐕾
D057E
󐕿
D057F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]