International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
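
The alias list can be used to map an incoming charset label to the internal converter name. The following is a minimal sketch and does not call ICU: the `normalize` and `canonical_name` helpers are hypothetical, and the loose matching rule (ignore case, hyphens, underscores and spaces) is an assumption that may differ in detail from ICU's own alias comparison.

```python
import re

# Alias names copied from the list above; the lookup itself is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose matching: drop '-', '_' and spaces, then lowercase."""
    return re.sub(r"[-_ ]", "", name).lower()


# Every listed alias resolves to the internal converter name "UTF-8".
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return "UTF-8" if name matches a listed alias, otherwise None."""
    return ALIAS_TO_CANONICAL.get(normalize(name))
```

With this sketch, `canonical_name("IBM_1208")` and `canonical_name("Windows 65001")` both resolve to `"UTF-8"`, while a name that is not in the list returns `None`.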


Codepage Layout

Currently showing the codepage section for byte sequences starting with the bytes F2 A5 9C. Row labels give the high nibble of the final byte and column labels the low nibble; each cell is the code point (hex) of the character encoded by that 4-byte sequence. Rows 00-70 and C0-F0 are empty.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80   A5700 A5701 A5702 A5703 A5704 A5705 A5706 A5707 A5708 A5709 A570A A570B A570C A570D A570E A570F
90   A5710 A5711 A5712 A5713 A5714 A5715 A5716 A5717 A5718 A5719 A571A A571B A571C A571D A571E A571F
A0   A5720 A5721 A5722 A5723 A5724 A5725 A5726 A5727 A5728 A5729 A572A A572B A572C A572D A572E A572F
B0   A5730 A5731 A5732 A5733 A5734 A5735 A5736 A5737 A5738 A5739 A573A A573B A573C A573D A573E A573F
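
The mapping in the layout follows from the UTF-8 bit layout for 4-byte sequences: the lead byte contributes 3 bits and each trail byte contributes 6. The sketch below reproduces the first and last cells by bit arithmetic. The function name `decode_utf8_4byte` is hypothetical, and the validation is deliberately simplified (it does not reject overlong forms or values above U+10FFFF).

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point by bit arithmetic."""
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80-BF")
    return (
        (seq[0] & 0x07) << 18    # 3 payload bits from the lead byte
        | (seq[1] & 0x3F) << 12  # 6 payload bits from each trail byte
        | (seq[2] & 0x3F) << 6
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F2A59C")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X} .. {last:X}")  # A5700 .. A573F
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.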

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
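
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per character. A rough way to see why the maximum is 3 rather than 4 is to compare UTF-8 and UTF-16 lengths for sample characters; the helper `bytes_per_uchar` below is hypothetical and uses only Python's built-in codecs, not ICU.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One character from each UTF-8 length class (1, 2, 3 and 4 bytes).
samples = ["A", "\u00E9", "\u20AC", "\U000A5700"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
```

A 4-byte sequence encodes a supplementary character that occupies two UChars, so it averages 2 bytes per UChar; the worst case is a 3-byte sequence for a single BMP code unit.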

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
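
The set above covers every code point except the surrogate range U+D800 to U+DFFF. A small sketch of that membership test, cross-checked against Python's strict UTF-8 encoder, is shown below; both function names are hypothetical and the check uses Python's codec rather than ICU.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is valid and outside the surrogate range."""
    in_unicode_range = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_unicode_range and not is_surrogate


def encodes_in_python(code_point: int) -> bool:
    """Cross-check: Python's strict UTF-8 encoder rejects lone surrogates."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values on either side of the surrogate range.
boundaries = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
agreement = all(is_representable(cp) == encodes_in_python(cp) for cp in boundaries)
```

Surrogates are excluded because they are reserved for UTF-16 pairing and have no well-formed UTF-8 encoding of their own.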