International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09195

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𑕀
11540
𑕁
11541
𑕂
11542
𑕃
11543
𑕄
11544
𑕅
11545
𑕆
11546
𑕇
11547
𑕈
11548
𑕉
11549
𑕊
1154A
𑕋
1154B
𑕌
1154C
𑕍
1154D
𑕎
1154E
𑕏
1154F
80
90
𑕐
11550
𑕑
11551
𑕒
11552
𑕓
11553
𑕔
11554
𑕕
11555
𑕖
11556
𑕗
11557
𑕘
11558
𑕙
11559
𑕚
1155A
𑕛
1155B
𑕜
1155C
𑕝
1155D
𑕞
1155E
𑕟
1155F
90
A0
𑕠
11560
𑕡
11561
𑕢
11562
𑕣
11563
𑕤
11564
𑕥
11565
𑕦
11566
𑕧
11567
𑕨
11568
𑕩
11569
𑕪
1156A
𑕫
1156B
𑕬
1156C
𑕭
1156D
𑕮
1156E
𑕯
1156F
A0
B0
𑕰
11570
𑕱
11571
𑕲
11572
𑕳
11573
𑕴
11574
𑕵
11575
𑕶
11576
𑕷
11577
𑕸
11578
𑕹
11579
𑕺
1157A
𑕻
1157B
𑕼
1157C
𑕽
1157D
𑕾
1157E
𑕿
1157F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]