International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
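
As an illustration of how an alias list like the one above might be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The `normalize` rule (lowercase, letters and digits only) is a simplifying assumption made for this example, not ICU's own name-matching algorithm, and the names `UTF8_ALIASES`, `ALIAS_TABLE`, and `resolve` are hypothetical.

```python
import re
from typing import Optional

# Aliases copied from the list above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Assumed matching rule: ignore case and everything but letters/digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())

# Lookup table from normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def resolve(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this sketch, `resolve("IBM-1208")` and `resolve("x-utf_8j")` both return `"UTF-8"`, while a name that is not in the list, such as `"latin1"`, returns `None`.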


Codepage Layout

Currently showing the codepage starting with the bytes F0A091

Rows give the high nibble of the final byte and columns give the low nibble. Each cell shows the character followed by its Unicode code point in hexadecimal.

    00        01        02        03        04        05        06        07        08        09        0A        0B        0C        0D        0E        0F
80  𠑀 20440  𠑁 20441  𠑂 20442  𠑃 20443  𠑄 20444  𠑅 20445  𠑆 20446  𠑇 20447  𠑈 20448  𠑉 20449  𠑊 2044A  𠑋 2044B  𠑌 2044C  𠑍 2044D  𠑎 2044E  𠑏 2044F
90  𠑐 20450  𠑑 20451  𠑒 20452  𠑓 20453  𠑔 20454  𠑕 20455  𠑖 20456  𠑗 20457  𠑘 20458  𠑙 20459  𠑚 2045A  𠑛 2045B  𠑜 2045C  𠑝 2045D  𠑞 2045E  𠑟 2045F
A0  𠑠 20460  𠑡 20461  𠑢 20462  𠑣 20463  𠑤 20464  𠑥 20465  𠑦 20466  𠑧 20467  𠑨 20468  𠑩 20469  𠑪 2046A  𠑫 2046B  𠑬 2046C  𠑭 2046D  𠑮 2046E  𠑯 2046F
B0  𠑰 20470  𠑱 20471  𠑲 20472  𠑳 20473  𠑴 20474  𠑵 20475  𠑶 20476  𠑷 20477  𠑸 20478  𠑹 20479  𠑺 2047A  𠑻 2047B  𠑼 2047C  𠑽 2047D  𠑾 2047E  𠑿 2047F

Rows 00 through 70 and C0 through F0 are empty.
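
The relationship between the byte prefix F0 A0 91 and the code points in the layout can be checked by hand. The following Python sketch applies the standard 4-byte UTF-8 bit layout (3 payload bits from the lead byte, 6 from each trail byte) and cross-checks the result against Python's built-in UTF-8 decoder. The names `decode_4byte`, `PREFIX`, and `layout_cell` are hypothetical helpers for this example and are not part of ICU.

```python
def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)

# The three leading bytes of the codepage shown above.
PREFIX = (0xF0, 0xA0, 0x91)

def layout_cell(last_byte: int):
    """Return (character, hex code point) for one cell of the layout."""
    cp = decode_4byte(*PREFIX, last_byte)
    text = bytes((*PREFIX, last_byte)).decode("utf-8")
    # The manual bit arithmetic and the built-in decoder must agree.
    assert ord(text) == cp
    return text, format(cp, "X")

print(layout_cell(0x80)[1])  # 20440 (first cell of row 80)
print(layout_cell(0xBF)[1])  # 2047F (last cell of row B0)
```

Only final bytes in the range 80 to BF are valid trail bytes, which is consistent with the other rows of the layout being empty.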

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
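
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. One way to read the maximum of 3 is sketched below in Python: a supplementary character takes 4 UTF-8 bytes but occupies 2 UTF-16 code units, so the worst case per code unit comes from 3-byte BMP characters. This is an illustrative interpretation using only the Python standard library, and the helper names are hypothetical.

```python
def utf8_len(ch: str) -> int:
    """Number of UTF-8 bytes needed for one character."""
    return len(ch.encode("utf-8"))

def utf16_units(ch: str) -> int:
    """Number of 16-bit UTF-16 code units needed for one character."""
    return len(ch.encode("utf-16-le")) // 2

def bytes_per_unit(ch: str) -> float:
    """UTF-8 bytes per UTF-16 code unit for one character."""
    return utf8_len(ch) / utf16_units(ch)

print(bytes_per_unit("A"))           # 1.0 (ASCII: the minimum)
print(bytes_per_unit("\uffff"))      # 3.0 (BMP: the maximum)
print(bytes_per_unit("\U00020440"))  # 2.0 (4 bytes over 2 code units)

# The substitution character bytes are the UTF-8 form of U+FFFD.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
# Python's "replace" error handler also substitutes U+FFFD for bad input.
assert b"\xff".decode("utf-8", errors="replace") == "\ufffd"
```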

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
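
The set above excludes only the surrogate range U+D800 to U+DFFF. A small Python sketch can confirm the same boundary with Python's own strict UTF-8 codec, which refuses to encode lone surrogates. The functions `is_representable` and `try_encode` are hypothetical names for this example, and the check reflects Python's codec behavior rather than a call into ICU.

```python
from typing import Optional

def is_representable(cp: int) -> bool:
    """Mirror the set [^\\uD800-\\uDFFF] over the Unicode code point range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def try_encode(cp: int) -> Optional[bytes]:
    """Encode one code point as UTF-8, or return None if the codec rejects it."""
    try:
        return chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return None

# Code points on either side of the surrogate block encode normally.
print(try_encode(0xD7FF))  # b'\xed\x9f\xbf'
print(try_encode(0xE000))  # b'\xee\x80\x80'
# Surrogates are rejected by the strict codec.
print(try_encode(0xD800))  # None
```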