International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09291

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𒑀
12440
𒑁
12441
𒑂
12442
𒑃
12443
𒑄
12444
𒑅
12445
𒑆
12446
𒑇
12447
𒑈
12448
𒑉
12449
𒑊
1244A
𒑋
1244B
𒑌
1244C
𒑍
1244D
𒑎
1244E
𒑏
1244F
80
90
𒑐
12450
𒑑
12451
𒑒
12452
𒑓
12453
𒑔
12454
𒑕
12455
𒑖
12456
𒑗
12457
𒑘
12458
𒑙
12459
𒑚
1245A
𒑛
1245B
𒑜
1245C
𒑝
1245D
𒑞
1245E
𒑟
1245F
90
A0
𒑠
12460
𒑡
12461
𒑢
12462
𒑣
12463
𒑤
12464
𒑥
12465
𒑦
12466
𒑧
12467
𒑨
12468
𒑩
12469
𒑪
1246A
𒑫
1246B
𒑬
1246C
𒑭
1246D
𒑮
1246E
𒑯
1246F
A0
B0
𒑰
12470
𒑱
12471
𒑲
12472
𒑳
12473
𒑴
12474
𒑵
12475
𒑶
12476
𒑷
12477
𒑸
12478
𒑹
12479
𒑺
1247A
𒑻
1247B
𒑼
1247C
𒑽
1247D
𒑾
1247E
𒑿
1247F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]