International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A591

Row  Characters in columns 00 to 0F  Code points
80   𥑀 𥑁 𥑂 𥑃 𥑄 𥑅 𥑆 𥑇 𥑈 𥑉 𥑊 𥑋 𥑌 𥑍 𥑎 𥑏  25440 to 2544F
90   𥑐 𥑑 𥑒 𥑓 𥑔 𥑕 𥑖 𥑗 𥑘 𥑙 𥑚 𥑛 𥑜 𥑝 𥑞 𥑟  25450 to 2545F
A0   𥑠 𥑡 𥑢 𥑣 𥑤 𥑥 𥑦 𥑧 𥑨 𥑩 𥑪 𥑫 𥑬 𥑭 𥑮 𥑯  25460 to 2546F
B0   𥑰 𥑱 𥑲 𥑳 𥑴 𥑵 𥑶 𥑷 𥑸 𥑹 𥑺 𥑻 𥑼 𥑽 𥑾 𥑿  25470 to 2547F

Rows 00 to 70 and C0 to F0 are empty.
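
The layout can be reproduced with a short sketch that appends every possible final byte to the lead bytes F0 A5 91 and keeps the combinations that decode. It uses Python's built-in UTF-8 codec rather than ICU, and the names `PREFIX` and `layout` are hypothetical.

```python
# Lead bytes of the codepage section shown on this page.
PREFIX = bytes.fromhex("F0A591")


def layout(prefix: bytes) -> dict:
    """Map each valid final byte to the code point it completes."""
    cells = {}
    for last in range(0x100):
        try:
            text = (prefix + bytes([last])).decode("utf-8")
        except UnicodeDecodeError:
            # Not a valid continuation byte: the cell stays empty.
            continue
        cells[last] = ord(text)
    return cells


if __name__ == "__main__":
    cells = layout(PREFIX)
    for row in range(0x00, 0x100, 0x10):
        # Show the code point in hex, or dots for an empty cell.
        line = " ".join(
            format(cells[row + col], "05X") if (row + col) in cells else "....."
            for col in range(0x10)
        )
        print(format(row, "02X"), line)
```

Only final bytes 80 through BF decode, because those are the only values UTF-8 allows for a continuation byte; that is why the other rows are empty.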

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
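
Several of these properties can be checked with a small sketch. It assumes a UChar is one UTF-16 code unit and uses Python's built-in codecs instead of ICU; `bytes_per_uchar`, `ascii_compatible`, and `SAMPLES` are names invented for the example.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / uchars


def ascii_compatible() -> bool:
    """True if every byte in 0x20..0x7E encodes to itself."""
    return all(chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F))


# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
SAMPLES = ["A", "\u00e9", "\u20ac", "\U00025440"]

if __name__ == "__main__":
    ratios = [bytes_per_uchar(ch) for ch in SAMPLES]
    print("min bytes per UChar:", min(ratios))
    print("max bytes per UChar:", max(ratios))
    print("ASCII compatible:", ascii_compatible())
    # The substitution bytes are the UTF-8 form of U+FFFD.
    print("substitution:", hex(ord(b"\xEF\xBF\xBD".decode("utf-8"))))
```

A four-byte sequence such as the one for U+25440 corresponds to two UTF-16 code units, so it contributes 2 bytes per UChar; the maximum of 3 comes from three-byte sequences in the Basic Multilingual Plane.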


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
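
The pattern reads as every code point except the surrogate range D800 to DFFF. A minimal sketch of that membership test follows, compared against Python's strict UTF-8 encoder rather than ICU; `is_representable` and `encodes` are hypothetical helper names.

```python
def is_representable(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Check the boundaries on either side of the surrogate range.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x25440, 0x10FFFF):
        print(format(cp, "06X"), is_representable(cp), encodes(cp))
```

Excluding the 2,048 surrogates from the 1,114,112 code points leaves 1,112,064 representable characters.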