International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A391

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𣑀
23440
𣑁
23441
𣑂
23442
𣑃
23443
𣑄
23444
𣑅
23445
𣑆
23446
𣑇
23447
𣑈
23448
𣑉
23449
𣑊
2344A
𣑋
2344B
𣑌
2344C
𣑍
2344D
𣑎
2344E
𣑏
2344F
80
90
𣑐
23450
𣑑
23451
𣑒
23452
𣑓
23453
𣑔
23454
𣑕
23455
𣑖
23456
𣑗
23457
𣑘
23458
𣑙
23459
𣑚
2345A
𣑛
2345B
𣑜
2345C
𣑝
2345D
𣑞
2345E
𣑟
2345F
90
A0
𣑠
23460
𣑡
23461
𣑢
23462
𣑣
23463
𣑤
23464
𣑥
23465
𣑦
23466
𣑧
23467
𣑨
23468
𣑩
23469
𣑪
2346A
𣑫
2346B
𣑬
2346C
𣑭
2346D
𣑮
2346E
𣑯
2346F
A0
B0
𣑰
23470
𣑱
23471
𣑲
23472
𣑳
23473
𣑴
23474
𣑵
23475
𣑶
23476
𣑷
23477
𣑸
23478
𣑹
23479
𣑺
2347A
𣑻
2347B
𣑼
2347C
𣑽
2347D
𣑾
2347E
𣑿
2347F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]