International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48291

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􂑀
102440
􂑁
102441
􂑂
102442
􂑃
102443
􂑄
102444
􂑅
102445
􂑆
102446
􂑇
102447
􂑈
102448
􂑉
102449
􂑊
10244A
􂑋
10244B
􂑌
10244C
􂑍
10244D
􂑎
10244E
􂑏
10244F
80
90
􂑐
102450
􂑑
102451
􂑒
102452
􂑓
102453
􂑔
102454
􂑕
102455
􂑖
102456
􂑗
102457
􂑘
102458
􂑙
102459
􂑚
10245A
􂑛
10245B
􂑜
10245C
􂑝
10245D
􂑞
10245E
􂑟
10245F
90
A0
􂑠
102460
􂑡
102461
􂑢
102462
􂑣
102463
􂑤
102464
􂑥
102465
􂑦
102466
􂑧
102467
􂑨
102468
􂑩
102469
􂑪
10246A
􂑫
10246B
􂑬
10246C
􂑭
10246D
􂑮
10246E
􂑯
10246F
A0
B0
􂑰
102470
􂑱
102471
􂑲
102472
􂑳
102473
􂑴
102474
􂑵
102475
􂑶
102476
􂑷
102477
􂑸
102478
􂑹
102479
􂑺
10247A
􂑻
10247B
􂑼
10247C
􂑽
10247D
􂑾
10247E
􂑿
10247F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]