International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48791

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􇑀
107440
􇑁
107441
􇑂
107442
􇑃
107443
􇑄
107444
􇑅
107445
􇑆
107446
􇑇
107447
􇑈
107448
􇑉
107449
􇑊
10744A
􇑋
10744B
􇑌
10744C
􇑍
10744D
􇑎
10744E
􇑏
10744F
80
90
􇑐
107450
􇑑
107451
􇑒
107452
􇑓
107453
􇑔
107454
􇑕
107455
􇑖
107456
􇑗
107457
􇑘
107458
􇑙
107459
􇑚
10745A
􇑛
10745B
􇑜
10745C
􇑝
10745D
􇑞
10745E
􇑟
10745F
90
A0
􇑠
107460
􇑡
107461
􇑢
107462
􇑣
107463
􇑤
107464
􇑥
107465
􇑦
107466
􇑧
107467
􇑨
107468
􇑩
107469
􇑪
10746A
􇑫
10746B
􇑬
10746C
􇑭
10746D
􇑮
10746E
􇑯
10746F
A0
B0
􇑰
107470
􇑱
107471
􇑲
107472
􇑳
107473
􇑴
107474
􇑵
107475
􇑶
107476
􇑷
107477
􇑸
107478
􇑹
107479
􇑺
10747A
􇑻
10747B
􇑼
10747C
􇑽
10747D
􇑾
10747E
􇑿
10747F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]