International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
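
As an illustration of how an alias list like this one can be used, the following is a minimal sketch of an alias lookup. The alias strings are copied from the list above; the helper names (`fold`, `canonical_name`) are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption rather than a statement of ICU's exact comparison rules.

```python
# Aliases copied from the listing above; everything else is illustrative.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def fold(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map each folded alias to the internal converter name.
_LOOKUP = {fold(alias): CANONICAL for alias in ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return _LOOKUP.get(fold(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "x-utf_8j", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

A real application would normally ask the converter library itself to resolve aliases; the table above only covers the names shown on this page.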


Codepage Layout

Currently showing the codepage starting with the bytes F4 8C 91

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the Unicode code point (hexadecimal) for that byte sequence. All 64 code points lie in Supplementary Private Use Area-B, so the glyph cells are omitted here. Rows 00-70 and C0-F0 are empty.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  10C440 10C441 10C442 10C443 10C444 10C445 10C446 10C447 10C448 10C449 10C44A 10C44B 10C44C 10C44D 10C44E 10C44F
90  10C450 10C451 10C452 10C453 10C454 10C455 10C456 10C457 10C458 10C459 10C45A 10C45B 10C45C 10C45D 10C45E 10C45F
A0  10C460 10C461 10C462 10C463 10C464 10C465 10C466 10C467 10C468 10C469 10C46A 10C46B 10C46C 10C46D 10C46E 10C46F
B0  10C470 10C471 10C472 10C473 10C474 10C475 10C476 10C477 10C478 10C479 10C47A 10C47B 10C47C 10C47D 10C47E 10C47F
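
The mapping in this layout follows from the standard UTF-8 bit layout for four-byte sequences. The sketch below decodes the lead bytes F4 8C 91 with the first and last possible trailing bytes and cross-checks the result against Python's built-in decoder. The function name `decode_4byte` is hypothetical, and the function only checks the byte shapes, not every well-formedness rule.

```python
def decode_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Only the lead-byte and continuation-byte bit patterns are checked.
    """
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead byte")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("bad continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


PREFIX = bytes.fromhex("F48C91")
first = decode_4byte(PREFIX + b"\x80")
last = decode_4byte(PREFIX + b"\xBF")

if __name__ == "__main__":
    print(f"{first:X}..{last:X}")  # 10C440..10C47F
    # Cross-check against the built-in UTF-8 decoder.
    assert (PREFIX + b"\x80").decode("utf-8") == chr(first)
```

Trailing bytes outside 80-BF do not have the continuation-byte pattern, which is consistent with the empty rows in the layout.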

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
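
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. The sketch below, using only Python's built-in codecs, shows why the maximum comes out as 3 even though UTF-8 sequences can be four bytes long: a four-byte sequence corresponds to two UTF-16 code units. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_uchars = len(ch.encode("utf-16-le")) // 2
    return n_bytes / n_uchars


# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
SAMPLES = ["A", "\u00e9", "\u20ac", chr(0x10C440)]
RATIOS = [bytes_per_uchar(ch) for ch in SAMPLES]

# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for ch, ratio in zip(SAMPLES, RATIOS):
        print(f"U+{ord(ch):04X}: {ratio} bytes per UChar")
    print(SUBSTITUTION.hex().upper())  # EFBFBD
```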

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
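
The set above reads as "every code point except the surrogate range U+D800 to U+DFFF". A minimal sketch of that membership test follows, compared against the behavior of Python's strict UTF-8 encoder at the range boundaries. The function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_as_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the excluded surrogate range.
BOUNDARIES = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10C440, 0x10FFFF]

if __name__ == "__main__":
    for cp in BOUNDARIES:
        print(f"U+{cp:04X}: {is_representable(cp)}")
```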