International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18A91

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񊑀
4A440
񊑁
4A441
񊑂
4A442
񊑃
4A443
񊑄
4A444
񊑅
4A445
񊑆
4A446
񊑇
4A447
񊑈
4A448
񊑉
4A449
񊑊
4A44A
񊑋
4A44B
񊑌
4A44C
񊑍
4A44D
񊑎
4A44E
񊑏
4A44F
80
90
񊑐
4A450
񊑑
4A451
񊑒
4A452
񊑓
4A453
񊑔
4A454
񊑕
4A455
񊑖
4A456
񊑗
4A457
񊑘
4A458
񊑙
4A459
񊑚
4A45A
񊑛
4A45B
񊑜
4A45C
񊑝
4A45D
񊑞
4A45E
񊑟
4A45F
90
A0
񊑠
4A460
񊑡
4A461
񊑢
4A462
񊑣
4A463
񊑤
4A464
񊑥
4A465
񊑦
4A466
񊑧
4A467
񊑨
4A468
񊑩
4A469
񊑪
4A46A
񊑫
4A46B
񊑬
4A46C
񊑭
4A46D
񊑮
4A46E
񊑯
4A46F
A0
B0
񊑰
4A470
񊑱
4A471
񊑲
4A472
񊑳
4A473
񊑴
4A474
񊑵
4A475
񊑶
4A476
񊑷
4A477
񊑸
4A478
񊑹
4A479
񊑺
4A47A
񊑻
4A47B
񊑼
4A47C
񊑽
4A47D
񊑾
4A47E
񊑿
4A47F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]