International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28C91

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򌑀
8C440
򌑁
8C441
򌑂
8C442
򌑃
8C443
򌑄
8C444
򌑅
8C445
򌑆
8C446
򌑇
8C447
򌑈
8C448
򌑉
8C449
򌑊
8C44A
򌑋
8C44B
򌑌
8C44C
򌑍
8C44D
򌑎
8C44E
򌑏
8C44F
80
90
򌑐
8C450
򌑑
8C451
򌑒
8C452
򌑓
8C453
򌑔
8C454
򌑕
8C455
򌑖
8C456
򌑗
8C457
򌑘
8C458
򌑙
8C459
򌑚
8C45A
򌑛
8C45B
򌑜
8C45C
򌑝
8C45D
򌑞
8C45E
򌑟
8C45F
90
A0
򌑠
8C460
򌑡
8C461
򌑢
8C462
򌑣
8C463
򌑤
8C464
򌑥
8C465
򌑦
8C466
򌑧
8C467
򌑨
8C468
򌑩
8C469
򌑪
8C46A
򌑫
8C46B
򌑬
8C46C
򌑭
8C46D
򌑮
8C46E
򌑯
8C46F
A0
B0
򌑰
8C470
򌑱
8C471
򌑲
8C472
򌑳
8C473
򌑴
8C474
򌑵
8C475
򌑶
8C476
򌑷
8C477
򌑸
8C478
򌑹
8C479
򌑺
8C47A
򌑻
8C47B
򌑼
8C47C
򌑽
8C47D
򌑾
8C47E
򌑿
8C47F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]