International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A891

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󨑀
E8440
󨑁
E8441
󨑂
E8442
󨑃
E8443
󨑄
E8444
󨑅
E8445
󨑆
E8446
󨑇
E8447
󨑈
E8448
󨑉
E8449
󨑊
E844A
󨑋
E844B
󨑌
E844C
󨑍
E844D
󨑎
E844E
󨑏
E844F
80
90
󨑐
E8450
󨑑
E8451
󨑒
E8452
󨑓
E8453
󨑔
E8454
󨑕
E8455
󨑖
E8456
󨑗
E8457
󨑘
E8458
󨑙
E8459
󨑚
E845A
󨑛
E845B
󨑜
E845C
󨑝
E845D
󨑞
E845E
󨑟
E845F
90
A0
󨑠
E8460
󨑡
E8461
󨑢
E8462
󨑣
E8463
󨑤
E8464
󨑥
E8465
󨑦
E8466
󨑧
E8467
󨑨
E8468
󨑩
E8469
󨑪
E846A
󨑫
E846B
󨑬
E846C
󨑭
E846D
󨑮
E846E
󨑯
E846F
A0
B0
󨑰
E8470
󨑱
E8471
󨑲
E8472
󨑳
E8473
󨑴
E8474
󨑵
E8475
󨑶
E8476
󨑷
E8477
󨑸
E8478
󨑹
E8479
󨑺
E847A
󨑻
E847B
󨑼
E847C
󨑽
E847D
󨑾
E847E
󨑿
E847F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]