International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B991

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𹑀
39440
𹑁
39441
𹑂
39442
𹑃
39443
𹑄
39444
𹑅
39445
𹑆
39446
𹑇
39447
𹑈
39448
𹑉
39449
𹑊
3944A
𹑋
3944B
𹑌
3944C
𹑍
3944D
𹑎
3944E
𹑏
3944F
80
90
𹑐
39450
𹑑
39451
𹑒
39452
𹑓
39453
𹑔
39454
𹑕
39455
𹑖
39456
𹑗
39457
𹑘
39458
𹑙
39459
𹑚
3945A
𹑛
3945B
𹑜
3945C
𹑝
3945D
𹑞
3945E
𹑟
3945F
90
A0
𹑠
39460
𹑡
39461
𹑢
39462
𹑣
39463
𹑤
39464
𹑥
39465
𹑦
39466
𹑧
39467
𹑨
39468
𹑩
39469
𹑪
3946A
𹑫
3946B
𹑬
3946C
𹑭
3946D
𹑮
3946E
𹑯
3946F
A0
B0
𹑰
39470
𹑱
39471
𹑲
39472
𹑳
39473
𹑴
39474
𹑵
39475
𹑶
39476
𹑷
39477
𹑸
39478
𹑹
39479
𹑺
3947A
𹑻
3947B
𹑼
3947C
𹑽
3947D
𹑾
3947E
𹑿
3947F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]