International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A192

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򡒀
A1480
򡒁
A1481
򡒂
A1482
򡒃
A1483
򡒄
A1484
򡒅
A1485
򡒆
A1486
򡒇
A1487
򡒈
A1488
򡒉
A1489
򡒊
A148A
򡒋
A148B
򡒌
A148C
򡒍
A148D
򡒎
A148E
򡒏
A148F
80
90
򡒐
A1490
򡒑
A1491
򡒒
A1492
򡒓
A1493
򡒔
A1494
򡒕
A1495
򡒖
A1496
򡒗
A1497
򡒘
A1498
򡒙
A1499
򡒚
A149A
򡒛
A149B
򡒜
A149C
򡒝
A149D
򡒞
A149E
򡒟
A149F
90
A0
򡒠
A14A0
򡒡
A14A1
򡒢
A14A2
򡒣
A14A3
򡒤
A14A4
򡒥
A14A5
򡒦
A14A6
򡒧
A14A7
򡒨
A14A8
򡒩
A14A9
򡒪
A14AA
򡒫
A14AB
򡒬
A14AC
򡒭
A14AD
򡒮
A14AE
򡒯
A14AF
A0
B0
򡒰
A14B0
򡒱
A14B1
򡒲
A14B2
򡒳
A14B3
򡒴
A14B4
򡒵
A14B5
򡒶
A14B6
򡒷
A14B7
򡒸
A14B8
򡒹
A14B9
򡒺
A14BA
򡒻
A14BB
򡒼
A14BC
򡒽
A14BD
򡒾
A14BE
򡒿
A14BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]