International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2B695

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򶕀
B6540
򶕁
B6541
򶕂
B6542
򶕃
B6543
򶕄
B6544
򶕅
B6545
򶕆
B6546
򶕇
B6547
򶕈
B6548
򶕉
B6549
򶕊
B654A
򶕋
B654B
򶕌
B654C
򶕍
B654D
򶕎
B654E
򶕏
B654F
80
90
򶕐
B6550
򶕑
B6551
򶕒
B6552
򶕓
B6553
򶕔
B6554
򶕕
B6555
򶕖
B6556
򶕗
B6557
򶕘
B6558
򶕙
B6559
򶕚
B655A
򶕛
B655B
򶕜
B655C
򶕝
B655D
򶕞
B655E
򶕟
B655F
90
A0
򶕠
B6560
򶕡
B6561
򶕢
B6562
򶕣
B6563
򶕤
B6564
򶕥
B6565
򶕦
B6566
򶕧
B6567
򶕨
B6568
򶕩
B6569
򶕪
B656A
򶕫
B656B
򶕬
B656C
򶕭
B656D
򶕮
B656E
򶕯
B656F
A0
B0
򶕰
B6570
򶕱
B6571
򶕲
B6572
򶕳
B6573
򶕴
B6574
򶕵
B6575
򶕶
B6576
򶕷
B6577
򶕸
B6578
򶕹
B6579
򶕺
B657A
򶕻
B657B
򶕼
B657C
򶕽
B657D
򶕾
B657E
򶕿
B657F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]