International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
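
To illustrate how an alias list like this one can be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The alias strings are copied from the listing; the `normalize` and `resolve` helpers and the lenient matching rule (ignore case, '-', '_' and spaces) are assumptions made for this example and are not taken from the page.

```python
from typing import Optional

# Alias strings copied from the listing; everything else is illustrative.
CANONICAL_NAME = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical lenient key: lower-case and drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
_LOOKUP = {normalize(alias): CANONICAL_NAME for alias in ALIASES}


def resolve(name: str) -> Optional[str]:
    """Return the internal converter name for a known alias, else None."""
    return _LOOKUP.get(normalize(name))
```

With this table, `resolve("IBM-1208")` and `resolve("cp1208")` both give the internal name, while a name that is not in the listing gives `None`.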


Codepage Layout

Currently showing the codepage starting with the bytes F3B595

Rows give the high nibble of the fourth byte and columns the low nibble; each cell is the code point (hex) of the character encoded.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     F5540 F5541 F5542 F5543 F5544 F5545 F5546 F5547 F5548 F5549 F554A F554B F554C F554D F554E F554F
90     F5550 F5551 F5552 F5553 F5554 F5555 F5556 F5557 F5558 F5559 F555A F555B F555C F555D F555E F555F
A0     F5560 F5561 F5562 F5563 F5564 F5565 F5566 F5567 F5568 F5569 F556A F556B F556C F556D F556E F556F
B0     F5570 F5571 F5572 F5573 F5574 F5575 F5576 F5577 F5578 F5579 F557A F557B F557C F557D F557E F557F
C0-F0  (empty)
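
The layout can be reproduced with any strict UTF-8 decoder: appending a fourth byte to F3 B5 95 yields a character only when that byte is in the range 80 to BF. The following Python sketch shows this; the helper names `decode_cell` and `layout_row` are made up for the example and do not come from the demo.

```python
# The three leading bytes named on the page.
LEAD_BYTES = bytes.fromhex("F3B595")


def decode_cell(trail_byte: int) -> int:
    """Return the code point encoded by F3 B5 95 followed by trail_byte.

    Raises UnicodeDecodeError if the four bytes are not valid UTF-8.
    """
    return ord((LEAD_BYTES + bytes([trail_byte])).decode("utf-8"))


def layout_row(row: int) -> list:
    """Build one 16-cell row of the layout; invalid cells are empty strings."""
    cells = []
    for col in range(16):
        try:
            cells.append(f"{decode_cell(row + col):X}")
        except UnicodeDecodeError:
            cells.append("")  # not a valid final byte after F3 B5 95
    return cells


if __name__ == "__main__":
    for row in range(0x00, 0x100, 0x10):
        print(f"{row:02X}", " ".join(cell or "." for cell in layout_row(row)))
```

Only the rows 80, 90, A0 and B0 come out populated, covering the code points F5540 through F557F, which lie in the Supplementary Private Use Area-A.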

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
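
Two of these values can be checked with a short Python sketch. It assumes that "UChar" means a 16-bit UTF-16 code unit, which is how the minimum of 1 and maximum of 3 bytes fit with four-byte UTF-8 sequences: a four-byte sequence corresponds to two UTF-16 code units, so two bytes per unit. The helper name `bytes_per_utf16_unit` is made up for the example.

```python
# Substitution character bytes as listed in the table.
SUBSTITUTION = b"\xEF\xBF\xBD"


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of ch divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


if __name__ == "__main__":
    # The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
    print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
    for sample in ("A", "\u00e9", "\u20ac", chr(0xF5540)):
        print(f"U+{ord(sample):04X}", bytes_per_utf16_unit(sample))
```

Under this reading, ASCII gives the minimum of 1, characters from U+0800 to U+FFFF give the maximum of 3, and supplementary characters such as U+F5540 give 2.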

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
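
The set above reads as "every code point except the surrogate range D800 to DFFF". As a rough check, the Python sketch below compares that rule with what Python's own strict UTF-8 encoder accepts; it uses Python's codec, not ICU's converter, and both function names are hypothetical.

```python
def is_representable(code_point: int) -> bool:
    """Membership test for the set [^\\uD800-\\uDFFF]."""
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_as_utf8(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the surrogate block, plus both ends of the range.
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xF5540, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), encodes_as_utf8(cp))
```

The two functions agree on each boundary value: only the surrogate code points are rejected.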