International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F292BA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򒺀
92E80
򒺁
92E81
򒺂
92E82
򒺃
92E83
򒺄
92E84
򒺅
92E85
򒺆
92E86
򒺇
92E87
򒺈
92E88
򒺉
92E89
򒺊
92E8A
򒺋
92E8B
򒺌
92E8C
򒺍
92E8D
򒺎
92E8E
򒺏
92E8F
80
90
򒺐
92E90
򒺑
92E91
򒺒
92E92
򒺓
92E93
򒺔
92E94
򒺕
92E95
򒺖
92E96
򒺗
92E97
򒺘
92E98
򒺙
92E99
򒺚
92E9A
򒺛
92E9B
򒺜
92E9C
򒺝
92E9D
򒺞
92E9E
򒺟
92E9F
90
A0
򒺠
92EA0
򒺡
92EA1
򒺢
92EA2
򒺣
92EA3
򒺤
92EA4
򒺥
92EA5
򒺦
92EA6
򒺧
92EA7
򒺨
92EA8
򒺩
92EA9
򒺪
92EAA
򒺫
92EAB
򒺬
92EAC
򒺭
92EAD
򒺮
92EAE
򒺯
92EAF
A0
B0
򒺰
92EB0
򒺱
92EB1
򒺲
92EB2
򒺳
92EB3
򒺴
92EB4
򒺵
92EB5
򒺶
92EB6
򒺷
92EB7
򒺸
92EB8
򒺹
92EB9
򒺺
92EBA
򒺻
92EBB
򒺼
92EBC
򒺽
92EBD
򒺾
92EBE
򒺿
92EBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]