International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2 89 BA

Each cell gives the code point (hex) for the byte sequence F2 89 BA followed by the final byte formed from the row and column labels. Rows 00 through 70 and C0 through F0 are empty.

      00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  89E80  89E81  89E82  89E83  89E84  89E85  89E86  89E87  89E88  89E89  89E8A  89E8B  89E8C  89E8D  89E8E  89E8F
90  89E90  89E91  89E92  89E93  89E94  89E95  89E96  89E97  89E98  89E99  89E9A  89E9B  89E9C  89E9D  89E9E  89E9F
A0  89EA0  89EA1  89EA2  89EA3  89EA4  89EA5  89EA6  89EA7  89EA8  89EA9  89EAA  89EAB  89EAC  89EAD  89EAE  89EAF
B0  89EB0  89EB1  89EB2  89EB3  89EB4  89EB5  89EB6  89EB7  89EB8  89EB9  89EBA  89EBB  89EBC  89EBD  89EBE  89EBF
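
The mapping from the byte prefix F2 89 BA to code points 89E80 through 89EBF can be checked by hand. The following is a minimal Python sketch, not ICU code. The helper names `decode_utf8_4byte` and `page_code_points` are hypothetical, and the validation is deliberately simplified: it does not reject overlong forms or lead bytes above F4.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point by hand (simplified)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


def page_code_points(prefix: bytes) -> list:
    """Code points for a 3-byte prefix followed by each valid trail byte 80..BF."""
    return [decode_utf8_4byte(prefix + bytes([last])) for last in range(0x80, 0xC0)]


page = page_code_points(bytes.fromhex("F289BA"))
print(f"{page[0]:X} .. {page[-1]:X}")  # 89E80 .. 89EBF
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.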

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
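
One plausible reading of the minimum and maximum above is that a UChar is a 16-bit UTF-16 code unit: a supplementary character takes 4 UTF-8 bytes but also 2 UChars, so no single UChar accounts for more than 3 bytes. The Python sketch below illustrates that arithmetic and the substitution bytes. It does not call ICU, and the helper name `utf8_bytes_and_utf16_units` is hypothetical.

```python
def utf8_bytes_and_utf16_units(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return n_bytes, n_units


# ASCII, Latin-1 Supplement, a BMP symbol, and a supplementary code point.
for ch in ["A", "\u00E9", "\u20AC", "\U00089E80"]:
    n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_units} UTF-16 units")

# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex().upper())  # EFBFBD
```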

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
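
The set above excludes only the surrogate range D800 through DFFF. As a rough check outside ICU, Python's built-in strict UTF-8 codec refuses the same range. The helper name `is_representable` is hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe the boundaries of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```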