International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F385BA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󅺀
C5E80
󅺁
C5E81
󅺂
C5E82
󅺃
C5E83
󅺄
C5E84
󅺅
C5E85
󅺆
C5E86
󅺇
C5E87
󅺈
C5E88
󅺉
C5E89
󅺊
C5E8A
󅺋
C5E8B
󅺌
C5E8C
󅺍
C5E8D
󅺎
C5E8E
󅺏
C5E8F
80
90
󅺐
C5E90
󅺑
C5E91
󅺒
C5E92
󅺓
C5E93
󅺔
C5E94
󅺕
C5E95
󅺖
C5E96
󅺗
C5E97
󅺘
C5E98
󅺙
C5E99
󅺚
C5E9A
󅺛
C5E9B
󅺜
C5E9C
󅺝
C5E9D
󅺞
C5E9E
󅺟
C5E9F
90
A0
󅺠
C5EA0
󅺡
C5EA1
󅺢
C5EA2
󅺣
C5EA3
󅺤
C5EA4
󅺥
C5EA5
󅺦
C5EA6
󅺧
C5EA7
󅺨
C5EA8
󅺩
C5EA9
󅺪
C5EAA
󅺫
C5EAB
󅺬
C5EAC
󅺭
C5EAD
󅺮
C5EAE
󅺯
C5EAF
A0
B0
󅺰
C5EB0
󅺱
C5EB1
󅺲
C5EB2
󅺳
C5EB3
󅺴
C5EB4
󅺵
C5EB5
󅺶
C5EB6
󅺷
C5EB7
󅺸
C5EB8
󅺹
C5EB9
󅺺
C5EBA
󅺻
C5EBB
󅺼
C5EBC
󅺽
C5EBD
󅺾
C5EBE
󅺿
C5EBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]