International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A2BA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󢺀
E2E80
󢺁
E2E81
󢺂
E2E82
󢺃
E2E83
󢺄
E2E84
󢺅
E2E85
󢺆
E2E86
󢺇
E2E87
󢺈
E2E88
󢺉
E2E89
󢺊
E2E8A
󢺋
E2E8B
󢺌
E2E8C
󢺍
E2E8D
󢺎
E2E8E
󢺏
E2E8F
80
90
󢺐
E2E90
󢺑
E2E91
󢺒
E2E92
󢺓
E2E93
󢺔
E2E94
󢺕
E2E95
󢺖
E2E96
󢺗
E2E97
󢺘
E2E98
󢺙
E2E99
󢺚
E2E9A
󢺛
E2E9B
󢺜
E2E9C
󢺝
E2E9D
󢺞
E2E9E
󢺟
E2E9F
90
A0
󢺠
E2EA0
󢺡
E2EA1
󢺢
E2EA2
󢺣
E2EA3
󢺤
E2EA4
󢺥
E2EA5
󢺦
E2EA6
󢺧
E2EA7
󢺨
E2EA8
󢺩
E2EA9
󢺪
E2EAA
󢺫
E2EAB
󢺬
E2EAC
󢺭
E2EAD
󢺮
E2EAE
󢺯
E2EAF
A0
B0
󢺰
E2EB0
󢺱
E2EB1
󢺲
E2EB2
󢺳
E2EB3
󢺴
E2EB4
󢺵
E2EB5
󢺶
E2EB6
󢺷
E2EB7
󢺸
E2EB8
󢺹
E2EB9
󢺺
E2EBA
󢺻
E2EBB
󢺼
E2EBC
󢺽
E2EBD
󢺾
E2EBE
󢺿
E2EBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]