International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
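
Alias resolution can be pictured as a table keyed by a normalized name. The sketch below is illustrative Python, not ICU code: `ALIASES`, `normalize`, `LOOKUP`, and `canonical_name` are hypothetical names, and the normalization rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption; ICU's own matching rules may differ.

```python
# Aliases copied from the list above; all map to the internal name "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed loose-matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Hypothetical lookup table: normalized alias -> internal converter name.
LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}

def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))

print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```

Under this assumed rule, spellings such as `utf8` or `Windows_65001` resolve to the same entry as the listed aliases.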


Codepage Layout

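Currently showing the codepage for four-byte sequences starting with the bytes F3 A7 87. The row and column headers together give the final byte of each sequence.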

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  E71C0 E71C1 E71C2 E71C3 E71C4 E71C5 E71C6 E71C7 E71C8 E71C9 E71CA E71CB E71CC E71CD E71CE E71CF
90  E71D0 E71D1 E71D2 E71D3 E71D4 E71D5 E71D6 E71D7 E71D8 E71D9 E71DA E71DB E71DC E71DD E71DE E71DF
A0  E71E0 E71E1 E71E2 E71E3 E71E4 E71E5 E71E6 E71E7 E71E8 E71E9 E71EA E71EB E71EC E71ED E71EE E71EF
B0  E71F0 E71F1 E71F2 E71F3 E71F4 E71F5 E71F6 E71F7 E71F8 E71F9 E71FA E71FB E71FC E71FD E71FE E71FF
C0
D0
E0
F0
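
The code points in the grid follow from the UTF-8 bit layout: a four-byte sequence takes 3 bits from the lead byte and 6 bits from each of the three trail bytes. A minimal sketch in Python (standard library only, not ICU; `decode4` and `PREFIX` are hypothetical names) that reproduces the populated rows and cross-checks them against Python's own decoder:

```python
def decode4(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode a 4-byte UTF-8 sequence by hand (no validity checks)."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)

PREFIX = (0xF3, 0xA7, 0x87)

# Print one row per high nibble of the final byte, 80 through B0.
for row in range(0x80, 0xC0, 0x10):
    cells = [f"{decode4(*PREFIX, row + col):05X}" for col in range(16)]
    print(f"{row:02X}  " + " ".join(cells))

# Cross-check the hand decoding against Python's built-in UTF-8 codec.
for trail in range(0x80, 0xC0):
    text = bytes([*PREFIX, trail]).decode("utf-8")
    assert ord(text) == decode4(*PREFIX, trail)
```

Rows 00 through 70 and C0 through F0 stay empty because those final bytes do not have the `10xxxxxx` continuation-byte pattern, so they cannot complete the sequence.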

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
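
The byte counts above are per UChar, that is, per UTF-16 code unit. That would explain why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to a surrogate pair (2 UChars), or 2 bytes per UChar. The Python sketch below illustrates this with the standard library only; it is not ICU code, and `utf16_units`, `samples`, and `SUBSTITUTION` are hypothetical names.

```python
def utf16_units(s: str) -> int:
    """Number of UTF-16 code units (UChars) needed to represent s."""
    return len(s.encode("utf-16-le")) // 2

# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U000E71C0"]
for s in samples:
    n_bytes = len(s.encode("utf-8"))
    n_units = utf16_units(s)
    print(f"U+{ord(s):04X}: {n_bytes} UTF-8 byte(s), {n_units} UChar(s)")

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = b"\xEF\xBF\xBD"
print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
```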

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
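
The set expression excludes only the surrogate range U+D800 to U+DFFF, so every other Unicode code point is representable. A hedged Python sketch of the same membership test follows. `is_representable` and `encodes_in_python` are hypothetical helpers; Python's built-in UTF-8 codec is used only as a cross-check and is not the ICU converter.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside U+D800..U+DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def encodes_in_python(cp: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Probe both edges of the surrogate range and a few other code points.
for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE71C0, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), encodes_in_python(cp))
```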