International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48386

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􃆀
103180
􃆁
103181
􃆂
103182
􃆃
103183
􃆄
103184
􃆅
103185
􃆆
103186
􃆇
103187
􃆈
103188
􃆉
103189
􃆊
10318A
􃆋
10318B
􃆌
10318C
􃆍
10318D
􃆎
10318E
􃆏
10318F
80
90
􃆐
103190
􃆑
103191
􃆒
103192
􃆓
103193
􃆔
103194
􃆕
103195
􃆖
103196
􃆗
103197
􃆘
103198
􃆙
103199
􃆚
10319A
􃆛
10319B
􃆜
10319C
􃆝
10319D
􃆞
10319E
􃆟
10319F
90
A0
􃆠
1031A0
􃆡
1031A1
􃆢
1031A2
􃆣
1031A3
􃆤
1031A4
􃆥
1031A5
􃆦
1031A6
􃆧
1031A7
􃆨
1031A8
􃆩
1031A9
􃆪
1031AA
􃆫
1031AB
􃆬
1031AC
􃆭
1031AD
􃆮
1031AE
􃆯
1031AF
A0
B0
􃆰
1031B0
􃆱
1031B1
􃆲
1031B2
􃆳
1031B3
􃆴
1031B4
􃆵
1031B5
􃆶
1031B6
􃆷
1031B7
􃆸
1031B8
􃆹
1031B9
􃆺
1031BA
􃆻
1031BB
􃆼
1031BC
􃆽
1031BD
􃆾
1031BE
􃆿
1031BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]