International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A086

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
 󠆀
E0180
 󠆁
E0181
 󠆂
E0182
 󠆃
E0183
 󠆄
E0184
 󠆅
E0185
 󠆆
E0186
 󠆇
E0187
 󠆈
E0188
 󠆉
E0189
 󠆊
E018A
 󠆋
E018B
 󠆌
E018C
 󠆍
E018D
 󠆎
E018E
 󠆏
E018F
80
90
 󠆐
E0190
 󠆑
E0191
 󠆒
E0192
 󠆓
E0193
 󠆔
E0194
 󠆕
E0195
 󠆖
E0196
 󠆗
E0197
 󠆘
E0198
 󠆙
E0199
 󠆚
E019A
 󠆛
E019B
 󠆜
E019C
 󠆝
E019D
 󠆞
E019E
 󠆟
E019F
90
A0
 󠆠
E01A0
 󠆡
E01A1
 󠆢
E01A2
 󠆣
E01A3
 󠆤
E01A4
 󠆥
E01A5
 󠆦
E01A6
 󠆧
E01A7
 󠆨
E01A8
 󠆩
E01A9
 󠆪
E01AA
 󠆫
E01AB
 󠆬
E01AC
 󠆭
E01AD
 󠆮
E01AE
 󠆯
E01AF
A0
B0
 󠆰
E01B0
 󠆱
E01B1
 󠆲
E01B2
 󠆳
E01B3
 󠆴
E01B4
 󠆵
E01B5
 󠆶
E01B6
 󠆷
E01B7
 󠆸
E01B8
 󠆹
E01B9
 󠆺
E01BA
 󠆻
E01BB
 󠆼
E01BC
 󠆽
E01BD
 󠆾
E01BE
 󠆿
E01BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]