International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38188

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C1200 C1201 C1202 C1203 C1204 C1205 C1206 C1207 C1208 C1209 C120A C120B C120C C120D C120E C120F
90  C1210 C1211 C1212 C1213 C1214 C1215 C1216 C1217 C1218 C1219 C121A C121B C121C C121D C121E C121F
A0  C1220 C1221 C1222 C1223 C1224 C1225 C1226 C1227 C1228 C1229 C122A C122B C122C C122D C122E C122F
B0  C1230 C1231 C1232 C1233 C1234 C1235 C1236 C1237 C1238 C1239 C123A C123B C123C C123D C123E C123F
C0
D0
E0
F0
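
The layout above can be cross-checked outside the demo. The following is a minimal Python sketch, not part of ICU: it appends each possible trail byte to the lead bytes F3 81 88 and decodes the result with Python's built-in UTF-8 codec, used here as a stand-in for the ICU converter. The helper name `codepage_row` is hypothetical.

```python
# Lead bytes shown in the Codepage Layout section.
PREFIX = bytes.fromhex("F38188")

def codepage_row(row_start):
    """Return the code points (hex strings) for one 16-cell row of the grid.

    Hypothetical helper for illustration only. Cells whose trail byte does
    not complete a valid UTF-8 sequence are skipped, so rows outside
    80..BF come back empty.
    """
    cells = []
    for low in range(16):
        seq = PREFIX + bytes([row_start + low])
        try:
            cells.append(f"{ord(seq.decode('utf-8')):X}")
        except UnicodeDecodeError:
            pass  # not a valid trail byte; the cell stays empty
    return cells

# Print the grid row by row, labelled with the high nibble of the trail byte.
for row in range(0x00, 0x100, 0x10):
    print(f"{row:02X}", " ".join(codepage_row(row)))
```

Only trail bytes 80 through BF are valid UTF-8 continuation bytes, which is why the other rows of the grid are empty.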

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
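
The byte counts and the substitution character listed above can be illustrated with Python's built-in codecs rather than ICU itself. This is a sketch under the assumption that a UChar is one UTF-16 code unit, as it is in ICU; the helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character.

    Hypothetical helper; assumes a UChar is one 16-bit UTF-16 code unit.
    """
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

print(bytes_per_uchar("A"))           # ASCII: 1 byte for 1 code unit
print(bytes_per_uchar("\u20AC"))      # BMP character: 3 bytes for 1 code unit
print(bytes_per_uchar("\U000C1200"))  # supplementary: 4 bytes for 2 code units

# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes EF BF BD.
print("\uFFFD".encode("utf-8"))

# Python's decoder substitutes U+FFFD for malformed input when asked to.
print(b"\xff".decode("utf-8", errors="replace") == "\uFFFD")
```

A supplementary character needs four bytes but occupies two UTF-16 code units, so the per-UChar maximum of 3 is reached by BMP characters, not by four-byte sequences.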

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
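
The set above excludes only the surrogate code points U+D800 through U+DFFF. As a minimal sketch, Python's strict UTF-8 codec (an assumption: it stands in for the ICU converter here) shows the same boundary. The helper name `is_representable` is hypothetical.

```python
def is_representable(code_point):
    """True if Python's strict UTF-8 codec can encode the code point.

    Hypothetical helper for illustration; not an ICU API.
    """
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Probe the edges of the surrogate range and the ends of the code space.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```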