International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2B189

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򱉀
B1240
򱉁
B1241
򱉂
B1242
򱉃
B1243
򱉄
B1244
򱉅
B1245
򱉆
B1246
򱉇
B1247
򱉈
B1248
򱉉
B1249
򱉊
B124A
򱉋
B124B
򱉌
B124C
򱉍
B124D
򱉎
B124E
򱉏
B124F
80
90
򱉐
B1250
򱉑
B1251
򱉒
B1252
򱉓
B1253
򱉔
B1254
򱉕
B1255
򱉖
B1256
򱉗
B1257
򱉘
B1258
򱉙
B1259
򱉚
B125A
򱉛
B125B
򱉜
B125C
򱉝
B125D
򱉞
B125E
򱉟
B125F
90
A0
򱉠
B1260
򱉡
B1261
򱉢
B1262
򱉣
B1263
򱉤
B1264
򱉥
B1265
򱉦
B1266
򱉧
B1267
򱉨
B1268
򱉩
B1269
򱉪
B126A
򱉫
B126B
򱉬
B126C
򱉭
B126D
򱉮
B126E
򱉯
B126F
A0
B0
򱉰
B1270
򱉱
B1271
򱉲
B1272
򱉳
B1273
򱉴
B1274
򱉵
B1275
򱉶
B1276
򱉷
B1277
򱉸
B1278
򱉹
B1279
򱉺
B127A
򱉻
B127B
򱉼
B127C
򱉽
B127D
򱉾
B127E
򱉿
B127F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]