International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09290

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𒐀
12400
𒐁
12401
𒐂
12402
𒐃
12403
𒐄
12404
𒐅
12405
𒐆
12406
𒐇
12407
𒐈
12408
𒐉
12409
𒐊
1240A
𒐋
1240B
𒐌
1240C
𒐍
1240D
𒐎
1240E
𒐏
1240F
80
90
𒐐
12410
𒐑
12411
𒐒
12412
𒐓
12413
𒐔
12414
𒐕
12415
𒐖
12416
𒐗
12417
𒐘
12418
𒐙
12419
𒐚
1241A
𒐛
1241B
𒐜
1241C
𒐝
1241D
𒐞
1241E
𒐟
1241F
90
A0
𒐠
12420
𒐡
12421
𒐢
12422
𒐣
12423
𒐤
12424
𒐥
12425
𒐦
12426
𒐧
12427
𒐨
12428
𒐩
12429
𒐪
1242A
𒐫
1242B
𒐬
1242C
𒐭
1242D
𒐮
1242E
𒐯
1242F
A0
B0
𒐰
12430
𒐱
12431
𒐲
12432
𒐳
12433
𒐴
12434
𒐵
12435
𒐶
12436
𒐷
12437
𒐸
12438
𒐹
12439
𒐺
1243A
𒐻
1243B
𒐼
1243C
𒐽
1243D
𒐾
1243E
𒐿
1243F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]