International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18090

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񀐀
40400
񀐁
40401
񀐂
40402
񀐃
40403
񀐄
40404
񀐅
40405
񀐆
40406
񀐇
40407
񀐈
40408
񀐉
40409
񀐊
4040A
񀐋
4040B
񀐌
4040C
񀐍
4040D
񀐎
4040E
񀐏
4040F
80
90
񀐐
40410
񀐑
40411
񀐒
40412
񀐓
40413
񀐔
40414
񀐕
40415
񀐖
40416
񀐗
40417
񀐘
40418
񀐙
40419
񀐚
4041A
񀐛
4041B
񀐜
4041C
񀐝
4041D
񀐞
4041E
񀐟
4041F
90
A0
񀐠
40420
񀐡
40421
񀐢
40422
񀐣
40423
񀐤
40424
񀐥
40425
񀐦
40426
񀐧
40427
񀐨
40428
񀐩
40429
񀐪
4042A
񀐫
4042B
񀐬
4042C
񀐭
4042D
񀐮
4042E
񀐯
4042F
A0
B0
񀐰
40430
񀐱
40431
񀐲
40432
񀐳
40433
񀐴
40434
񀐵
40435
񀐶
40436
񀐷
40437
񀐸
40438
񀐹
40439
񀐺
4043A
񀐻
4043B
񀐼
4043C
񀐽
4043D
񀐾
4043E
񀐿
4043F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]