International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F19290

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񒐀
52400
񒐁
52401
񒐂
52402
񒐃
52403
񒐄
52404
񒐅
52405
񒐆
52406
񒐇
52407
񒐈
52408
񒐉
52409
񒐊
5240A
񒐋
5240B
񒐌
5240C
񒐍
5240D
񒐎
5240E
񒐏
5240F
80
90
񒐐
52410
񒐑
52411
񒐒
52412
񒐓
52413
񒐔
52414
񒐕
52415
񒐖
52416
񒐗
52417
񒐘
52418
񒐙
52419
񒐚
5241A
񒐛
5241B
񒐜
5241C
񒐝
5241D
񒐞
5241E
񒐟
5241F
90
A0
񒐠
52420
񒐡
52421
񒐢
52422
񒐣
52423
񒐤
52424
񒐥
52425
񒐦
52426
񒐧
52427
񒐨
52428
񒐩
52429
񒐪
5242A
񒐫
5242B
񒐬
5242C
񒐭
5242D
񒐮
5242E
񒐯
5242F
A0
B0
񒐰
52430
񒐱
52431
񒐲
52432
񒐳
52433
񒐴
52434
񒐵
52435
񒐶
52436
񒐷
52437
񒐸
52438
񒐹
52439
񒐺
5243A
񒐻
5243B
񒐼
5243C
񒐽
5243D
񒐾
5243E
񒐿
5243F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]