International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B690

Each cell gives the Unicode code point (hex) of the character whose final byte is the row value plus the column value. All 64 characters lie in a private-use plane and have no standard glyph. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  F6400 F6401 F6402 F6403 F6404 F6405 F6406 F6407 F6408 F6409 F640A F640B F640C F640D F640E F640F
90  F6410 F6411 F6412 F6413 F6414 F6415 F6416 F6417 F6418 F6419 F641A F641B F641C F641D F641E F641F
A0  F6420 F6421 F6422 F6423 F6424 F6425 F6426 F6427 F6428 F6429 F642A F642B F642C F642D F642E F642F
B0  F6430 F6431 F6432 F6433 F6434 F6435 F6436 F6437 F6438 F6439 F643A F643B F643C F643D F643E F643F
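
The layout above can be reproduced outside ICU. The following is a minimal sketch that uses Python's built-in UTF-8 codec, not the ICU converter itself; the names `LEAD_BYTES`, `cell_code_point`, and `cell_code_point_by_hand` are made up for this illustration. It decodes the three lead bytes F3 B6 90 followed by one trailing byte, and cross-checks the result against the bit layout of a four-byte UTF-8 sequence.

```python
# Lead bytes taken from the "Codepage Layout" heading (F3B690).
LEAD_BYTES = bytes.fromhex("F3B690")


def cell_code_point(trail: int) -> int:
    """Decode LEAD_BYTES + trail with Python's UTF-8 codec and return the code point."""
    if not 0x80 <= trail <= 0xBF:
        # Only continuation bytes 80..BF can end a UTF-8 sequence,
        # which is why rows 00-70 and C0-F0 of the layout are empty.
        raise ValueError("UTF-8 trailing bytes must be in the range 80..BF")
    return ord((LEAD_BYTES + bytes([trail])).decode("utf-8"))


def cell_code_point_by_hand(trail: int) -> int:
    """Same value computed from the bit layout 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx."""
    if not 0x80 <= trail <= 0xBF:
        raise ValueError("UTF-8 trailing bytes must be in the range 80..BF")
    b0, b1, b2 = LEAD_BYTES
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


if __name__ == "__main__":
    # Print the four populated rows (80, 90, A0, B0) as hex code points.
    for row in range(0x80, 0xC0, 0x10):
        cells = " ".join(f"{cell_code_point(row + col):X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

For example, `cell_code_point(0x80)` corresponds to the first populated cell (row 80, column 00) and `cell_code_point(0xBF)` to the last (row B0, column 0F).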

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
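
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence still fits the stated maximum because it spans two UChars. The sketch below illustrates this with Python's standard codecs rather than ICU; `utf8_bytes_per_uchar` and `SUBSTITUTION` are names invented for the example. It also checks that U+FFFD encodes to the substitution bytes listed above.

```python
# U+FFFD REPLACEMENT CHARACTER; its UTF-8 form is the substitution
# character reported for this converter (\xEF\xBF\xBD).
SUBSTITUTION = "\uFFFD"


def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 1 for BMP, 2 for supplementary
    return len(ch.encode("utf-8")) / utf16_units


if __name__ == "__main__":
    for ch in ("A", "\u00E9", "\uFFFF", "\U000F6400"):
        print(f"U+{ord(ch):04X}: {utf8_bytes_per_uchar(ch)} bytes per UChar")
    print(SUBSTITUTION.encode("utf-8"))
```

An ASCII character gives the minimum of 1, a character near the top of the Basic Multilingual Plane gives the maximum of 3, and a supplementary character such as U+F6400 averages 2 (four bytes over two code units).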


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
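
The set above reads as "every Unicode code point except the surrogate range D800 to DFFF". As a rough cross-check, and assuming Python's strict UTF-8 codec as a stand-in for the ICU converter, the sketch below compares that set with what the codec will actually encode. The helper names `in_listed_set` and `python_can_encode` are hypothetical.

```python
def in_listed_set(code_point: int) -> bool:
    """Membership test for the listed set: any code point outside D800..DFFF."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    return not 0xD800 <= code_point <= 0xDFFF


def python_can_encode(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict UTF-8 codec.
        return False


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xF6400, 0x10FFFF):
        print(f"U+{cp:04X}: listed={in_listed_set(cp)} encodable={python_can_encode(cp)}")
```

The two functions agree on every code point near the boundaries, including the private-use characters shown in the codepage layout.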