International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39690

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󖐀
D6400
󖐁
D6401
󖐂
D6402
󖐃
D6403
󖐄
D6404
󖐅
D6405
󖐆
D6406
󖐇
D6407
󖐈
D6408
󖐉
D6409
󖐊
D640A
󖐋
D640B
󖐌
D640C
󖐍
D640D
󖐎
D640E
󖐏
D640F
80
90
󖐐
D6410
󖐑
D6411
󖐒
D6412
󖐓
D6413
󖐔
D6414
󖐕
D6415
󖐖
D6416
󖐗
D6417
󖐘
D6418
󖐙
D6419
󖐚
D641A
󖐛
D641B
󖐜
D641C
󖐝
D641D
󖐞
D641E
󖐟
D641F
90
A0
󖐠
D6420
󖐡
D6421
󖐢
D6422
󖐣
D6423
󖐤
D6424
󖐥
D6425
󖐦
D6426
󖐧
D6427
󖐨
D6428
󖐩
D6429
󖐪
D642A
󖐫
D642B
󖐬
D642C
󖐭
D642D
󖐮
D642E
󖐯
D642F
A0
B0
󖐰
D6430
󖐱
D6431
󖐲
D6432
󖐳
D6433
󖐴
D6434
󖐵
D6435
󖐶
D6436
󖐷
D6437
󖐸
D6438
󖐹
D6439
󖐺
D643A
󖐻
D643B
󖐼
D643C
󖐽
D643D
󖐾
D643E
󖐿
D643F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]