International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B890

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󸐀
F8400
󸐁
F8401
󸐂
F8402
󸐃
F8403
󸐄
F8404
󸐅
F8405
󸐆
F8406
󸐇
F8407
󸐈
F8408
󸐉
F8409
󸐊
F840A
󸐋
F840B
󸐌
F840C
󸐍
F840D
󸐎
F840E
󸐏
F840F
80
90
󸐐
F8410
󸐑
F8411
󸐒
F8412
󸐓
F8413
󸐔
F8414
󸐕
F8415
󸐖
F8416
󸐗
F8417
󸐘
F8418
󸐙
F8419
󸐚
F841A
󸐛
F841B
󸐜
F841C
󸐝
F841D
󸐞
F841E
󸐟
F841F
90
A0
󸐠
F8420
󸐡
F8421
󸐢
F8422
󸐣
F8423
󸐤
F8424
󸐥
F8425
󸐦
F8426
󸐧
F8427
󸐨
F8428
󸐩
F8429
󸐪
F842A
󸐫
F842B
󸐬
F842C
󸐭
F842D
󸐮
F842E
󸐯
F842F
A0
B0
󸐰
F8430
󸐱
F8431
󸐲
F8432
󸐳
F8433
󸐴
F8434
󸐵
F8435
󸐶
F8436
󸐷
F8437
󸐸
F8438
󸐹
F8439
󸐺
F843A
󸐻
F843B
󸐼
F843C
󸐽
F843D
󸐾
F843E
󸐿
F843F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]