International Components for Unicode


UTF-8


List of Converter Aliases

Internal converter name: UTF-8

Standard          Aliases
UTR22             (none)
IBM               ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593
WINDOWS           windows-65001, UTF-8
JAVA              UTF-8
IANA              UTF-8
MIME              UTF-8
Untagged aliases  cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
All aliases       UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
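
The alias list can be read as a many-to-one lookup from alias to internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation. The helper names `normalize` and `canonical_name` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores and spaces) is an assumption made for illustration.

```python
import re

# Aliases copied from the list above; all of them name the converter "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name):
    """Lowercase and drop '-', '_' and spaces (assumed loose-matching rule)."""
    return re.sub(r"[-_ ]", "", name).lower()

# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def canonical_name(name):
    """Return 'UTF-8' if the name matches a listed alias, otherwise None."""
    return ALIAS_TO_CANONICAL.get(normalize(name))

print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin-1"))        # None
```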


Codepage Layout

Currently showing the codepage starting with the bytes F3 AE AF

Rows give the high nibble of the final byte and columns give the low nibble. Each cell is the Unicode code point (hex) of the resulting four-byte sequence. Rows 00-70 and C0-F0 contain no mappings.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  EEBC0 EEBC1 EEBC2 EEBC3 EEBC4 EEBC5 EEBC6 EEBC7 EEBC8 EEBC9 EEBCA EEBCB EEBCC EEBCD EEBCE EEBCF
90  EEBD0 EEBD1 EEBD2 EEBD3 EEBD4 EEBD5 EEBD6 EEBD7 EEBD8 EEBD9 EEBDA EEBDB EEBDC EEBDD EEBDE EEBDF
A0  EEBE0 EEBE1 EEBE2 EEBE3 EEBE4 EEBE5 EEBE6 EEBE7 EEBE8 EEBE9 EEBEA EEBEB EEBEC EEBED EEBEE EEBEF
B0  EEBF0 EEBF1 EEBF2 EEBF3 EEBF4 EEBF5 EEBF6 EEBF7 EEBF8 EEBF9 EEBFA EEBFB EEBFC EEBFD EEBFE EEBFF
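
The mapping in the layout above follows from the UTF-8 bit layout for four-byte sequences. As a rough sketch using Python's built-in codec rather than ICU, the code below decodes the lead bytes F3 AE AF followed by a final byte, and also does the same bit arithmetic by hand. The function names are made up for this example, and `manual_decode` performs no validation.

```python
LEAD = bytes([0xF3, 0xAE, 0xAF])  # lead bytes shown in the layout above

def code_point_for(trail):
    """Decode LEAD + trail as UTF-8 and return the resulting code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))

def manual_decode(seq):
    """Decode one 4-byte UTF-8 sequence by bit arithmetic (no validation)."""
    return (((seq[0] & 0x07) << 18) | ((seq[1] & 0x3F) << 12)
            | ((seq[2] & 0x3F) << 6) | (seq[3] & 0x3F))

for trail in (0x80, 0x8F, 0xBF):
    cp = code_point_for(trail)
    assert cp == manual_decode(LEAD + bytes([trail]))
    print(f"{trail:02X} -> U+{cp:05X}")
# 80 -> U+EEBC0
# 8F -> U+EEBCF
# BF -> U+EEBFF

# Final bytes outside 80..BF are not continuation bytes, so they do not decode.
try:
    code_point_for(0x41)
except UnicodeDecodeError:
    print("41 -> no mapping")
```

Only final bytes in the range 80 to BF carry the six payload bits of a continuation byte, which is why the other rows of the layout are empty.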

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
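
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a UTF-8 sequence can be four bytes long: a four-byte sequence corresponds to two UChars. The sketch below checks this with Python's built-in codecs, not ICU. `bytes_per_uchar` is a hypothetical helper written for this illustration.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2
    return utf8_len / uchar_count

# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
for ch in ("A", "\u00E9", "\u20AC", "\U000EEBC0"):
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
# U+0041: 1.0 bytes per UChar
# U+00E9: 2.0 bytes per UChar
# U+20AC: 3.0 bytes per UChar
# U+EEBC0: 2.0 bytes per UChar

# The substitution bytes listed above are the UTF-8 encoding of U+FFFD.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"
```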

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
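
The set above excludes only the surrogate range U+D800 to U+DFFF. A small sketch of that membership test follows, cross-checked against Python's strict UTF-8 encoder (an independent implementation, used here only as a sanity check). Both function names are hypothetical.

```python
def is_representable(cp):
    """True if cp is a valid code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def encodes_strictly(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Boundary values around the surrogate range, plus the ends of the code space.
for cp in (0x0000, 0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xEEBC0, 0x10FFFF):
    assert is_representable(cp) == encodes_strictly(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```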