International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F299A7

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򙧀
999C0
򙧁
999C1
򙧂
999C2
򙧃
999C3
򙧄
999C4
򙧅
999C5
򙧆
999C6
򙧇
999C7
򙧈
999C8
򙧉
999C9
򙧊
999CA
򙧋
999CB
򙧌
999CC
򙧍
999CD
򙧎
999CE
򙧏
999CF
80
90
򙧐
999D0
򙧑
999D1
򙧒
999D2
򙧓
999D3
򙧔
999D4
򙧕
999D5
򙧖
999D6
򙧗
999D7
򙧘
999D8
򙧙
999D9
򙧚
999DA
򙧛
999DB
򙧜
999DC
򙧝
999DD
򙧞
999DE
򙧟
999DF
90
A0
򙧠
999E0
򙧡
999E1
򙧢
999E2
򙧣
999E3
򙧤
999E4
򙧥
999E5
򙧦
999E6
򙧧
999E7
򙧨
999E8
򙧩
999E9
򙧪
999EA
򙧫
999EB
򙧬
999EC
򙧭
999ED
򙧮
999EE
򙧯
999EF
A0
B0
򙧰
999F0
򙧱
999F1
򙧲
999F2
򙧳
999F3
򙧴
999F4
򙧵
999F5
򙧶
999F6
򙧷
999F7
򙧸
999F8
򙧹
999F9
򙧺
999FA
򙧻
999FB
򙧼
999FC
򙧽
999FD
򙧾
999FE
򙧿
999FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]