International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4848B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􄋀
1042C0
􄋁
1042C1
􄋂
1042C2
􄋃
1042C3
􄋄
1042C4
􄋅
1042C5
􄋆
1042C6
􄋇
1042C7
􄋈
1042C8
􄋉
1042C9
􄋊
1042CA
􄋋
1042CB
􄋌
1042CC
􄋍
1042CD
􄋎
1042CE
􄋏
1042CF
80
90
􄋐
1042D0
􄋑
1042D1
􄋒
1042D2
􄋓
1042D3
􄋔
1042D4
􄋕
1042D5
􄋖
1042D6
􄋗
1042D7
􄋘
1042D8
􄋙
1042D9
􄋚
1042DA
􄋛
1042DB
􄋜
1042DC
􄋝
1042DD
􄋞
1042DE
􄋟
1042DF
90
A0
􄋠
1042E0
􄋡
1042E1
􄋢
1042E2
􄋣
1042E3
􄋤
1042E4
􄋥
1042E5
􄋦
1042E6
􄋧
1042E7
􄋨
1042E8
􄋩
1042E9
􄋪
1042EA
􄋫
1042EB
􄋬
1042EC
􄋭
1042ED
􄋮
1042EE
􄋯
1042EF
A0
B0
􄋰
1042F0
􄋱
1042F1
􄋲
1042F2
􄋳
1042F3
􄋴
1042F4
􄋵
1042F5
􄋶
1042F6
􄋷
1042F7
􄋸
1042F8
􄋹
1042F9
􄋺
1042FA
􄋻
1042FB
􄋼
1042FC
􄋽
1042FD
􄋾
1042FE
􄋿
1042FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]