International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2908B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򐋀
902C0
򐋁
902C1
򐋂
902C2
򐋃
902C3
򐋄
902C4
򐋅
902C5
򐋆
902C6
򐋇
902C7
򐋈
902C8
򐋉
902C9
򐋊
902CA
򐋋
902CB
򐋌
902CC
򐋍
902CD
򐋎
902CE
򐋏
902CF
80
90
򐋐
902D0
򐋑
902D1
򐋒
902D2
򐋓
902D3
򐋔
902D4
򐋕
902D5
򐋖
902D6
򐋗
902D7
򐋘
902D8
򐋙
902D9
򐋚
902DA
򐋛
902DB
򐋜
902DC
򐋝
902DD
򐋞
902DE
򐋟
902DF
90
A0
򐋠
902E0
򐋡
902E1
򐋢
902E2
򐋣
902E3
򐋤
902E4
򐋥
902E5
򐋦
902E6
򐋧
902E7
򐋨
902E8
򐋩
902E9
򐋪
902EA
򐋫
902EB
򐋬
902EC
򐋭
902ED
򐋮
902EE
򐋯
902EF
A0
B0
򐋰
902F0
򐋱
902F1
򐋲
902F2
򐋳
902F3
򐋴
902F4
򐋵
902F5
򐋶
902F6
򐋷
902F7
򐋸
902F8
򐋹
902F9
򐋺
902FA
򐋻
902FB
򐋼
902FC
򐋽
902FD
򐋾
902FE
򐋿
902FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]