International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
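
The alias list can be treated as a lookup table that maps any listed name back to the internal converter name. The sketch below is illustrative only: `UTF8_ALIASES` is copied from the list, while `canonical_name` and the loose matching rule in `_loose_key` (ignore case, hyphens, underscores and spaces) are assumptions made for this example, not ICU's actual alias resolution code.

```python
# Aliases copied from the list on this page.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def _loose_key(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Precompute the normalized keys once so lookups are a set membership test.
_ALIAS_KEYS = {_loose_key(alias) for alias in UTF8_ALIASES}


def canonical_name(alias: str):
    """Return 'UTF-8' if alias matches a listed name, otherwise None."""
    return "UTF-8" if _loose_key(alias) in _ALIAS_KEYS else None


print(canonical_name("IBM_1208"))
print(canonical_name("latin1"))
```

Normalizing both sides before comparing keeps the table small and lets spelling variants such as `IBM_1208` resolve without listing each one.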


Codepage Layout

Currently showing the codepage for byte sequences that start with the lead bytes F1 98 8B.

Rows give the high nibble and columns the low nibble of the final byte; each cell gives the Unicode code point in hex.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  582C0 582C1 582C2 582C3 582C4 582C5 582C6 582C7 582C8 582C9 582CA 582CB 582CC 582CD 582CE 582CF
90  582D0 582D1 582D2 582D3 582D4 582D5 582D6 582D7 582D8 582D9 582DA 582DB 582DC 582DD 582DE 582DF
A0  582E0 582E1 582E2 582E3 582E4 582E5 582E6 582E7 582E8 582E9 582EA 582EB 582EC 582ED 582EE 582EF
B0  582F0 582F1 582F2 582F3 582F4 582F5 582F6 582F7 582F8 582F9 582FA 582FB 582FC 582FD 582FE 582FF
C0-F0  (empty)
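
The mapping in this layout follows from the standard four-byte UTF-8 bit pattern (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx). The sketch below recomputes it for the lead bytes F1 98 8B; the names `LEAD`, `decode_four_byte` and `layout_row` are made up for this example, and the manual decoder does no validation, so it is only meaningful for well-formed sequences.

```python
LEAD = bytes([0xF1, 0x98, 0x8B])  # lead bytes shown on this page


def decode_four_byte(seq: bytes) -> int:
    """Combine the payload bits of a well-formed four-byte UTF-8 sequence."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trailing byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_row(high_nibble: int) -> list:
    """Code points for the 16 final bytes sharing the given high nibble."""
    return [
        decode_four_byte(LEAD + bytes([high_nibble * 16 + low]))
        for low in range(16)
    ]


for final in (0x80, 0xBF):
    seq = LEAD + bytes([final])
    cp = decode_four_byte(seq)
    # Cross-check the manual arithmetic against Python's own UTF-8 decoder.
    assert cp == ord(seq.decode("utf-8"))
    print(f"{seq.hex().upper()} -> U+{cp:05X}")
# prints:
# F1988B80 -> U+582C0
# F1988BBF -> U+582FF
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.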

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
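
The byte counts above are per UChar, that is, per UTF-16 code unit, not per code point. A rough check in plain Python (not ICU) is sketched below; `bytes_per_uchar` is a hypothetical helper introduced for this example. A supplementary character takes four UTF-8 bytes but occupies two UTF-16 code units, so it stays within the maximum of three bytes per unit.

```python
def bytes_per_uchar(text: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for text."""
    utf8_len = len(text.encode("utf-8"))
    utf16_units = len(text.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


for ch in ("A", "\u20ac", "\U000582C0"):
    n_bytes, n_units = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_units} UChar(s)")
# prints:
# U+0041: 1 bytes / 1 UChar(s)
# U+20AC: 3 bytes / 1 UChar(s)
# U+582C0: 4 bytes / 2 UChar(s)

# The substitution character bytes are the UTF-8 encoding of U+FFFD.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
```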

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
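
The set above reads as "every code point except the surrogate range U+D800 to U+DFFF". A minimal sketch of the same test follows, using Python's built-in UTF-8 codec as a stand-in for the ICU converter; `is_representable` and `encodes` are hypothetical names introduced for this example.

```python
def is_representable(cp: int) -> bool:
    """True for any Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range and the ends of the code space.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    assert is_representable(cp) == encodes(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```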