International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
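
As an illustration of how an alias list like this can be used, the sketch below resolves any of the listed names to the internal converter name. The helper names (`normalize`, `canonical_name`) are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption rather than a statement of ICU's exact comparison algorithm.

```python
# Illustrative alias lookup for the table above; not ICU's own implementation.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias in the table points at the same internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("cp1208"))
print(canonical_name("Windows_65001"))
print(canonical_name("latin-1"))
```

A real application would normally ask the converter library itself to resolve aliases instead of maintaining its own table.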


Codepage Layout

Currently showing the codepage starting with the bytes F2 96 84

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hexadecimal) for that byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     96100 96101 96102 96103 96104 96105 96106 96107 96108 96109 9610A 9610B 9610C 9610D 9610E 9610F
90     96110 96111 96112 96113 96114 96115 96116 96117 96118 96119 9611A 9611B 9611C 9611D 9611E 9611F
A0     96120 96121 96122 96123 96124 96125 96126 96127 96128 96129 9612A 9612B 9612C 9612D 9612E 9612F
B0     96130 96131 96132 96133 96134 96135 96136 96137 96138 96139 9613A 9613B 9613C 9613D 9613E 9613F
C0-F0  (empty)
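
The mapping in this layout can be checked with a short script. The following is a minimal sketch, assuming Python's built-in `utf-8` codec as a stand-in for the ICU converter; the function names are illustrative only. It decodes the lead bytes F2 96 84 plus a final byte both with the codec and by hand, using the standard four-byte UTF-8 bit layout.

```python
# Lead bytes shown in the layout; the fourth byte selects the table cell.
LEAD = bytes.fromhex("F29684")


def decode_cell(trail: int) -> int:
    """Decode LEAD + trail with the built-in UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(data: bytes) -> int:
    """Assemble a code point from a four-byte UTF-8 sequence.

    The lead byte contributes its low 3 bits, each continuation
    byte contributes its low 6 bits.
    """
    b0, b1, b2, b3 = data
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


for trail in (0x80, 0x8F, 0xBF):
    cp = decode_cell(trail)
    # Both approaches must agree on the resulting code point.
    assert cp == decode_by_hand(LEAD + bytes([trail]))
    print(f"F2 96 84 {trail:02X} -> U+{cp:05X}")
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.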

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
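
The byte counts above are per UChar, ICU's UTF-16 code unit type, which is why the maximum is 3 and not 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. The sketch below illustrates this with Python's built-in codecs (an assumption, standing in for the ICU converter), and also shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The sample characters are arbitrary choices.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample for each UTF-8 sequence length (1, 2, 3 and 4 bytes).
samples = ["A", "\u00e9", "\u20ac", "\U00096100"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
print(min(ratios), max(ratios))

# Malformed input decoded with replacement yields U+FFFD,
# whose UTF-8 encoding is the substitution sequence listed above.
replaced = b"\xff".decode("utf-8", errors="replace")
print(replaced.encode("utf-8").hex().upper())
```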

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
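
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of the same rule follows, using Python's strict `utf-8` codec as a cross-check; the function names are illustrative and not part of any ICU API.

```python
def is_representable(cp: int) -> bool:
    """True for any Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def strict_utf8_accepts(cp: int) -> bool:
    """Cross-check: the strict UTF-8 encoder refuses lone surrogates."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries on both sides of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    assert is_representable(cp) == strict_utf8_accepts(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```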