International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
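
The aliases listed above all resolve to the same internal converter name. As a rough illustration only, the following Python sketch builds a lookup table from that list. The `normalize` and `resolve` helpers are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption; ICU's own name comparison may differ in detail.

```python
from typing import Optional

# Aliases copied from the list above; everything else here is illustrative.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Fold case and drop '-', '_' and spaces (assumed loose-matching rule)."""
    return "".join(ch for ch in name.casefold() if ch not in "-_ ")


# Map each normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): CANONICAL for alias in ALIASES}


def resolve(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM_1208"))  # alias from the list, different case and separator
print(resolve("latin-1"))   # not in the list above
```

A dictionary keyed on the normalized form keeps lookups constant-time and makes the matching rule easy to swap out.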


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F1 82 88.

Rows give the high nibble of the final byte, columns the low nibble. Each cell is the Unicode code point (hex) for that byte sequence.

Byte    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70   (empty)
80      42200 42201 42202 42203 42204 42205 42206 42207 42208 42209 4220A 4220B 4220C 4220D 4220E 4220F
90      42210 42211 42212 42213 42214 42215 42216 42217 42218 42219 4221A 4221B 4221C 4221D 4221E 4221F
A0      42220 42221 42222 42223 42224 42225 42226 42227 42228 42229 4222A 4222B 4222C 4222D 4222E 4222F
B0      42230 42231 42232 42233 42234 42235 42236 42237 42238 42239 4223A 4223B 4223C 4223D 4223E 4223F
C0-F0   (empty)
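
The mapping from the prefix F1 82 88 plus a final byte to a code point follows from the standard UTF-8 bit layout for 4-byte sequences. The sketch below is a minimal illustration in Python, not ICU code; `decode_four_byte` is a hypothetical helper that checks only the shape of the sequence (it does not reject overlong forms or values above U+10FFFF).

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (shape check only)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


PREFIX = bytes.fromhex("F18288")

# Rebuild the populated rows of the layout: final byte 80..BF.
for row in range(0x80, 0xC0, 0x10):
    cells = [decode_four_byte(PREFIX + bytes([row + col])) for col in range(16)]
    print(f"{row:02X}  " + " ".join(f"{cp:05X}" for cp in cells))
```

Final bytes outside 80..BF are not valid trail bytes, which is why only four rows of the layout are populated.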

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
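
The minimum and maximum are counted per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a single character can take 4 bytes in UTF-8. The following Python sketch is an illustration using Python's built-in codecs rather than ICU; `bytes_per_uchar` is a hypothetical helper.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample for each UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", chr(0x42200)]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

# The substitution bytes are the UTF-8 encoding of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION.hex(" ").upper())
```

A supplementary character such as U+42200 takes 4 bytes but occupies two UTF-16 code units, so it averages 2 bytes per UChar. The 3-byte worst case comes from BMP characters at or above U+0800.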

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
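
The set above excludes only the surrogate code points. A minimal Python sketch of that membership test follows, cross-checked against Python's own strict UTF-8 encoder. Both helper names are hypothetical, and Python's codec is used here as a stand-in, not as ICU's implementation.

```python
def is_representable(cp: int) -> bool:
    """True for any code point in 0..10FFFF outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_cleanly(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe the boundaries of the excluded range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x42200, 0x10FFFF):
    print(f"U+{cp:04X}: set={is_representable(cp)} encoder={encodes_cleanly(cp)}")
```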