International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A088

Each cell gives the hexadecimal code point of the character encoded by the bytes F1 A0 88 followed by the trail byte (row + column). Empty rows have no mapping.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  60200 60201 60202 60203 60204 60205 60206 60207 60208 60209 6020A 6020B 6020C 6020D 6020E 6020F
90  60210 60211 60212 60213 60214 60215 60216 60217 60218 60219 6021A 6021B 6021C 6021D 6021E 6021F
A0  60220 60221 60222 60223 60224 60225 60226 60227 60228 60229 6022A 6022B 6022C 6022D 6022E 6022F
B0  60230 60231 60232 60233 60234 60235 60236 60237 60238 60239 6023A 6023B 6023C 6023D 6023E 6023F
C0
D0
E0
F0
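
The layout above can be reproduced with a few lines of code. The following is a minimal sketch in Python rather than ICU itself; the helper names `decode_manual`, `decode_cell`, and `has_mapping` are hypothetical and chosen only for illustration. It shows how the three leading bytes F1 A0 88 plus one trail byte yield the code points 60200 through 6023F, and why only the rows 80 through B0 are populated.

```python
PREFIX = bytes.fromhex("F1A088")  # the leading bytes named in the layout


def decode_manual(seq: bytes) -> int:
    """Decode a 4-byte UTF-8 sequence (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx) by hand."""
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


def decode_cell(trail: int) -> int:
    """Decode PREFIX + trail byte with Python's built-in UTF-8 codec."""
    return ord((PREFIX + bytes([trail])).decode("utf-8"))


def has_mapping(trail: int) -> bool:
    """True if PREFIX + trail byte is a well-formed UTF-8 sequence."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    for row in range(0x80, 0xC0, 0x10):
        cells = " ".join(f"{decode_cell(row + col):05X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

Trail bytes must have the form 10xxxxxx, that is 80 through BF, so `has_mapping` returns False for every byte in the rows 00 to 70 and C0 to F0.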

Information About This Converter

Type of converter:                       UCNV_UTF8
Minimum number of bytes per UChar:       1
Maximum number of bytes per UChar:       3
Substitution character:                  \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
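
Two of the values above can be checked without ICU. The sketch below uses Python's standard codecs and assumes that a UChar is one UTF-16 code unit, as in ICU; the helper name `utf8_bytes_per_uchar` is hypothetical. One plausible reading of the maximum of 3 is that a 4-byte UTF-8 sequence always corresponds to a surrogate pair, meaning two UChars, so the ratio per UChar never exceeds 3.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return n_bytes / n_units


# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for sample in ("A", "\u00e9", "\uffff", "\U00060200"):
        print(f"U+{ord(sample):04X}: {utf8_bytes_per_uchar(sample)} bytes per UChar")
    print(SUBSTITUTION.hex(" ").upper())
    # Python's decoder also substitutes U+FFFD for malformed input when asked to.
    print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```

Python's `errors="replace"` handler is used here only as an analogue; ICU's own substitution behavior is configured through its converter callbacks.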


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
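
The set above says that every code point except the surrogates D800 through DFFF is representable. A minimal sketch of that rule, again using Python's UTF-8 codec as a stand-in for the ICU converter (the function name `is_representable` is hypothetical):

```python
def is_representable(code_point: int) -> bool:
    """True if the code point can be encoded as UTF-8, i.e. it is not a surrogate."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder rejects lone surrogates U+D800..U+DFFF.
        return False


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x60200, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
```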