International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29088

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򐈀
90200
򐈁
90201
򐈂
90202
򐈃
90203
򐈄
90204
򐈅
90205
򐈆
90206
򐈇
90207
򐈈
90208
򐈉
90209
򐈊
9020A
򐈋
9020B
򐈌
9020C
򐈍
9020D
򐈎
9020E
򐈏
9020F
80
90
򐈐
90210
򐈑
90211
򐈒
90212
򐈓
90213
򐈔
90214
򐈕
90215
򐈖
90216
򐈗
90217
򐈘
90218
򐈙
90219
򐈚
9021A
򐈛
9021B
򐈜
9021C
򐈝
9021D
򐈞
9021E
򐈟
9021F
90
A0
򐈠
90220
򐈡
90221
򐈢
90222
򐈣
90223
򐈤
90224
򐈥
90225
򐈦
90226
򐈧
90227
򐈨
90228
򐈩
90229
򐈪
9022A
򐈫
9022B
򐈬
9022C
򐈭
9022D
򐈮
9022E
򐈯
9022F
A0
B0
򐈰
90230
򐈱
90231
򐈲
90232
򐈳
90233
򐈴
90234
򐈵
90235
򐈶
90236
򐈷
90237
򐈸
90238
򐈹
90239
򐈺
9023A
򐈻
9023B
򐈼
9023C
򐈽
9023D
򐈾
9023E
򐈿
9023F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]