International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29988

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򙈀
99200
򙈁
99201
򙈂
99202
򙈃
99203
򙈄
99204
򙈅
99205
򙈆
99206
򙈇
99207
򙈈
99208
򙈉
99209
򙈊
9920A
򙈋
9920B
򙈌
9920C
򙈍
9920D
򙈎
9920E
򙈏
9920F
80
90
򙈐
99210
򙈑
99211
򙈒
99212
򙈓
99213
򙈔
99214
򙈕
99215
򙈖
99216
򙈗
99217
򙈘
99218
򙈙
99219
򙈚
9921A
򙈛
9921B
򙈜
9921C
򙈝
9921D
򙈞
9921E
򙈟
9921F
90
A0
򙈠
99220
򙈡
99221
򙈢
99222
򙈣
99223
򙈤
99224
򙈥
99225
򙈦
99226
򙈧
99227
򙈨
99228
򙈩
99229
򙈪
9922A
򙈫
9922B
򙈬
9922C
򙈭
9922D
򙈮
9922E
򙈯
9922F
A0
B0
򙈰
99230
򙈱
99231
򙈲
99232
򙈳
99233
򙈴
99234
򙈵
99235
򙈶
99236
򙈷
99237
򙈸
99238
򙈹
99239
򙈺
9923A
򙈻
9923B
򙈼
9923C
򙈽
9923D
򙈾
9923E
򙈿
9923F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]