International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
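
As a rough illustration of how an alias list like this one can be used, the sketch below maps each alias to the internal converter name. The names `ALIASES`, `normalize`, and `resolve` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplification assumed for this example; ICU's own alias matching may differ in detail.

```python
# Aliases copied from the list above; everything else here is illustrative.
INTERNAL_NAME = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed rule)."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Lookup table from normalized alias to the internal converter name.
LOOKUP = {normalize(alias): INTERNAL_NAME for alias in ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


print(resolve("IBM_1208"))       # an alias from the list, spelled differently
print(resolve("windows-65001"))  # an alias from the list, spelled as shown
print(resolve("cp1252"))         # not in this list, so no match
```

A dictionary keyed on the normalized spelling keeps each lookup constant-time and tolerates minor spelling differences in the requested name.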


Codepage Layout

Currently showing the codepage starting with the bytes F28986

Each cell gives the code point, in hexadecimal, of the character at that byte position (row = high nibble, column = low nibble of the final byte).

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80   89180 89181 89182 89183 89184 89185 89186 89187 89188 89189 8918A 8918B 8918C 8918D 8918E 8918F
90   89190 89191 89192 89193 89194 89195 89196 89197 89198 89199 8919A 8919B 8919C 8919D 8919E 8919F
A0   891A0 891A1 891A2 891A3 891A4 891A5 891A6 891A7 891A8 891A9 891AA 891AB 891AC 891AD 891AE 891AF
B0   891B0 891B1 891B2 891B3 891B4 891B5 891B6 891B7 891B8 891B9 891BA 891BB 891BC 891BD 891BE 891BF
C0-F0  (empty)
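
The code points in the layout follow directly from the UTF-8 bit pattern for a four-byte sequence. The sketch below reproduces the first and last cells in two ways: with Python's built-in UTF-8 codec and with explicit bit arithmetic. The function names `decode_cell` and `decode_by_hand` are hypothetical helpers for this example, not part of ICU.

```python
LEAD = bytes([0xF2, 0x89, 0x86])  # the three leading bytes named on this page


def decode_cell(trail: int) -> int:
    """Return the code point encoded by LEAD plus one trailing byte."""
    if not 0x80 <= trail <= 0xBF:
        raise ValueError("UTF-8 trailing bytes must be in 0x80..0xBF")
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(seq: bytes) -> int:
    """Decode a four-byte UTF-8 sequence with bit arithmetic only.

    The lead byte contributes 3 payload bits and each of the three
    following bytes contributes 6, for 21 bits in total.
    """
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


first = decode_cell(0x80)
last = decode_cell(0xBF)
print(f"{first:X} {last:X}")  # prints "89180 891BF"

# Both methods agree for every trailing byte in the populated rows.
for trail in range(0x80, 0xC0):
    assert decode_cell(trail) == decode_by_hand(LEAD + bytes([trail]))
```

Rows 00-70 and C0-F0 are empty because only bytes 80 through BF can appear as the final byte of a multi-byte UTF-8 sequence.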

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
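
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. The sketch below, using only Python's standard codecs rather than ICU, shows why the ratio stays between 1 and 3: a supplementary character needs four UTF-8 bytes but also two UTF-16 code units. It also checks that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The helper `bytes_per_uchar` is a hypothetical name for this example, and Python's `errors="replace"` handler is used only as an analogy to a converter's substitution behavior.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


samples = ["A", "\u00e9", "\u20ac", "\U00089180"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
# U+0041 gives 1.0, U+00E9 gives 2.0, U+20AC gives 3.0,
# and U+89180 gives 2.0 (4 bytes over 2 code units).

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert SUBSTITUTION.decode("utf-8") == "\ufffd"

# Python inserts U+FFFD when asked to replace an undecodable byte.
replaced = b"abc\xff".decode("utf-8", errors="replace")
print(replaced.encode("utf-8"))  # b'abc\xef\xbf\xbd'
```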

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
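
The set pattern above excludes only the surrogate range U+D800 to U+DFFF. As a minimal sketch, the check below expresses that set as a predicate and compares it with what Python's strict UTF-8 encoder accepts at the boundaries of the range. Both function names are hypothetical, and Python's codec stands in for the ICU converter here as an assumption.

```python
def is_representable(cp: int) -> bool:
    """True if cp is in [^\\uD800-\\uDFFF], limited to valid code points."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_python(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the edges of the surrogate range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), encodes_in_python(cp))
    assert is_representable(cp) == encodes_in_python(cp)
```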