International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B086

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󰆀
F0180
󰆁
F0181
󰆂
F0182
󰆃
F0183
󰆄
F0184
󰆅
F0185
󰆆
F0186
󰆇
F0187
󰆈
F0188
󰆉
F0189
󰆊
F018A
󰆋
F018B
󰆌
F018C
󰆍
F018D
󰆎
F018E
󰆏
F018F
80
90
󰆐
F0190
󰆑
F0191
󰆒
F0192
󰆓
F0193
󰆔
F0194
󰆕
F0195
󰆖
F0196
󰆗
F0197
󰆘
F0198
󰆙
F0199
󰆚
F019A
󰆛
F019B
󰆜
F019C
󰆝
F019D
󰆞
F019E
󰆟
F019F
90
A0
󰆠
F01A0
󰆡
F01A1
󰆢
F01A2
󰆣
F01A3
󰆤
F01A4
󰆥
F01A5
󰆦
F01A6
󰆧
F01A7
󰆨
F01A8
󰆩
F01A9
󰆪
F01AA
󰆫
F01AB
󰆬
F01AC
󰆭
F01AD
󰆮
F01AE
󰆯
F01AF
A0
B0
󰆰
F01B0
󰆱
F01B1
󰆲
F01B2
󰆳
F01B3
󰆴
F01B4
󰆵
F01B5
󰆶
F01B6
󰆷
F01B7
󰆸
F01B8
󰆹
F01B9
󰆺
F01BA
󰆻
F01BB
󰆼
F01BC
󰆽
F01BD
󰆾
F01BE
󰆿
F01BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]