International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B986

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󹆀
F9180
󹆁
F9181
󹆂
F9182
󹆃
F9183
󹆄
F9184
󹆅
F9185
󹆆
F9186
󹆇
F9187
󹆈
F9188
󹆉
F9189
󹆊
F918A
󹆋
F918B
󹆌
F918C
󹆍
F918D
󹆎
F918E
󹆏
F918F
80
90
󹆐
F9190
󹆑
F9191
󹆒
F9192
󹆓
F9193
󹆔
F9194
󹆕
F9195
󹆖
F9196
󹆗
F9197
󹆘
F9198
󹆙
F9199
󹆚
F919A
󹆛
F919B
󹆜
F919C
󹆝
F919D
󹆞
F919E
󹆟
F919F
90
A0
󹆠
F91A0
󹆡
F91A1
󹆢
F91A2
󹆣
F91A3
󹆤
F91A4
󹆥
F91A5
󹆦
F91A6
󹆧
F91A7
󹆨
F91A8
󹆩
F91A9
󹆪
F91AA
󹆫
F91AB
󹆬
F91AC
󹆭
F91AD
󹆮
F91AE
󹆯
F91AF
A0
B0
󹆰
F91B0
󹆱
F91B1
󹆲
F91B2
󹆳
F91B3
󹆴
F91B4
󹆵
F91B5
󹆶
F91B6
󹆷
F91B7
󹆸
F91B8
󹆹
F91B9
󹆺
F91BA
󹆻
F91BB
󹆼
F91BC
󹆽
F91BD
󹆾
F91BE
󹆿
F91BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]