International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
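
As a rough illustration of how an alias list like this can be used, the sketch below maps each listed alias to the internal converter name. The matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption, and `normalize` and `canonical_name` are hypothetical helpers rather than ICU functions; ICU's own name comparison may differ in detail.

```python
from typing import Optional

# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Every normalized alias points at the internal converter name.
_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def canonical_name(name: str) -> Optional[str]:
    """Return 'UTF-8' if the name matches a listed alias, otherwise None."""
    return _CANONICAL.get(normalize(name))

print(canonical_name("IBM_1208"))
print(canonical_name("windows-65001"))
print(canonical_name("latin-1"))
```

Names that are not in the list, such as `latin-1`, yield `None` here instead of raising an error.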


Codepage Layout

Currently showing the codepage starting with the bytes F2B184

Each cell gives the Unicode code point (hexadecimal) for the trailing byte formed by adding the column to the row. Rows 00-70 and C0-F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  B1100 B1101 B1102 B1103 B1104 B1105 B1106 B1107 B1108 B1109 B110A B110B B110C B110D B110E B110F
90  B1110 B1111 B1112 B1113 B1114 B1115 B1116 B1117 B1118 B1119 B111A B111B B111C B111D B111E B111F
A0  B1120 B1121 B1122 B1123 B1124 B1125 B1126 B1127 B1128 B1129 B112A B112B B112C B112D B112E B112F
B0  B1130 B1131 B1132 B1133 B1134 B1135 B1136 B1137 B1138 B1139 B113A B113B B113C B113D B113E B113F
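
The mapping in the codepage layout above can be reproduced by hand. The following is a minimal sketch, using Python's built-in UTF-8 codec instead of ICU, that appends a trailing byte to the lead bytes F2 B1 84 and recovers the code point; `decode_cell` and `manual_decode` are hypothetical helper names introduced for illustration.

```python
LEAD = bytes([0xF2, 0xB1, 0x84])  # lead bytes shown on this page

def decode_cell(trail: int) -> int:
    """Decode LEAD + trail as UTF-8 and return the code point.

    Raises UnicodeDecodeError when trail is not a continuation byte (80..BF).
    """
    return ord((LEAD + bytes([trail])).decode("utf-8"))

def manual_decode(seq: bytes) -> int:
    """Bit-level decoding of a 4-byte UTF-8 sequence (no validation)."""
    b0, b1, b2, b3 = seq
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)

print(f"{decode_cell(0x80):X}")                   # B1100
print(f"{decode_cell(0xBF):X}")                   # B113F
print(f"{manual_decode(LEAD + bytes([0x9A])):X}")  # B111A
```

Trailing bytes outside 80..BF make `decode_cell` raise `UnicodeDecodeError`, which corresponds to the empty rows of the layout.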

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
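
Two of these properties can be checked with a short sketch, again using Python's built-in codecs instead of ICU. It assumes that a UChar is a 16-bit UTF-16 code unit, so a supplementary character occupies two UChars; `bytes_per_utf16_unit` is a hypothetical helper introduced here.

```python
# The substitution bytes EF BF BD are the UTF-8 encoding of U+FFFD.
REPLACEMENT = "\uFFFD"
print(REPLACEMENT.encode("utf-8") == b"\xef\xbf\xbd")  # True

def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

print(bytes_per_utf16_unit("A"))           # 1.0 (1 byte, 1 unit)
print(bytes_per_utf16_unit("\uFFFD"))      # 3.0 (3 bytes, 1 unit)
print(bytes_per_utf16_unit(chr(0xB1100)))  # 2.0 (4 bytes, 2 units)
```

Under that assumption, a 4-byte sequence never exceeds 3 bytes per UChar, because it is spread over a surrogate pair.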


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
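
The set above excludes only the surrogate range U+D800..U+DFFF. A small sketch, using Python's strict UTF-8 encoder as a stand-in for the converter, shows the same boundary; `representable` is a hypothetical helper name.

```python
def representable(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates (U+D800..U+DFFF) are rejected.
        return False

for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB1100, 0x10FFFF):
    in_set = not (0xD800 <= cp <= 0xDFFF)  # membership in [^\uD800-\uDFFF]
    print(f"U+{cp:04X} representable={representable(cp)} in_set={in_set}")
```

For every code point tested, the encoder's answer agrees with membership in the set.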