International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes C9

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
ɀ
0240
Ɂ
0241
ɂ
0242
Ƀ
0243
Ʉ
0244
Ʌ
0245
Ɇ
0246
ɇ
0247
Ɉ
0248
ɉ
0249
Ɋ
024A
ɋ
024B
Ɍ
024C
ɍ
024D
Ɏ
024E
ɏ
024F
80
90
ɐ
0250
ɑ
0251
ɒ
0252
ɓ
0253
ɔ
0254
ɕ
0255
ɖ
0256
ɗ
0257
ɘ
0258
ə
0259
ɚ
025A
ɛ
025B
ɜ
025C
ɝ
025D
ɞ
025E
ɟ
025F
90
A0
ɠ
0260
ɡ
0261
ɢ
0262
ɣ
0263
ɤ
0264
ɥ
0265
ɦ
0266
ɧ
0267
ɨ
0268
ɩ
0269
ɪ
026A
ɫ
026B
ɬ
026C
ɭ
026D
ɮ
026E
ɯ
026F
A0
B0
ɰ
0270
ɱ
0271
ɲ
0272
ɳ
0273
ɴ
0274
ɵ
0275
ɶ
0276
ɷ
0277
ɸ
0278
ɹ
0279
ɺ
027A
ɻ
027B
ɼ
027C
ɽ
027D
ɾ
027E
ɿ
027F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]