International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
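
The alias list above can be treated as a small lookup table that maps any listed label to the internal converter name. The following is a minimal sketch under stated assumptions: the helper `canonical_name` is hypothetical and is not part of ICU, and it only compares labels case-insensitively, which may be stricter than the matching ICU itself performs.

```python
# Aliases copied from the list above. The lookup helper is a hypothetical
# illustration, not an ICU function.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def canonical_name(label):
    """Return 'UTF-8' if label matches a listed alias, otherwise None.

    Assumption: case-insensitive comparison only.
    """
    wanted = label.strip().lower()
    for alias in UTF8_ALIASES:
        if alias.lower() == wanted:
            return "UTF-8"
    return None


print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```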


Codepage Layout

Currently showing the codepage starting with the bytes EF8B

Rows are the high nibble of the third byte, columns are the low nibble. Each cell is the Unicode code point (hexadecimal) for the byte sequence EF 8B xx.

      00   01   02   03   04   05   06   07   08   09   0A   0B   0C   0D   0E   0F
80  F2C0 F2C1 F2C2 F2C3 F2C4 F2C5 F2C6 F2C7 F2C8 F2C9 F2CA F2CB F2CC F2CD F2CE F2CF
90  F2D0 F2D1 F2D2 F2D3 F2D4 F2D5 F2D6 F2D7 F2D8 F2D9 F2DA F2DB F2DC F2DD F2DE F2DF
A0  F2E0 F2E1 F2E2 F2E3 F2E4 F2E5 F2E6 F2E7 F2E8 F2E9 F2EA F2EB F2EC F2ED F2EE F2EF
B0  F2F0 F2F1 F2F2 F2F3 F2F4 F2F5 F2F6 F2F7 F2F8 F2F9 F2FA F2FB F2FC F2FD F2FE F2FF

Rows 00-70 and C0-F0 contain no mappings.
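
The layout above can be reproduced by decoding each three-byte sequence that starts with EF 8B. This is a minimal sketch using Python's built-in UTF-8 codec rather than ICU. The names `PREFIX`, `layout_rows`, and `decodes` are introduced here for illustration only.

```python
PREFIX = bytes([0xEF, 0x8B])  # lead bytes shown on this page


def decodes(last_byte, prefix=PREFIX):
    """Return True if prefix + last_byte is a well-formed UTF-8 sequence."""
    try:
        (prefix + bytes([last_byte])).decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def layout_rows(prefix=PREFIX):
    """Map each populated row (80, 90, A0, B0) to its 16 code points."""
    rows = {}
    for high in range(0x80, 0xC0, 0x10):
        rows[high] = [
            ord((prefix + bytes([high + low])).decode("utf-8"))
            for low in range(16)
        ]
    return rows


for high, code_points in layout_rows().items():
    print(f"{high:02X}  " + " ".join(f"{cp:04X}" for cp in code_points))
```

Only third bytes in the range 80-BF are valid continuation bytes, which is why the other rows of the grid are empty.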

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
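
Two of the values above can be checked with a short sketch. It assumes that "bytes per UChar" means UTF-8 bytes divided by UTF-16 code units (a UChar being ICU's 16-bit code unit type), and it uses Python's built-in codecs rather than ICU. The helper `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes needed per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_uchar("A"))           # 1 byte, 1 code unit
print(bytes_per_uchar("\uFFFD"))      # 3 bytes, 1 code unit
print(bytes_per_uchar("\U0001F600"))  # 4 bytes, 2 code units (surrogate pair)

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")

# Python's own "replace" error handler inserts the same character
# when it meets a byte that cannot be decoded.
print(b"abc\xff".decode("utf-8", errors="replace") == "abc\uFFFD")
```

A supplementary character takes four bytes but occupies two UChars, so the per-UChar maximum stays at 3 under this reading.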

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
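
The set above excludes only the surrogate range D800-DFFF. A minimal sketch of the same rule, using Python's strict UTF-8 codec as a stand-in for the ICU converter (an assumption, since the two are separate implementations), with a hypothetical helper `representable`:

```python
def representable(code_point):
    """Return True if the code point can be encoded as UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python rejects lone surrogates when encoding to UTF-8.
        return False


for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", representable(cp))
```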