International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes EF97

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
F5C0
F5C1
F5C2
F5C3
F5C4
F5C5
F5C6
F5C7
F5C8
F5C9
F5CA
F5CB
F5CC
F5CD
F5CE
F5CF
80
90
F5D0
F5D1
F5D2
F5D3
F5D4
F5D5
F5D6
F5D7
F5D8
F5D9
F5DA
F5DB
F5DC
F5DD
F5DE
F5DF
90
A0
F5E0
F5E1
F5E2
F5E3
F5E4
F5E5
F5E6
F5E7
F5E8
F5E9
F5EA
F5EB
F5EC
F5ED
F5EE
F5EF
A0
B0
F5F0
F5F1
F5F2
F5F3
F5F4
F5F5
F5F6
F5F7
F5F8
F5F9
F5FA
F5FB
F5FC
F5FD
F5FE
F5FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]