International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

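Currently showing the codepage for byte sequences starting with the bytes EC 9F. Each row is the high nibble of the third byte and each column is its low nibble; each cell is the Unicode code point (in hex) that the three-byte sequence maps to.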

     00   01   02   03   04   05   06   07   08   09   0A   0B   0C   0D   0E   0F
00
10
20
30
40
50
60
70
80 C7C0 C7C1 C7C2 C7C3 C7C4 C7C5 C7C6 C7C7 C7C8 C7C9 C7CA C7CB C7CC C7CD C7CE C7CF
90 C7D0 C7D1 C7D2 C7D3 C7D4 C7D5 C7D6 C7D7 C7D8 C7D9 C7DA C7DB C7DC C7DD C7DE C7DF
A0 C7E0 C7E1 C7E2 C7E3 C7E4 C7E5 C7E6 C7E7 C7E8 C7E9 C7EA C7EB C7EC C7ED C7EE C7EF
B0 C7F0 C7F1 C7F2 C7F3 C7F4 C7F5 C7F6 C7F7 C7F8 C7F9 C7FA C7FB C7FC C7FD C7FE C7FF
C0
D0
E0
F0
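
The layout above follows from the UTF-8 bit pattern for three-byte sequences (1110xxxx 10xxxxxx 10xxxxxx). The sketch below recomputes the first and last cells for the lead bytes EC 9F and cross-checks every trail byte against Python's built-in codec. The function name `decode_three_byte` is an illustrative choice, and the helper assumes its input is already a well-formed three-byte sequence.

```python
def decode_three_byte(b0, b1, b2):
    """Combine a well-formed three-byte UTF-8 sequence into a code point."""
    # 4 payload bits from the lead byte, 6 from each trail byte.
    return ((b0 & 0x0F) << 12) | ((b1 & 0x3F) << 6) | (b2 & 0x3F)


# Only trail bytes 0x80..0xBF are valid, which is why only rows 80-B0
# of the layout are populated.
first = decode_three_byte(0xEC, 0x9F, 0x80)
last = decode_three_byte(0xEC, 0x9F, 0xBF)
print(f"{first:04X} {last:04X}")  # C7C0 C7FF

# Cross-check all 64 cells against the standard library decoder.
for trail in range(0x80, 0xC0):
    expected = ord(bytes([0xEC, 0x9F, trail]).decode("utf-8"))
    assert decode_three_byte(0xEC, 0x9F, trail) == expected
```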

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
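
Two of these properties can be checked with a short sketch. It assumes that a UChar is a 16-bit UTF-16 code unit, so "bytes per UChar" is the UTF-8 length divided by the number of UTF-16 code units. The sample characters and the helper name `bytes_per_utf16_unit` are illustrative choices, not anything taken from ICU.

```python
# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
sub_bytes = "\uFFFD".encode("utf-8")
print(sub_bytes)  # b'\xef\xbf\xbd'


def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 Supplement, Hangul (BMP), and a supplementary-plane character.
samples = ["A", "\u00E9", "\uC7C0", "\U0001F600"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]
```

A supplementary-plane character takes four UTF-8 bytes but also two UTF-16 code units, so it stays within the 1 to 3 bytes per UChar range.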

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
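
The set above excludes only the surrogate range U+D800 to U+DFFF. As a minimal sketch, the membership test can be written directly and compared with what Python's own UTF-8 codec accepts. Both helper names are hypothetical, and Python's codec stands in here for the ICU converter.

```python
def is_representable(code_point):
    """True if the code point is a Unicode code point outside the surrogate range."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def python_can_encode(code_point):
    """True if Python's strict UTF-8 codec will encode this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries on both sides of the excluded range.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}", is_representable(cp))
```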