International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
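
As a rough illustration of how an alias table like this one can be used, the sketch below maps any listed alias back to the internal converter name. The alias strings are copied from the list above; the helper names `fold` and `canonical_name`, and the loose matching rule (ignore case, '-', '_' and spaces), are assumptions made for this example and are not claimed to reproduce ICU's own lookup exactly.

```python
# Aliases copied from the list above. The lookup logic is an illustrative
# assumption, not ICU's implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def fold(name: str) -> str:
    """Hypothetical loose key: lowercase, ignoring '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias resolves to the single internal converter name "UTF-8".
ALIAS_TO_CANONICAL = {fold(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(alias: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(fold(alias))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "x-utf_8j", "latin1"):
        print(candidate, "->", canonical_name(candidate))
```

Folding both the table entries and the query with the same function keeps the lookup a single dictionary access, at the cost of treating names that differ only in punctuation as identical.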


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with the byte DD

Rows 00-70 and C0-F0 contain no mappings.

Bytes   Code point   Character
DD 80   0740          ݀
DD 81   0741          ݁
DD 82   0742          ݂
DD 83   0743          ݃
DD 84   0744          ݄
DD 85   0745          ݅
DD 86   0746          ݆
DD 87   0747          ݇
DD 88   0748          ݈
DD 89   0749          ݉
DD 8A   074A          ݊
DD 8B   074B         ݋
DD 8C   074C         ݌
DD 8D   074D         ݍ
DD 8E   074E         ݎ
DD 8F   074F         ݏ
DD 90   0750         ݐ
DD 91   0751         ݑ
DD 92   0752         ݒ
DD 93   0753         ݓ
DD 94   0754         ݔ
DD 95   0755         ݕ
DD 96   0756         ݖ
DD 97   0757         ݗ
DD 98   0758         ݘ
DD 99   0759         ݙ
DD 9A   075A         ݚ
DD 9B   075B         ݛ
DD 9C   075C         ݜ
DD 9D   075D         ݝ
DD 9E   075E         ݞ
DD 9F   075F         ݟ
DD A0   0760         ݠ
DD A1   0761         ݡ
DD A2   0762         ݢ
DD A3   0763         ݣ
DD A4   0764         ݤ
DD A5   0765         ݥ
DD A6   0766         ݦ
DD A7   0767         ݧ
DD A8   0768         ݨ
DD A9   0769         ݩ
DD AA   076A         ݪ
DD AB   076B         ݫ
DD AC   076C         ݬ
DD AD   076D         ݭ
DD AE   076E         ݮ
DD AF   076F         ݯ
DD B0   0770         ݰ
DD B1   0771         ݱ
DD B2   0772         ݲ
DD B3   0773         ݳ
DD B4   0774         ݴ
DD B5   0775         ݵ
DD B6   0776         ݶ
DD B7   0777         ݷ
DD B8   0778         ݸ
DD B9   0779         ݹ
DD BA   077A         ݺ
DD BB   077B         ݻ
DD BC   077C         ݼ
DD BD   077D         ݽ
DD BE   077E         ݾ
DD BF   077F         ݿ
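
The layout above follows directly from the two-byte UTF-8 bit pattern (110xxxxx 10xxxxxx): the lead byte DD contributes its low five bits and each trail byte in 80-BF contributes its low six bits. Bytes outside 80-BF are not valid trail bytes, which is why the other rows are empty. The following is a minimal sketch of that arithmetic; the function name `decode_two_byte` is hypothetical, and the result is cross-checked against Python's built-in decoder.

```python
LEAD = 0xDD  # the lead byte shown on this page


def decode_two_byte(lead: int, trail: int) -> int:
    """Combine a two-byte UTF-8 sequence into a code point."""
    if not (0xC2 <= lead <= 0xDF and 0x80 <= trail <= 0xBF):
        raise ValueError("not a valid two-byte UTF-8 sequence")
    # Five payload bits from the lead byte, six from the trail byte.
    return ((lead & 0x1F) << 6) | (trail & 0x3F)


# Build the four populated rows (80, 90, A0, B0) of the layout.
rows = {}
for trail in range(0x80, 0xC0):
    cp = decode_two_byte(LEAD, trail)
    # Cross-check the arithmetic against Python's own UTF-8 decoder.
    assert bytes([LEAD, trail]).decode("utf-8") == chr(cp)
    rows.setdefault(trail & 0xF0, []).append(f"{cp:04X}")

if __name__ == "__main__":
    for row_start, cells in rows.items():
        print(f"{row_start:02X}  " + " ".join(cells))
```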

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
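
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. A four-byte UTF-8 sequence corresponds to a surrogate pair (two UChars), so the per-UChar maximum stays at 3. The sketch below illustrates this with a few sample characters chosen for this example, and also checks that the substitution bytes decode to U+FFFD; the helper name `bytes_per_utf16_unit` is hypothetical.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution bytes from the table are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"

# Sample characters (an assumption of this sketch): ASCII, Latin-1,
# a character from the DD block, a three-byte BMP character, and a
# supplementary character.
SAMPLES = ["A", "\u00E9", "\u0740", "\u20AC", "\U0001F600"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UChar")
```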

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
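
The set above says that every code point except the surrogates U+D800-U+DFFF can be encoded. A quick way to see the same boundary, sketched here with Python's strict UTF-8 codec rather than ICU (the helper name `representable_in_utf8` is hypothetical):

```python
def representable_in_utf8(ch: str) -> bool:
    """True if the single code point can be encoded by a strict UTF-8 encoder."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec rejects lone surrogates.
        return False


if __name__ == "__main__":
    for cp in (0x0740, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {representable_in_utf8(chr(cp))}")
```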