International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes DA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
ڀ
0680
ځ
0681
ڂ
0682
ڃ
0683
ڄ
0684
څ
0685
چ
0686
ڇ
0687
ڈ
0688
ډ
0689
ڊ
068A
ڋ
068B
ڌ
068C
ڍ
068D
ڎ
068E
ڏ
068F
80
90
ڐ
0690
ڑ
0691
ڒ
0692
ړ
0693
ڔ
0694
ڕ
0695
ږ
0696
ڗ
0697
ژ
0698
ڙ
0699
ښ
069A
ڛ
069B
ڜ
069C
ڝ
069D
ڞ
069E
ڟ
069F
90
A0
ڠ
06A0
ڡ
06A1
ڢ
06A2
ڣ
06A3
ڤ
06A4
ڥ
06A5
ڦ
06A6
ڧ
06A7
ڨ
06A8
ک
06A9
ڪ
06AA
ګ
06AB
ڬ
06AC
ڭ
06AD
ڮ
06AE
گ
06AF
A0
B0
ڰ
06B0
ڱ
06B1
ڲ
06B2
ڳ
06B3
ڴ
06B4
ڵ
06B5
ڶ
06B6
ڷ
06B7
ڸ
06B8
ڹ
06B9
ں
06BA
ڻ
06BB
ڼ
06BC
ڽ
06BD
ھ
06BE
ڿ
06BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]