International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1809C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񀜀
40700
񀜁
40701
񀜂
40702
񀜃
40703
񀜄
40704
񀜅
40705
񀜆
40706
񀜇
40707
񀜈
40708
񀜉
40709
񀜊
4070A
񀜋
4070B
񀜌
4070C
񀜍
4070D
񀜎
4070E
񀜏
4070F
80
90
񀜐
40710
񀜑
40711
񀜒
40712
񀜓
40713
񀜔
40714
񀜕
40715
񀜖
40716
񀜗
40717
񀜘
40718
񀜙
40719
񀜚
4071A
񀜛
4071B
񀜜
4071C
񀜝
4071D
񀜞
4071E
񀜟
4071F
90
A0
񀜠
40720
񀜡
40721
񀜢
40722
񀜣
40723
񀜤
40724
񀜥
40725
񀜦
40726
񀜧
40727
񀜨
40728
񀜩
40729
񀜪
4072A
񀜫
4072B
񀜬
4072C
񀜭
4072D
񀜮
4072E
񀜯
4072F
A0
B0
񀜰
40730
񀜱
40731
񀜲
40732
񀜳
40733
񀜴
40734
񀜵
40735
񀜶
40736
񀜷
40737
񀜸
40738
񀜹
40739
񀜺
4073A
񀜻
4073B
񀜼
4073C
񀜽
4073D
񀜾
4073E
񀜿
4073F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]