International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
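
As a rough illustration of how an alias list like this can be used, the sketch below maps any listed alias to the internal converter name. The helper `canonical_converter_name` and the table `UTF8_ALIASES` are hypothetical names introduced here, not part of ICU, and the sketch only ignores case; ICU's own name matching may be more lenient.

```python
# Aliases copied from the list above. All names defined here are hypothetical.
UTF8_ALIASES = (
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
)

# Case-insensitive lookup table from alias to the internal converter name.
_ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_converter_name(name):
    """Return "UTF-8" if name is one of the listed aliases, else None."""
    return _ALIAS_TO_CANONICAL.get(name.strip().lower())


if __name__ == "__main__":
    print(canonical_converter_name("IBM-1208"))  # UTF-8
    print(canonical_converter_name("latin-1"))   # None
```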


Codepage Layout

Currently showing the codepage starting with the bytes F485A2

Rows give the high nibble of the final byte, columns the low nibble; each cell is the resulting code point in hexadecimal.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     105880 105881 105882 105883 105884 105885 105886 105887 105888 105889 10588A 10588B 10588C 10588D 10588E 10588F
90     105890 105891 105892 105893 105894 105895 105896 105897 105898 105899 10589A 10589B 10589C 10589D 10589E 10589F
A0     1058A0 1058A1 1058A2 1058A3 1058A4 1058A5 1058A6 1058A7 1058A8 1058A9 1058AA 1058AB 1058AC 1058AD 1058AE 1058AF
B0     1058B0 1058B1 1058B2 1058B3 1058B4 1058B5 1058B6 1058B7 1058B8 1058B9 1058BA 1058BB 1058BC 1058BD 1058BE 1058BF
C0-F0  (empty)
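
The mapping in this layout can be reproduced with a short sketch: it appends a final byte to the lead bytes F4 85 A2 and decodes the four-byte sequence, then repeats the calculation with plain bit arithmetic. The function names are illustrative, and Python's built-in UTF-8 codec stands in for the ICU converter here.

```python
LEAD_BYTES = bytes.fromhex("F485A2")  # the three lead bytes named above


def decode_cell(final_byte):
    """Decode F4 85 A2 <final_byte> as UTF-8 and return the code point."""
    text = (LEAD_BYTES + bytes([final_byte])).decode("utf-8")
    return ord(text)


def code_point_by_hand(final_byte):
    """Same value from bit arithmetic: 3 payload bits from the first byte,
    then 6 payload bits from each of the three continuation bytes."""
    b0, b1, b2 = LEAD_BYTES
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (final_byte & 0x3F)
    )


if __name__ == "__main__":
    for final_byte in (0x80, 0x8A, 0xBF):
        print(f"{final_byte:02X} -> {decode_cell(final_byte):06X}")
    # 80 -> 105880
    # 8A -> 10588A
    # BF -> 1058BF
```

Final bytes outside 80 to BF are not valid continuation bytes, so decoding them raises `UnicodeDecodeError`; this matches the empty rows in the layout.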

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
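
A few of these properties can be checked with a small sketch. It assumes that a UChar is one UTF-16 code unit, as in ICU4C, and it uses Python's built-in codecs rather than ICU; the helper name `utf8_bytes_per_utf16_unit` is made up for this example.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes listed above


def utf8_bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


def ascii_range_is_compatible():
    """True if bytes 0x20..0x7E decode to the code points with the same value."""
    return all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))


if __name__ == "__main__":
    # The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
    print(SUBSTITUTION.decode("utf-8") == "\uFFFD")   # True
    print(utf8_bytes_per_utf16_unit("A"))             # 1.0
    print(utf8_bytes_per_utf16_unit("\u20AC"))        # 3.0
    print(utf8_bytes_per_utf16_unit("\U00105880"))    # 2.0
    print(ascii_range_is_compatible())                # True
```

A supplementary character takes four UTF-8 bytes but two UTF-16 code units, which is why the per-UChar maximum stays at 3 rather than 4.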

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
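
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of that rule, using Python's strict UTF-8 encoder as a stand-in for the ICU converter (the function name `is_representable` is hypothetical):

```python
def is_representable(code_point):
    """True if the code point can be encoded as UTF-8, i.e. it is not a surrogate."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder rejects lone surrogates.
        return False


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
    # U+D7FF: True
    # U+D800: False
    # U+DFFF: False
    # U+E000: True
    # U+10FFFF: True
```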