International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F090A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𐥀
10940
𐥁
10941
𐥂
10942
𐥃
10943
𐥄
10944
𐥅
10945
𐥆
10946
𐥇
10947
𐥈
10948
𐥉
10949
𐥊
1094A
𐥋
1094B
𐥌
1094C
𐥍
1094D
𐥎
1094E
𐥏
1094F
80
90
𐥐
10950
𐥑
10951
𐥒
10952
𐥓
10953
𐥔
10954
𐥕
10955
𐥖
10956
𐥗
10957
𐥘
10958
𐥙
10959
𐥚
1095A
𐥛
1095B
𐥜
1095C
𐥝
1095D
𐥞
1095E
𐥟
1095F
90
A0
𐥠
10960
𐥡
10961
𐥢
10962
𐥣
10963
𐥤
10964
𐥥
10965
𐥦
10966
𐥧
10967
𐥨
10968
𐥩
10969
𐥪
1096A
𐥫
1096B
𐥬
1096C
𐥭
1096D
𐥮
1096E
𐥯
1096F
A0
B0
𐥰
10970
𐥱
10971
𐥲
10972
𐥳
10973
𐥴
10974
𐥵
10975
𐥶
10976
𐥷
10977
𐥸
10978
𐥹
10979
𐥺
1097A
𐥻
1097B
𐥼
1097C
𐥽
1097D
𐥾
1097E
𐥿
1097F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]