International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
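
As a rough illustration of how an alias list like this can be used, the sketch below resolves any listed alias to the internal converter name. The alias strings are copied from the list above; the `normalize` and `resolve` helpers are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption rather than ICU's own implementation.

```python
# Aliases copied from the list above. The lookup logic is an illustrative
# simplification, not ICU's implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every normalized alias maps to the single internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(resolve("IBM_1208"))
print(resolve("Windows 65001"))
print(resolve("latin-1"))
```

Normalizing before the lookup keeps the table small, since spelling variants of one alias collapse to the same key.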


Codepage Layout

Currently showing the codepage starting with the bytes F2 92 A1

Each cell gives the code point (hex) for the trailing byte formed by adding the row and column labels. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  92840 92841 92842 92843 92844 92845 92846 92847 92848 92849 9284A 9284B 9284C 9284D 9284E 9284F
90  92850 92851 92852 92853 92854 92855 92856 92857 92858 92859 9285A 9285B 9285C 9285D 9285E 9285F
A0  92860 92861 92862 92863 92864 92865 92866 92867 92868 92869 9286A 9286B 9286C 9286D 9286E 9286F
B0  92870 92871 92872 92873 92874 92875 92876 92877 92878 92879 9287A 9287B 9287C 9287D 9287E 9287F
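
The code points in the layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below recomputes them by hand and cross-checks against Python's built-in decoder. The helper names `decode_by_hand` and `layout_row` are made up for this example, and the code assumes well-formed input.

```python
LEAD = bytes([0xF2, 0x92, 0xA1])  # the three leading bytes of this codepage view


def decode_by_hand(seq: bytes) -> int:
    """Combine the payload bits of a well-formed 4-byte UTF-8 sequence."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trailing byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_row(high_nibble: int) -> list:
    """Code points for trailing bytes high_nibble+0x0 .. high_nibble+0xF."""
    return [decode_by_hand(LEAD + bytes([high_nibble + low])) for low in range(16)]


# Print the four populated rows of the layout.
for row in (0x80, 0x90, 0xA0, 0xB0):
    print(f"{row:02X}", " ".join(f"{cp:05X}" for cp in layout_row(row)))

# Cross-check every cell against Python's built-in UTF-8 decoder.
for trail in range(0x80, 0xC0):
    seq = LEAD + bytes([trail])
    assert ord(seq.decode("utf-8")) == decode_by_hand(seq)
```

Only trailing bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout stay empty.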

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
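
Two of the values above can be checked with a short sketch. It assumes that a UChar is one 16-bit UTF-16 code unit, so a supplementary character counts as two UChars. The helper `bytes_per_utf16_unit` is a made-up name for this example, and the code uses Python's codecs rather than ICU.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # value reported in the table above

# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"


def bytes_per_utf16_unit(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# One sample each for 1-, 2-, 3- and 4-byte UTF-8 sequences.
for ch in ("A", "\u00E9", "\u20AC", "\U00092840"):
    utf8_len, units = bytes_per_utf16_unit(ch)
    print(f"U+{ord(ch):05X}: {utf8_len} bytes / {units} UTF-16 unit(s)")
```

Under that assumption, a four-byte sequence spans two UTF-16 code units, so it averages two bytes per unit, and the three-byte BMP case gives the per-UChar maximum of 3.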

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
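
The set above excludes only the surrogate range. As a minimal sketch, the check below expresses that set as a predicate and compares it with what Python's strict UTF-8 encoder accepts. The function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def is_representable(code_point: int) -> bool:
    """True for any Unicode code point outside the surrogate range D800..DFFF."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def strict_encode_ok(code_point: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range plus one supplementary code point.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x92840):
    assert is_representable(cp) == strict_encode_ok(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```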