International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29FA1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򟡀
9F840
򟡁
9F841
򟡂
9F842
򟡃
9F843
򟡄
9F844
򟡅
9F845
򟡆
9F846
򟡇
9F847
򟡈
9F848
򟡉
9F849
򟡊
9F84A
򟡋
9F84B
򟡌
9F84C
򟡍
9F84D
򟡎
9F84E
򟡏
9F84F
80
90
򟡐
9F850
򟡑
9F851
򟡒
9F852
򟡓
9F853
򟡔
9F854
򟡕
9F855
򟡖
9F856
򟡗
9F857
򟡘
9F858
򟡙
9F859
򟡚
9F85A
򟡛
9F85B
򟡜
9F85C
򟡝
9F85D
򟡞
9F85E
򟡟
9F85F
90
A0
򟡠
9F860
򟡡
9F861
򟡢
9F862
򟡣
9F863
򟡤
9F864
򟡥
9F865
򟡦
9F866
򟡧
9F867
򟡨
9F868
򟡩
9F869
򟡪
9F86A
򟡫
9F86B
򟡬
9F86C
򟡭
9F86D
򟡮
9F86E
򟡯
9F86F
A0
B0
򟡰
9F870
򟡱
9F871
򟡲
9F872
򟡳
9F873
򟡴
9F874
򟡵
9F875
򟡶
9F876
򟡷
9F877
򟡸
9F878
򟡹
9F879
򟡺
9F87A
򟡻
9F87B
򟡼
9F87C
򟡽
9F87D
򟡾
9F87E
򟡿
9F87F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]