International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2B1A1

Each cell gives the Unicode code point (hex) reached by appending the final byte (row + column) to F2B1A1.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00  (empty)
10  (empty)
20  (empty)
30  (empty)
40  (empty)
50  (empty)
60  (empty)
70  (empty)
80  B1840 B1841 B1842 B1843 B1844 B1845 B1846 B1847 B1848 B1849 B184A B184B B184C B184D B184E B184F
90  B1850 B1851 B1852 B1853 B1854 B1855 B1856 B1857 B1858 B1859 B185A B185B B185C B185D B185E B185F
A0  B1860 B1861 B1862 B1863 B1864 B1865 B1866 B1867 B1868 B1869 B186A B186B B186C B186D B186E B186F
B0  B1870 B1871 B1872 B1873 B1874 B1875 B1876 B1877 B1878 B1879 B187A B187B B187C B187D B187E B187F
C0  (empty)
D0  (empty)
E0  (empty)
F0  (empty)
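
The populated rows of the layout can be reproduced by hand from the UTF-8 bit layout. The following is a minimal sketch, not ICU code: the helper names `decode_utf8_4byte` and `layout_rows` are hypothetical, and it assumes the prefix F2 B1 A1 shown above followed by one trailing byte in the range 80 to BF.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point."""
    # The lead byte 11110xxx carries 3 payload bits; each trailing byte
    # 10xxxxxx carries 6 payload bits.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def layout_rows(prefix=(0xF2, 0xB1, 0xA1)):
    """Map each row label (high nibble of the last byte) to its 16 code points."""
    rows = {}
    for last in range(0x80, 0xC0):  # only 80..BF are valid trailing bytes
        rows.setdefault(last & 0xF0, []).append(decode_utf8_4byte(*prefix, last))
    return rows


for label, code_points in layout_rows().items():
    print(f"{label:02X}", " ".join(f"{cp:05X}" for cp in code_points))

# Cross-check one cell against Python's built-in UTF-8 codec.
assert bytes([0xF2, 0xB1, 0xA1, 0x80]).decode("utf-8") == chr(0xB1840)
```

Rows 00 to 70 and C0 to F0 stay empty because a byte outside 80 to BF cannot complete the sequence as a trailing byte.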

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
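
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a full code point. The sketch below uses only Python's standard codecs, not ICU; the helper `utf8_bytes_and_utf16_units` is a hypothetical name. It illustrates one plausible reading of the 1 to 3 range: a four-byte UTF-8 sequence corresponds to two UTF-16 code units, so it never exceeds three bytes per unit. It also shows that the listed substitution bytes are the UTF-8 form of U+FFFD.

```python
def utf8_bytes_and_utf16_units(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


for ch in ["A", "\u00e9", "\u20ac", "\U000B1840"]:
    n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} UTF-8 byte(s), {n_units} UTF-16 unit(s)")

# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")

# Python's codec inserts the same character for an undecodable byte
# when asked to replace errors instead of raising.
repaired = b"abc\xff".decode("utf-8", errors="replace")
```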

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
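
The set pattern above excludes only the surrogate range U+D800 to U+DFFF. This can be checked empirically with Python's strict UTF-8 codec, which is used here as a stand-in and is not the ICU converter; `utf8_representable` is a hypothetical helper name.

```python
def utf8_representable(code_point):
    """Return True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by strict UTF-8 encoders.
        return False


# Collect every code point in the Basic Multilingual Plane that fails.
unrepresentable = [cp for cp in range(0x10000) if not utf8_representable(cp)]
print(f"U+{unrepresentable[0]:04X}..U+{unrepresentable[-1]:04X}",
      f"({len(unrepresentable)} code points)")
```

Code points on either side of the gap, such as U+D7FF and U+E000, and supplementary code points such as U+B1840 encode without error.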