International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A1A0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򡠀
A1800
򡠁
A1801
򡠂
A1802
򡠃
A1803
򡠄
A1804
򡠅
A1805
򡠆
A1806
򡠇
A1807
򡠈
A1808
򡠉
A1809
򡠊
A180A
򡠋
A180B
򡠌
A180C
򡠍
A180D
򡠎
A180E
򡠏
A180F
80
90
򡠐
A1810
򡠑
A1811
򡠒
A1812
򡠓
A1813
򡠔
A1814
򡠕
A1815
򡠖
A1816
򡠗
A1817
򡠘
A1818
򡠙
A1819
򡠚
A181A
򡠛
A181B
򡠜
A181C
򡠝
A181D
򡠞
A181E
򡠟
A181F
90
A0
򡠠
A1820
򡠡
A1821
򡠢
A1822
򡠣
A1823
򡠤
A1824
򡠥
A1825
򡠦
A1826
򡠧
A1827
򡠨
A1828
򡠩
A1829
򡠪
A182A
򡠫
A182B
򡠬
A182C
򡠭
A182D
򡠮
A182E
򡠯
A182F
A0
B0
򡠰
A1830
򡠱
A1831
򡠲
A1832
򡠳
A1833
򡠴
A1834
򡠵
A1835
򡠶
A1836
򡠷
A1837
򡠸
A1838
򡠹
A1839
򡠺
A183A
򡠻
A183B
򡠼
A183C
򡠽
A183D
򡠾
A183E
򡠿
A183F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]