International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2B9A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򹡀
B9840
򹡁
B9841
򹡂
B9842
򹡃
B9843
򹡄
B9844
򹡅
B9845
򹡆
B9846
򹡇
B9847
򹡈
B9848
򹡉
B9849
򹡊
B984A
򹡋
B984B
򹡌
B984C
򹡍
B984D
򹡎
B984E
򹡏
B984F
80
90
򹡐
B9850
򹡑
B9851
򹡒
B9852
򹡓
B9853
򹡔
B9854
򹡕
B9855
򹡖
B9856
򹡗
B9857
򹡘
B9858
򹡙
B9859
򹡚
B985A
򹡛
B985B
򹡜
B985C
򹡝
B985D
򹡞
B985E
򹡟
B985F
90
A0
򹡠
B9860
򹡡
B9861
򹡢
B9862
򹡣
B9863
򹡤
B9864
򹡥
B9865
򹡦
B9866
򹡧
B9867
򹡨
B9868
򹡩
B9869
򹡪
B986A
򹡫
B986B
򹡬
B986C
򹡭
B986D
򹡮
B986E
򹡯
B986F
A0
B0
򹡰
B9870
򹡱
B9871
򹡲
B9872
򹡳
B9873
򹡴
B9874
򹡵
B9875
򹡶
B9876
򹡷
B9877
򹡸
B9878
򹡹
B9879
򹡺
B987A
򹡻
B987B
򹡼
B987C
򹡽
B987D
򹡾
B987E
򹡿
B987F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]