International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F287A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򇡀
87840
򇡁
87841
򇡂
87842
򇡃
87843
򇡄
87844
򇡅
87845
򇡆
87846
򇡇
87847
򇡈
87848
򇡉
87849
򇡊
8784A
򇡋
8784B
򇡌
8784C
򇡍
8784D
򇡎
8784E
򇡏
8784F
80
90
򇡐
87850
򇡑
87851
򇡒
87852
򇡓
87853
򇡔
87854
򇡕
87855
򇡖
87856
򇡗
87857
򇡘
87858
򇡙
87859
򇡚
8785A
򇡛
8785B
򇡜
8785C
򇡝
8785D
򇡞
8785E
򇡟
8785F
90
A0
򇡠
87860
򇡡
87861
򇡢
87862
򇡣
87863
򇡤
87864
򇡥
87865
򇡦
87866
򇡧
87867
򇡨
87868
򇡩
87869
򇡪
8786A
򇡫
8786B
򇡬
8786C
򇡭
8786D
򇡮
8786E
򇡯
8786F
A0
B0
򇡰
87870
򇡱
87871
򇡲
87872
򇡳
87873
򇡴
87874
򇡵
87875
򇡶
87876
򇡷
87877
򇡸
87878
򇡹
87879
򇡺
8787A
򇡻
8787B
򇡼
8787C
򇡽
8787D
򇡾
8787E
򇡿
8787F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]