International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F281A1

Each cell gives the Unicode code point (hex) completed by that trailing byte (row + column):

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     81840  81841  81842  81843  81844  81845  81846  81847  81848  81849  8184A  8184B  8184C  8184D  8184E  8184F
90     81850  81851  81852  81853  81854  81855  81856  81857  81858  81859  8185A  8185B  8185C  8185D  8185E  8185F
A0     81860  81861  81862  81863  81864  81865  81866  81867  81868  81869  8186A  8186B  8186C  8186D  8186E  8186F
B0     81870  81871  81872  81873  81874  81875  81876  81877  81878  81879  8187A  8187B  8187C  8187D  8187E  8187F
C0-F0  (empty)
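
The layout for the lead bytes F281A1 can be reproduced outside ICU. The following is a minimal sketch that uses Python's built-in UTF-8 codec rather than an ICU converter; the function name `codepage_rows` is hypothetical and chosen only for illustration. It tries every possible trailing byte and keeps the ones that complete a valid sequence.

```python
def codepage_rows(lead: bytes) -> dict:
    """Group the code points reachable from `lead` by trailing-byte row (0x80, 0x90, ...)."""
    rows = {}
    for trail in range(0x100):
        try:
            text = (lead + bytes([trail])).decode("utf-8")
        except UnicodeDecodeError:
            continue  # this trailing byte does not complete a valid sequence
        rows.setdefault(trail & 0xF0, []).append(ord(text))
    return rows


# Assumption: Python's strict UTF-8 decoder stands in for the ICU converter here.
rows = codepage_rows(bytes.fromhex("F281A1"))
for row, code_points in rows.items():
    print(f"{row:02X}", " ".join(f"{cp:05X}" for cp in code_points))
```

Only trailing bytes 80 through BF decode, which is why the other rows of the layout are empty.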

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
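
The byte-per-UChar figures and the substitution character can be checked with a short sketch. It assumes that a UChar is one 16-bit UTF-16 code unit, and it uses Python's codecs, not ICU; `bytes_per_uchar` is a hypothetical helper name.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


# Samples: ASCII, a 2-byte character, a 3-byte character, a supplementary character.
samples = ["A", "\u00e9", "\u20ac", "\U00081840"]
ratios = {f"U+{ord(c):04X}": bytes_per_uchar(c) for c in samples}
print(ratios)

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")
print(substitution)
```

A supplementary character takes 4 bytes in UTF-8 but spans 2 UChars, so its ratio is 2 and the per-UChar maximum stays at 3.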


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
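
The set above reads as "every code point except the surrogates D800 through DFFF". A minimal sketch of that membership test follows, cross-checked against Python's own UTF-8 encoder as a stand-in for the ICU converter; both function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    # Membership in the set: any code point that is not a surrogate.
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_as_utf8(cp: int) -> bool:
    # Assumption: Python's strict encoder rejects exactly the surrogate range.
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the excluded range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x81840, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), encodes_as_utf8(cp))
```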