International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B4A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󴡀
F4840
󴡁
F4841
󴡂
F4842
󴡃
F4843
󴡄
F4844
󴡅
F4845
󴡆
F4846
󴡇
F4847
󴡈
F4848
󴡉
F4849
󴡊
F484A
󴡋
F484B
󴡌
F484C
󴡍
F484D
󴡎
F484E
󴡏
F484F
80
90
󴡐
F4850
󴡑
F4851
󴡒
F4852
󴡓
F4853
󴡔
F4854
󴡕
F4855
󴡖
F4856
󴡗
F4857
󴡘
F4858
󴡙
F4859
󴡚
F485A
󴡛
F485B
󴡜
F485C
󴡝
F485D
󴡞
F485E
󴡟
F485F
90
A0
󴡠
F4860
󴡡
F4861
󴡢
F4862
󴡣
F4863
󴡤
F4864
󴡥
F4865
󴡦
F4866
󴡧
F4867
󴡨
F4868
󴡩
F4869
󴡪
F486A
󴡫
F486B
󴡬
F486C
󴡭
F486D
󴡮
F486E
󴡯
F486F
A0
B0
󴡰
F4870
󴡱
F4871
󴡲
F4872
󴡳
F4873
󴡴
F4874
󴡵
F4875
󴡶
F4876
󴡷
F4877
󴡸
F4878
󴡹
F4879
󴡺
F487A
󴡻
F487B
󴡼
F487C
󴡽
F487D
󴡾
F487E
󴡿
F487F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]