International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28CA1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򌡀
8C840
򌡁
8C841
򌡂
8C842
򌡃
8C843
򌡄
8C844
򌡅
8C845
򌡆
8C846
򌡇
8C847
򌡈
8C848
򌡉
8C849
򌡊
8C84A
򌡋
8C84B
򌡌
8C84C
򌡍
8C84D
򌡎
8C84E
򌡏
8C84F
80
90
򌡐
8C850
򌡑
8C851
򌡒
8C852
򌡓
8C853
򌡔
8C854
򌡕
8C855
򌡖
8C856
򌡗
8C857
򌡘
8C858
򌡙
8C859
򌡚
8C85A
򌡛
8C85B
򌡜
8C85C
򌡝
8C85D
򌡞
8C85E
򌡟
8C85F
90
A0
򌡠
8C860
򌡡
8C861
򌡢
8C862
򌡣
8C863
򌡤
8C864
򌡥
8C865
򌡦
8C866
򌡧
8C867
򌡨
8C868
򌡩
8C869
򌡪
8C86A
򌡫
8C86B
򌡬
8C86C
򌡭
8C86D
򌡮
8C86E
򌡯
8C86F
A0
B0
򌡰
8C870
򌡱
8C871
򌡲
8C872
򌡳
8C873
򌡴
8C874
򌡵
8C875
򌡶
8C876
򌡷
8C877
򌡸
8C878
򌡹
8C879
򌡺
8C87A
򌡻
8C87B
򌡼
8C87C
򌡽
8C87D
򌡾
8C87E
򌡿
8C87F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]