International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F190A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񐡀
50840
񐡁
50841
񐡂
50842
񐡃
50843
񐡄
50844
񐡅
50845
񐡆
50846
񐡇
50847
񐡈
50848
񐡉
50849
񐡊
5084A
񐡋
5084B
񐡌
5084C
񐡍
5084D
񐡎
5084E
񐡏
5084F
80
90
񐡐
50850
񐡑
50851
񐡒
50852
񐡓
50853
񐡔
50854
񐡕
50855
񐡖
50856
񐡗
50857
񐡘
50858
񐡙
50859
񐡚
5085A
񐡛
5085B
񐡜
5085C
񐡝
5085D
񐡞
5085E
񐡟
5085F
90
A0
񐡠
50860
񐡡
50861
񐡢
50862
񐡣
50863
񐡤
50864
񐡥
50865
񐡦
50866
񐡧
50867
񐡨
50868
񐡩
50869
񐡪
5086A
񐡫
5086B
񐡬
5086C
񐡭
5086D
񐡮
5086E
񐡯
5086F
A0
B0
񐡰
50870
񐡱
50871
񐡲
50872
񐡳
50873
񐡴
50874
񐡵
50875
񐡶
50876
񐡷
50877
񐡸
50878
񐡹
50879
񐡺
5087A
񐡻
5087B
񐡼
5087C
񐡽
5087D
񐡾
5087E
񐡿
5087F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]