International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0 91 BE

Each row label is the final byte of the sequence with its low digit set to 0 (row 80 covers final bytes 80 to 8F). Each entry shows the character followed by its code point. Rows 00 to 70 and C0 to F0 contain no characters.

80  𑾀 11F80  𑾁 11F81  𑾂 11F82  𑾃 11F83  𑾄 11F84  𑾅 11F85  𑾆 11F86  𑾇 11F87
    𑾈 11F88  𑾉 11F89  𑾊 11F8A  𑾋 11F8B  𑾌 11F8C  𑾍 11F8D  𑾎 11F8E  𑾏 11F8F
90  𑾐 11F90  𑾑 11F91  𑾒 11F92  𑾓 11F93  𑾔 11F94  𑾕 11F95  𑾖 11F96  𑾗 11F97
    𑾘 11F98  𑾙 11F99  𑾚 11F9A  𑾛 11F9B  𑾜 11F9C  𑾝 11F9D  𑾞 11F9E  𑾟 11F9F
A0  𑾠 11FA0  𑾡 11FA1  𑾢 11FA2  𑾣 11FA3  𑾤 11FA4  𑾥 11FA5  𑾦 11FA6  𑾧 11FA7
    𑾨 11FA8  𑾩 11FA9  𑾪 11FAA  𑾫 11FAB  𑾬 11FAC  𑾭 11FAD  𑾮 11FAE  𑾯 11FAF
B0  𑾰 11FB0  𑾱 11FB1  𑾲 11FB2  𑾳 11FB3  𑾴 11FB4  𑾵 11FB5  𑾶 11FB6  𑾷 11FB7
    𑾸 11FB8  𑾹 11FB9  𑾺 11FBA  𑾻 11FBB  𑾼 11FBC  𑾽 11FBD  𑾾 11FBE  𑾿 11FBF
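
The mapping from the byte prefix F0 91 BE to the code points U+11F80 through U+11FBF follows from the standard UTF-8 bit layout for four-byte sequences. The sketch below decodes such a sequence by hand and rebuilds the four populated rows of the grid. It is an illustration only, not the code the demo runs; the names `decode_utf8_4byte`, `PREFIX`, and `codepage_rows` are hypothetical. Because a trail byte must lie in the range 80 to BF, only rows 80, 90, A0, and B0 can hold characters.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Decode one four-byte UTF-8 sequence into a code point (hand-rolled sketch)."""
    if not 0xF0 <= b0 <= 0xF4:
        raise ValueError("lead byte is not a four-byte lead")
    for trail in (b1, b2, b3):
        if not 0x80 <= trail <= 0xBF:
            raise ValueError("trail byte outside 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    cp = ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


# The three-byte prefix shown on this page (assumed fixed for the illustration).
PREFIX = bytes([0xF0, 0x91, 0xBE])


def codepage_rows(prefix=PREFIX):
    """Return {row label: [16 code points]} for the valid final bytes 80..BF."""
    rows = {}
    for high in range(0x80, 0xC0, 0x10):
        rows[high] = [decode_utf8_4byte(*prefix, high | low) for low in range(16)]
    return rows


if __name__ == "__main__":
    for label, cps in codepage_rows().items():
        print(f"{label:02X}", " ".join(f"{cp:04X}" for cp in cps))
```

Python's built-in codec gives the same answer for any single cell, for example `bytes([0xF0, 0x91, 0xBE, 0x80]).decode("utf-8")`, which is a convenient cross-check on the manual bit arithmetic.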

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
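
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. That is why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to a surrogate pair, that is, two UChars. The sketch below checks this using Python's standard codecs in place of ICU; the helper name `utf8_bytes_and_utf16_units` is hypothetical.

```python
def utf8_bytes_and_utf16_units(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for a one-character string."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # two bytes per code unit
    return utf8_len, utf16_units


if __name__ == "__main__":
    for ch in ("A", "\u00E9", "\u20AC", chr(0x11F80)):
        n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
        print(f"U+{ord(ch):04X}: {n_bytes} bytes, {n_units} UTF-16 unit(s)")

    # The substitution character bytes listed above decode to U+FFFD.
    print(hex(ord(b"\xEF\xBF\xBD".decode("utf-8"))))
```

Dividing bytes by code units gives at most 3 for characters in the Basic Multilingual Plane and exactly 2 for supplementary characters such as those in the codepage layout on this page.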

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
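
The set pattern above reads as "every code point except the surrogate range U+D800 to U+DFFF". As a rough check, Python's strict UTF-8 codec applies the same rule and refuses to encode a lone surrogate. This is a sketch using Python rather than ICU, and `is_representable` is a hypothetical helper name.

```python
def is_representable(cp):
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")  # strict mode rejects surrogates
    except UnicodeEncodeError:
        return False
    return True


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
```

Only the two surrogate boundary values in the sample list are rejected; the code points immediately on either side of the range encode normally.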