International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F481B9

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􁹀
101E40
􁹁
101E41
􁹂
101E42
􁹃
101E43
􁹄
101E44
􁹅
101E45
􁹆
101E46
􁹇
101E47
􁹈
101E48
􁹉
101E49
􁹊
101E4A
􁹋
101E4B
􁹌
101E4C
􁹍
101E4D
􁹎
101E4E
􁹏
101E4F
80
90
􁹐
101E50
􁹑
101E51
􁹒
101E52
􁹓
101E53
􁹔
101E54
􁹕
101E55
􁹖
101E56
􁹗
101E57
􁹘
101E58
􁹙
101E59
􁹚
101E5A
􁹛
101E5B
􁹜
101E5C
􁹝
101E5D
􁹞
101E5E
􁹟
101E5F
90
A0
􁹠
101E60
􁹡
101E61
􁹢
101E62
􁹣
101E63
􁹤
101E64
􁹥
101E65
􁹦
101E66
􁹧
101E67
􁹨
101E68
􁹩
101E69
􁹪
101E6A
􁹫
101E6B
􁹬
101E6C
􁹭
101E6D
􁹮
101E6E
􁹯
101E6F
A0
B0
􁹰
101E70
􁹱
101E71
􁹲
101E72
􁹳
101E73
􁹴
101E74
􁹵
101E75
􁹶
101E76
􁹷
101E77
􁹸
101E78
􁹹
101E79
􁹺
101E7A
􁹻
101E7B
􁹼
101E7C
􁹽
101E7D
􁹾
101E7E
􁹿
101E7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]