International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F480B9

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􀹀
100E40
􀹁
100E41
􀹂
100E42
􀹃
100E43
􀹄
100E44
􀹅
100E45
􀹆
100E46
􀹇
100E47
􀹈
100E48
􀹉
100E49
􀹊
100E4A
􀹋
100E4B
􀹌
100E4C
􀹍
100E4D
􀹎
100E4E
􀹏
100E4F
80
90
􀹐
100E50
􀹑
100E51
􀹒
100E52
􀹓
100E53
􀹔
100E54
􀹕
100E55
􀹖
100E56
􀹗
100E57
􀹘
100E58
􀹙
100E59
􀹚
100E5A
􀹛
100E5B
􀹜
100E5C
􀹝
100E5D
􀹞
100E5E
􀹟
100E5F
90
A0
􀹠
100E60
􀹡
100E61
􀹢
100E62
􀹣
100E63
􀹤
100E64
􀹥
100E65
􀹦
100E66
􀹧
100E67
􀹨
100E68
􀹩
100E69
􀹪
100E6A
􀹫
100E6B
􀹬
100E6C
􀹭
100E6D
􀹮
100E6E
􀹯
100E6F
A0
B0
􀹰
100E70
􀹱
100E71
􀹲
100E72
􀹳
100E73
􀹴
100E74
􀹵
100E75
􀹶
100E76
􀹷
100E77
􀹸
100E78
􀹹
100E79
􀹺
100E7A
􀹻
100E7B
􀹼
100E7C
􀹽
100E7D
􀹾
100E7E
􀹿
100E7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]