International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F298B9

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򘹀
98E40
򘹁
98E41
򘹂
98E42
򘹃
98E43
򘹄
98E44
򘹅
98E45
򘹆
98E46
򘹇
98E47
򘹈
98E48
򘹉
98E49
򘹊
98E4A
򘹋
98E4B
򘹌
98E4C
򘹍
98E4D
򘹎
98E4E
򘹏
98E4F
80
90
򘹐
98E50
򘹑
98E51
򘹒
98E52
򘹓
98E53
򘹔
98E54
򘹕
98E55
򘹖
98E56
򘹗
98E57
򘹘
98E58
򘹙
98E59
򘹚
98E5A
򘹛
98E5B
򘹜
98E5C
򘹝
98E5D
򘹞
98E5E
򘹟
98E5F
90
A0
򘹠
98E60
򘹡
98E61
򘹢
98E62
򘹣
98E63
򘹤
98E64
򘹥
98E65
򘹦
98E66
򘹧
98E67
򘹨
98E68
򘹩
98E69
򘹪
98E6A
򘹫
98E6B
򘹬
98E6C
򘹭
98E6D
򘹮
98E6E
򘹯
98E6F
A0
B0
򘹰
98E70
򘹱
98E71
򘹲
98E72
򘹳
98E73
򘹴
98E74
򘹵
98E75
򘹶
98E76
򘹷
98E77
򘹸
98E78
򘹹
98E79
򘹺
98E7A
򘹻
98E7B
򘹼
98E7C
򘹽
98E7D
򘹾
98E7E
򘹿
98E7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]