International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F289AD

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򉭀
89B40
򉭁
89B41
򉭂
89B42
򉭃
89B43
򉭄
89B44
򉭅
89B45
򉭆
89B46
򉭇
89B47
򉭈
89B48
򉭉
89B49
򉭊
89B4A
򉭋
89B4B
򉭌
89B4C
򉭍
89B4D
򉭎
89B4E
򉭏
89B4F
80
90
򉭐
89B50
򉭑
89B51
򉭒
89B52
򉭓
89B53
򉭔
89B54
򉭕
89B55
򉭖
89B56
򉭗
89B57
򉭘
89B58
򉭙
89B59
򉭚
89B5A
򉭛
89B5B
򉭜
89B5C
򉭝
89B5D
򉭞
89B5E
򉭟
89B5F
90
A0
򉭠
89B60
򉭡
89B61
򉭢
89B62
򉭣
89B63
򉭤
89B64
򉭥
89B65
򉭦
89B66
򉭧
89B67
򉭨
89B68
򉭩
89B69
򉭪
89B6A
򉭫
89B6B
򉭬
89B6C
򉭭
89B6D
򉭮
89B6E
򉭯
89B6F
A0
B0
򉭰
89B70
򉭱
89B71
򉭲
89B72
򉭳
89B73
򉭴
89B74
򉭵
89B75
򉭶
89B76
򉭷
89B77
򉭸
89B78
򉭹
89B79
򉭺
89B7A
򉭻
89B7B
򉭼
89B7C
򉭽
89B7D
򉭾
89B7E
򉭿
89B7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]