International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
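
The alias list above can be read as a many-to-one mapping onto the internal converter name. A minimal Python sketch follows; the `ALIASES` table and the `canonical_name` helper are hypothetical names introduced for illustration, and the plain case-insensitive comparison is a simplifying assumption, not ICU's own lookup logic.

```python
# Aliases copied from the list above, keyed by the internal converter name.
# This table and the helper below are illustrative only, not part of ICU.
ALIASES = {
    "UTF-8": [
        "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
        "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
        "windows-65001", "cp1208", "x-UTF_8J",
        "unicode-1-1-utf-8", "unicode-2-0-utf-8",
    ],
}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    wanted = name.strip().lower()
    for canonical, aliases in ALIASES.items():
        if wanted in (alias.lower() for alias in aliases):
            return canonical
    return None


print(canonical_name("windows-65001"))
print(canonical_name("IBM-1208"))
```

Both calls resolve to the same internal name, which is the behavior the alias table describes.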


Codepage Layout

Currently showing the codepage starting with the bytes F3AAAD.

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     EAB40  EAB41  EAB42  EAB43  EAB44  EAB45  EAB46  EAB47  EAB48  EAB49  EAB4A  EAB4B  EAB4C  EAB4D  EAB4E  EAB4F
90     EAB50  EAB51  EAB52  EAB53  EAB54  EAB55  EAB56  EAB57  EAB58  EAB59  EAB5A  EAB5B  EAB5C  EAB5D  EAB5E  EAB5F
A0     EAB60  EAB61  EAB62  EAB63  EAB64  EAB65  EAB66  EAB67  EAB68  EAB69  EAB6A  EAB6B  EAB6C  EAB6D  EAB6E  EAB6F
B0     EAB70  EAB71  EAB72  EAB73  EAB74  EAB75  EAB76  EAB77  EAB78  EAB79  EAB7A  EAB7B  EAB7C  EAB7D  EAB7E  EAB7F
C0-F0  (empty)
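
The populated rows of the layout can be reproduced by appending each final byte to the lead bytes F3AAAD and decoding the four-byte sequence. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and `LEAD` and `code_point` are names chosen here for illustration.

```python
# Lead bytes taken from the page: "starting with the bytes F3AAAD".
LEAD = bytes.fromhex("F3AAAD")


def code_point(trail):
    """Decode LEAD plus one final byte and return the resulting code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


# Rebuild the four populated rows (final bytes 0x80 through 0xBF).
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{code_point(row + col):05X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

A final byte outside 80 to BF is not a valid UTF-8 continuation byte, so decoding fails for it, which is consistent with the other rows being empty.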

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
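
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. As a rough check with Python's built-in codecs (not ICU), the sketch below counts UTF-8 bytes and UTF-16 code units for a few sample characters and confirms that the substitution bytes are the UTF-8 form of U+FFFD. The `lengths` helper and the sample characters are choices made here for illustration.

```python
def lengths(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_bytes, utf16_units


# One sample for each UTF-8 sequence length, 1 through 4 bytes.
for ch in ("A", "\u00E9", "\uFFFD", "\U000EAB40"):
    utf8_bytes, utf16_units = lengths(ch)
    print(f"U+{ord(ch):04X}: {utf8_bytes} bytes, {utf16_units} UTF-16 unit(s)")

# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")
print(SUBSTITUTION == b"\xEF\xBF\xBD")

# Python's "replace" error handler substitutes the same character.
replaced = b"\xFF".decode("utf-8", errors="replace")
print(replaced == "\uFFFD")
```

A four-byte sequence takes two UTF-16 code units, so it averages two bytes per UChar; the three-byte forms are what set the maximum of 3.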

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
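
The set pattern above excludes only the surrogate range U+D800 to U+DFFF. A small sketch of the same boundary, assuming Python's strict built-in UTF-8 codec as a stand-in for the ICU converter; `is_representable` is a hypothetical helper name.

```python
def is_representable(cp):
    """Return True if code point cp can be encoded as strict UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe the edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Only the two surrogate code points are rejected; their neighbours on either side and the highest code point all encode.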