International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
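
The alias list above can be treated as a lookup table from alias to internal converter name. The following is a minimal sketch, not ICU's own implementation: the names `ALIASES` and `resolve` are hypothetical, and the case-insensitive comparison is a simplifying assumption rather than a reproduction of ICU's alias-matching rules.

```python
# Aliases copied from the list on this page; all map to the internal name "UTF-8".
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively (a simplification).
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def resolve(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return _LOOKUP.get(name.lower())


if __name__ == "__main__":
    for candidate in ("cp1208", "Windows-65001", "latin-1"):
        print(candidate, "->", resolve(candidate))
```

A real application would ask the converter library itself to resolve names; the table here only mirrors what this page lists.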


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F0 9A B5.

The row label plus the column label gives the final byte of the sequence. Rows 00-70 and C0-F0 contain no characters.

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
80  𚵀 1AD40  𚵁 1AD41  𚵂 1AD42  𚵃 1AD43  𚵄 1AD44  𚵅 1AD45  𚵆 1AD46  𚵇 1AD47  𚵈 1AD48  𚵉 1AD49  𚵊 1AD4A  𚵋 1AD4B  𚵌 1AD4C  𚵍 1AD4D  𚵎 1AD4E  𚵏 1AD4F
90  𚵐 1AD50  𚵑 1AD51  𚵒 1AD52  𚵓 1AD53  𚵔 1AD54  𚵕 1AD55  𚵖 1AD56  𚵗 1AD57  𚵘 1AD58  𚵙 1AD59  𚵚 1AD5A  𚵛 1AD5B  𚵜 1AD5C  𚵝 1AD5D  𚵞 1AD5E  𚵟 1AD5F
A0  𚵠 1AD60  𚵡 1AD61  𚵢 1AD62  𚵣 1AD63  𚵤 1AD64  𚵥 1AD65  𚵦 1AD66  𚵧 1AD67  𚵨 1AD68  𚵩 1AD69  𚵪 1AD6A  𚵫 1AD6B  𚵬 1AD6C  𚵭 1AD6D  𚵮 1AD6E  𚵯 1AD6F
B0  𚵰 1AD70  𚵱 1AD71  𚵲 1AD72  𚵳 1AD73  𚵴 1AD74  𚵵 1AD75  𚵶 1AD76  𚵷 1AD77  𚵸 1AD78  𚵹 1AD79  𚵺 1AD7A  𚵻 1AD7B  𚵼 1AD7C  𚵽 1AD7D  𚵾 1AD7E  𚵿 1AD7F
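
The mapping in this layout follows from the standard four-byte UTF-8 bit pattern: with the lead bytes F0 9A B5 fixed, only a final byte in the range 80-BF completes a valid sequence, which is why rows 80 through B0 are the only populated ones. The sketch below recomputes the code points by hand; the names `LEAD`, `decode_four_byte`, and `layout_rows` are hypothetical and exist only for illustration.

```python
# The three lead bytes shown on this page.
LEAD = bytes.fromhex("F09AB5")


def decode_four_byte(seq):
    """Decode a 4-byte UTF-8 sequence to a code point using bit arithmetic."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_rows():
    """Group code points by the high nibble of the final byte (the table row)."""
    rows = {}
    for last in range(0x80, 0xC0):  # valid continuation bytes only
        cp = decode_four_byte(LEAD + bytes([last]))
        rows.setdefault(last & 0xF0, []).append(cp)
    return rows


if __name__ == "__main__":
    for row, cps in layout_rows().items():
        print(f"{row:02X}", " ".join(f"{cp:05X}" for cp in cps))
```

The hand-written decoder agrees with Python's built-in codec, so `(LEAD + b"\x80").decode("utf-8")` yields the same code point as `decode_four_byte` for that sequence.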

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
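
The byte counts above are per UChar, which this sketch assumes to mean one UTF-16 code unit. Under that reading, a four-byte UTF-8 sequence spans two code units (a surrogate pair), so the worst case per unit is three bytes, not four. The helper names below are hypothetical, and the sample characters are arbitrary picks, one for each UTF-8 sequence length.

```python
def utf16_units(ch):
    """Number of UTF-16 code units needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def utf8_bytes_per_unit(ch):
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# One sample per UTF-8 length: 1, 2, 3, and 4 bytes.
SAMPLES = ["A", "\u00e9", "\u20ac", "\U0001AD40"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}", len(ch.encode("utf-8")), "bytes,",
              utf16_units(ch), "UTF-16 unit(s)")

    # The substitution character listed above is the UTF-8 form of U+FFFD.
    print("\ufffd".encode("utf-8").hex().upper())
```

The last line shows that the substitution bytes EF BF BD are simply U+FFFD (REPLACEMENT CHARACTER) encoded in UTF-8.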

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
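
The set above excludes only the surrogate range U+D800 to U+DFFF, so every other code point up to U+10FFFF is representable. As a hedged illustration, the sketch below expresses that set as a predicate and compares it against Python's own UTF-8 encoder, which also rejects lone surrogates by default. The function names are hypothetical.

```python
def is_representable(cp):
    """True if the code point is in the set [^\\uD800-\\uDFFF]."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates under the default strict error handler.
        return False


if __name__ == "__main__":
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x1AD40, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```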