International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
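
As an illustration of how such an alias list might be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The matching rule (ignore case and every character that is not a letter or digit) is a simplifying assumption, not a statement of ICU's exact comparison rules, and the names `normalize` and `canonical_name` are hypothetical helpers.

```python
# Aliases copied from the list above; all map to the internal name "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase the name and drop everything except letters and digits.

    Assumption: this loose matching is only an approximation chosen for
    the sketch.
    """
    return "".join(ch for ch in name.lower() if ch.isalnum())


# Lookup table from normalized alias to internal converter name.
LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


print(canonical_name("CP1208"))
print(canonical_name("latin-1"))
```

A dictionary keyed on the normalized form keeps the lookup constant-time and tolerant of spelling variants such as `utf_8` or `UTF8`.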


Codepage Layout

Currently showing the codepage starting with the bytes F0 90 84

Rows are labeled with the high nibble of the final byte, columns with its low nibble. Rows 00 to 70 and C0 to F0 are empty.

    0 1 2 3 4 5 6 7 8 9 A B C D E F
80  𐄀 𐄁 𐄂 𐄃 𐄄 𐄅 𐄆 𐄇 𐄈 𐄉 𐄊 𐄋 𐄌 𐄍 𐄎 𐄏   (10100 to 1010F)
90  𐄐 𐄑 𐄒 𐄓 𐄔 𐄕 𐄖 𐄗 𐄘 𐄙 𐄚 𐄛 𐄜 𐄝 𐄞 𐄟   (10110 to 1011F)
A0  𐄠 𐄡 𐄢 𐄣 𐄤 𐄥 𐄦 𐄧 𐄨 𐄩 𐄪 𐄫 𐄬 𐄭 𐄮 𐄯   (10120 to 1012F)
B0  𐄰 𐄱 𐄲 𐄳 𐄴 𐄵 𐄶 𐄷 𐄸 𐄹 𐄺 𐄻 𐄼 𐄽 𐄾 𐄿   (10130 to 1013F)
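
The mapping from the byte prefix F0 90 84 to the code points 10100 through 1013F can be reproduced by hand. The following Python sketch assumes the standard 4-byte UTF-8 form (a lead byte 11110xxx followed by three trail bytes 10xxxxxx); the helper name `decode_four_byte` is hypothetical and not part of ICU.

```python
def decode_four_byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    if not 0xF0 <= b0 <= 0xF4:
        raise ValueError(f"invalid lead byte {b0:02X}")
    for trail in (b1, b2, b3):
        if not 0x80 <= trail <= 0xBF:
            raise ValueError(f"invalid trail byte {trail:02X}")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    cp = ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


PREFIX = (0xF0, 0x90, 0x84)

# Only final bytes 80..BF are valid trail bytes, which is why the other
# rows of the layout are empty.
page = {last: decode_four_byte(*PREFIX, last) for last in range(0x80, 0xC0)}

# Cross-check the manual arithmetic against Python's built-in UTF-8 codec.
for last, cp in page.items():
    assert bytes([*PREFIX, last]).decode("utf-8") == chr(cp)

print(f"{page[0x80]:X} .. {page[0xBF]:X}")  # prints "10100 .. 1013F"
```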

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
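
Several of the values above can be checked with a short Python sketch. It assumes that a UChar is one 16-bit UTF-16 code unit, as in ICU4C; the helper `bytes_per_uchar` and the sample characters are illustrative choices only.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 length: 1, 2, 3, and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U00010100"]
ratios = [bytes_per_uchar(c) for c in samples]

# A 4-byte sequence covers two UChars (a surrogate pair), so it averages
# 2 bytes per UChar; the maximum of 3 comes from 3-byte BMP characters.
print(min(ratios), max(ratios))

# The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

# ASCII compatibility: each byte 20..7E decodes to the same code point.
ascii_ok = all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
```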

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
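
The set above excludes only the surrogate code points D800 through DFFF. A minimal Python sketch of the same membership test, using Python's own strict UTF-8 codec as a stand-in for the converter (an assumption; `is_representable` is a hypothetical helper):

```python
def is_representable(code_point):
    """True if the code point can be encoded as well-formed UTF-8."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec refuses lone surrogates.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10100):
    print(f"{cp:04X}", is_representable(cp))
```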