International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
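
The alias list above can be treated as a lookup table from any listed name to the internal converter name. The following is a minimal sketch, not ICU's implementation: the helper names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption about how loosely converter names are usually compared.

```python
# Aliases copied from the list above; all of them name the UTF-8 converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical loose-matching rule: lowercase, drop '-', '_' and spaces."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_INDEX = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_INDEX.get(normalize(name))


print(canonical_name("IBM_1208"))
print(canonical_name("latin1"))
```

Names that are not in the list, such as `latin1` here, simply return `None`; a real converter registry would cover many more charsets.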


Codepage Layout

Currently showing the codepage starting with the bytes F18084

Each cell gives the Unicode code point (hexadecimal) for the byte sequence F1 80 84 followed by the trail byte formed from the row and column. Rows 00-70 and C0-F0 are empty.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80    40100 40101 40102 40103 40104 40105 40106 40107 40108 40109 4010A 4010B 4010C 4010D 4010E 4010F
90    40110 40111 40112 40113 40114 40115 40116 40117 40118 40119 4011A 4011B 4011C 4011D 4011E 4011F
A0    40120 40121 40122 40123 40124 40125 40126 40127 40128 40129 4012A 4012B 4012C 4012D 4012E 4012F
B0    40130 40131 40132 40133 40134 40135 40136 40137 40138 40139 4013A 4013B 4013C 4013D 4013E 4013F
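
The code points in this layout follow from the UTF-8 bit layout for 4-byte sequences. As a rough illustration, the sketch below decodes F1 80 84 plus each trail byte 80-BF by hand and cross-checks the result against Python's built-in decoder. The function name `decode_utf8_4byte` is hypothetical, and it assumes a well-formed 4-byte sequence without validating it.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx (3 + 6 + 6 + 6 payload bits).
    Assumes the input is well formed; no validation is performed.
    """
    assert len(seq) == 4
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F18084")
rows = {}
for trail in range(0x80, 0xC0):  # only 80-BF are valid trail bytes
    seq = prefix + bytes([trail])
    cp = decode_utf8_4byte(seq)
    # Cross-check the manual bit arithmetic against the built-in decoder.
    assert ord(seq.decode("utf-8")) == cp
    rows.setdefault(trail & 0xF0, []).append(cp)

for row, cps in rows.items():
    print(f"{row:02X}  " + " ".join(f"{cp:05X}" for cp in cps))
```

Trail bytes outside 80-BF do not form valid sequences, which is why the other rows of the layout are empty.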

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
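
Two of these values can be checked with a few lines of Python. This is only an illustration using Python's own codecs, not ICU: it assumes a UChar is one UTF-16 code unit, and the helper name `bytes_per_uchar` is hypothetical. It shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD, and that sample characters need between 1 and 3 UTF-8 bytes per UTF-16 code unit.

```python
# The substitution character from the table, as raw UTF-8 bytes.
SUBSTITUTION = b"\xEF\xBF\xBD"
substitution_char = SUBSTITUTION.decode("utf-8")  # U+FFFD REPLACEMENT CHARACTER


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_bytes / utf16_units


samples = {
    "A": bytes_per_uchar("A"),                  # 1 byte, 1 code unit
    "\u00E9": bytes_per_uchar("\u00E9"),        # 2 bytes, 1 code unit
    "\u20AC": bytes_per_uchar("\u20AC"),        # 3 bytes, 1 code unit
    "\U00040100": bytes_per_uchar("\U00040100"),  # 4 bytes, 2 code units
}

for ch, ratio in samples.items():
    print(f"U+{ord(ch):04X}: {ratio} bytes per UTF-16 code unit")
```

A supplementary character takes 4 bytes but also occupies 2 UTF-16 code units, so the per-UChar maximum stays at 3 rather than 4.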

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
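
The set above excludes only the surrogate range U+D800-U+DFFF. As a small sketch of what that means in practice, the hypothetical helper `is_representable` below uses Python's strict UTF-8 encoder, which also refuses surrogates, to test individual code points. It illustrates the set and is not a call into ICU.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800-U+DFFF).
        return False


for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x40100, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```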