International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28085

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򀅀
80140
򀅁
80141
򀅂
80142
򀅃
80143
򀅄
80144
򀅅
80145
򀅆
80146
򀅇
80147
򀅈
80148
򀅉
80149
򀅊
8014A
򀅋
8014B
򀅌
8014C
򀅍
8014D
򀅎
8014E
򀅏
8014F
80
90
򀅐
80150
򀅑
80151
򀅒
80152
򀅓
80153
򀅔
80154
򀅕
80155
򀅖
80156
򀅗
80157
򀅘
80158
򀅙
80159
򀅚
8015A
򀅛
8015B
򀅜
8015C
򀅝
8015D
򀅞
8015E
򀅟
8015F
90
A0
򀅠
80160
򀅡
80161
򀅢
80162
򀅣
80163
򀅤
80164
򀅥
80165
򀅦
80166
򀅧
80167
򀅨
80168
򀅩
80169
򀅪
8016A
򀅫
8016B
򀅬
8016C
򀅭
8016D
򀅮
8016E
򀅯
8016F
A0
B0
򀅰
80170
򀅱
80171
򀅲
80172
򀅳
80173
򀅴
80174
򀅵
80175
򀅶
80176
򀅷
80177
򀅸
80178
򀅹
80179
򀅺
8017A
򀅻
8017B
򀅼
8017C
򀅽
8017D
򀅾
8017E
򀅿
8017F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]