International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39085

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐅀
D0140
󐅁
D0141
󐅂
D0142
󐅃
D0143
󐅄
D0144
󐅅
D0145
󐅆
D0146
󐅇
D0147
󐅈
D0148
󐅉
D0149
󐅊
D014A
󐅋
D014B
󐅌
D014C
󐅍
D014D
󐅎
D014E
󐅏
D014F
80
90
󐅐
D0150
󐅑
D0151
󐅒
D0152
󐅓
D0153
󐅔
D0154
󐅕
D0155
󐅖
D0156
󐅗
D0157
󐅘
D0158
󐅙
D0159
󐅚
D015A
󐅛
D015B
󐅜
D015C
󐅝
D015D
󐅞
D015E
󐅟
D015F
90
A0
󐅠
D0160
󐅡
D0161
󐅢
D0162
󐅣
D0163
󐅤
D0164
󐅥
D0165
󐅦
D0166
󐅧
D0167
󐅨
D0168
󐅩
D0169
󐅪
D016A
󐅫
D016B
󐅬
D016C
󐅭
D016D
󐅮
D016E
󐅯
D016F
A0
B0
󐅰
D0170
󐅱
D0171
󐅲
D0172
󐅳
D0173
󐅴
D0174
󐅵
D0175
󐅶
D0176
󐅷
D0177
󐅸
D0178
󐅹
D0179
󐅺
D017A
󐅻
D017B
󐅼
D017C
󐅽
D017D
󐅾
D017E
󐅿
D017F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]