International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38E85

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󎅀
CE140
󎅁
CE141
󎅂
CE142
󎅃
CE143
󎅄
CE144
󎅅
CE145
󎅆
CE146
󎅇
CE147
󎅈
CE148
󎅉
CE149
󎅊
CE14A
󎅋
CE14B
󎅌
CE14C
󎅍
CE14D
󎅎
CE14E
󎅏
CE14F
80
90
󎅐
CE150
󎅑
CE151
󎅒
CE152
󎅓
CE153
󎅔
CE154
󎅕
CE155
󎅖
CE156
󎅗
CE157
󎅘
CE158
󎅙
CE159
󎅚
CE15A
󎅛
CE15B
󎅜
CE15C
󎅝
CE15D
󎅞
CE15E
󎅟
CE15F
90
A0
󎅠
CE160
󎅡
CE161
󎅢
CE162
󎅣
CE163
󎅤
CE164
󎅥
CE165
󎅦
CE166
󎅧
CE167
󎅨
CE168
󎅩
CE169
󎅪
CE16A
󎅫
CE16B
󎅬
CE16C
󎅭
CE16D
󎅮
CE16E
󎅯
CE16F
A0
B0
󎅰
CE170
󎅱
CE171
󎅲
CE172
󎅳
CE173
󎅴
CE174
󎅵
CE175
󎅶
CE176
󎅷
CE177
󎅸
CE178
󎅹
CE179
󎅺
CE17A
󎅻
CE17B
󎅼
CE17C
󎅽
CE17D
󎅾
CE17E
󎅿
CE17F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]