International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18A86

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񊆀
4A180
񊆁
4A181
񊆂
4A182
񊆃
4A183
񊆄
4A184
񊆅
4A185
񊆆
4A186
񊆇
4A187
񊆈
4A188
񊆉
4A189
񊆊
4A18A
񊆋
4A18B
񊆌
4A18C
񊆍
4A18D
񊆎
4A18E
񊆏
4A18F
80
90
񊆐
4A190
񊆑
4A191
񊆒
4A192
񊆓
4A193
񊆔
4A194
񊆕
4A195
񊆖
4A196
񊆗
4A197
񊆘
4A198
񊆙
4A199
񊆚
4A19A
񊆛
4A19B
񊆜
4A19C
񊆝
4A19D
񊆞
4A19E
񊆟
4A19F
90
A0
񊆠
4A1A0
񊆡
4A1A1
񊆢
4A1A2
񊆣
4A1A3
񊆤
4A1A4
񊆥
4A1A5
񊆦
4A1A6
񊆧
4A1A7
񊆨
4A1A8
񊆩
4A1A9
񊆪
4A1AA
񊆫
4A1AB
񊆬
4A1AC
񊆭
4A1AD
񊆮
4A1AE
񊆯
4A1AF
A0
B0
񊆰
4A1B0
񊆱
4A1B1
񊆲
4A1B2
񊆳
4A1B3
񊆴
4A1B4
񊆵
4A1B5
񊆶
4A1B6
񊆷
4A1B7
񊆸
4A1B8
񊆹
4A1B9
񊆺
4A1BA
񊆻
4A1BB
񊆼
4A1BC
񊆽
4A1BD
񊆾
4A1BE
񊆿
4A1BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]