International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2938C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򓌀
93300
򓌁
93301
򓌂
93302
򓌃
93303
򓌄
93304
򓌅
93305
򓌆
93306
򓌇
93307
򓌈
93308
򓌉
93309
򓌊
9330A
򓌋
9330B
򓌌
9330C
򓌍
9330D
򓌎
9330E
򓌏
9330F
80
90
򓌐
93310
򓌑
93311
򓌒
93312
򓌓
93313
򓌔
93314
򓌕
93315
򓌖
93316
򓌗
93317
򓌘
93318
򓌙
93319
򓌚
9331A
򓌛
9331B
򓌜
9331C
򓌝
9331D
򓌞
9331E
򓌟
9331F
90
A0
򓌠
93320
򓌡
93321
򓌢
93322
򓌣
93323
򓌤
93324
򓌥
93325
򓌦
93326
򓌧
93327
򓌨
93328
򓌩
93329
򓌪
9332A
򓌫
9332B
򓌬
9332C
򓌭
9332D
򓌮
9332E
򓌯
9332F
A0
B0
򓌰
93330
򓌱
93331
򓌲
93332
򓌳
93333
򓌴
93334
򓌵
93335
򓌶
93336
򓌷
93337
򓌸
93338
򓌹
93339
򓌺
9333A
򓌻
9333B
򓌼
9333C
򓌽
9333D
򓌾
9333E
򓌿
9333F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]