International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2828A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򂊀
82280
򂊁
82281
򂊂
82282
򂊃
82283
򂊄
82284
򂊅
82285
򂊆
82286
򂊇
82287
򂊈
82288
򂊉
82289
򂊊
8228A
򂊋
8228B
򂊌
8228C
򂊍
8228D
򂊎
8228E
򂊏
8228F
80
90
򂊐
82290
򂊑
82291
򂊒
82292
򂊓
82293
򂊔
82294
򂊕
82295
򂊖
82296
򂊗
82297
򂊘
82298
򂊙
82299
򂊚
8229A
򂊛
8229B
򂊜
8229C
򂊝
8229D
򂊞
8229E
򂊟
8229F
90
A0
򂊠
822A0
򂊡
822A1
򂊢
822A2
򂊣
822A3
򂊤
822A4
򂊥
822A5
򂊦
822A6
򂊧
822A7
򂊨
822A8
򂊩
822A9
򂊪
822AA
򂊫
822AB
򂊬
822AC
򂊭
822AD
򂊮
822AE
򂊯
822AF
A0
B0
򂊰
822B0
򂊱
822B1
򂊲
822B2
򂊳
822B3
򂊴
822B4
򂊵
822B5
򂊶
822B6
򂊷
822B7
򂊸
822B8
򂊹
822B9
򂊺
822BA
򂊻
822BB
򂊼
822BC
򂊽
822BD
򂊾
822BE
򂊿
822BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]