International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F2808A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򀊀
80280
򀊁
80281
򀊂
80282
򀊃
80283
򀊄
80284
򀊅
80285
򀊆
80286
򀊇
80287
򀊈
80288
򀊉
80289
򀊊
8028A
򀊋
8028B
򀊌
8028C
򀊍
8028D
򀊎
8028E
򀊏
8028F
80
90
򀊐
80290
򀊑
80291
򀊒
80292
򀊓
80293
򀊔
80294
򀊕
80295
򀊖
80296
򀊗
80297
򀊘
80298
򀊙
80299
򀊚
8029A
򀊛
8029B
򀊜
8029C
򀊝
8029D
򀊞
8029E
򀊟
8029F
90
A0
򀊠
802A0
򀊡
802A1
򀊢
802A2
򀊣
802A3
򀊤
802A4
򀊥
802A5
򀊦
802A6
򀊧
802A7
򀊨
802A8
򀊩
802A9
򀊪
802AA
򀊫
802AB
򀊬
802AC
򀊭
802AD
򀊮
802AE
򀊯
802AF
A0
B0
򀊰
802B0
򀊱
802B1
򀊲
802B2
򀊳
802B3
򀊴
802B4
򀊵
802B5
򀊶
802B6
򀊷
802B7
򀊸
802B8
򀊹
802B9
򀊺
802BA
򀊻
802BB
򀊼
802BC
򀊽
802BD
򀊾
802BE
򀊿
802BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]