International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name   IANA    MIME
UTF-8                     UTF-8   UTF-8


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with F2 95 A8.

Row label plus column label gives the final byte of the sequence; each populated cell gives the code point (hex) mapped to that byte sequence.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  95A00 95A01 95A02 95A03 95A04 95A05 95A06 95A07 95A08 95A09 95A0A 95A0B 95A0C 95A0D 95A0E 95A0F
90  95A10 95A11 95A12 95A13 95A14 95A15 95A16 95A17 95A18 95A19 95A1A 95A1B 95A1C 95A1D 95A1E 95A1F
A0  95A20 95A21 95A22 95A23 95A24 95A25 95A26 95A27 95A28 95A29 95A2A 95A2B 95A2C 95A2D 95A2E 95A2F
B0  95A30 95A31 95A32 95A33 95A34 95A35 95A36 95A37 95A38 95A39 95A3A 95A3B 95A3C 95A3D 95A3E 95A3F
C0
D0
E0
F0
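
The populated rows follow from the UTF-8 bit layout for four-byte sequences. As a minimal sketch in Python (not ICU code; the helper name `decode_four_byte` is made up for illustration), combining the lead bytes F2 95 A8 with each trail byte 80..BF gives the code points shown in the grid:

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Decode one 4-byte UTF-8 sequence by hand (hypothetical helper).

    Only the bit patterns are checked; overlong forms and the
    U+10FFFF upper limit are not validated here.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte lead byte")
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("not a continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


PREFIX = (0xF2, 0x95, 0xA8)
code_points = [decode_four_byte(*PREFIX, trail) for trail in range(0x80, 0xC0)]

# Cross-check the hand decoding against Python's built-in UTF-8 decoder.
for trail, cp in zip(range(0x80, 0xC0), code_points):
    assert ord(bytes([*PREFIX, trail]).decode("utf-8")) == cp

print(f"{code_points[0]:X}..{code_points[-1]:X}")  # 95A00..95A3F
```

Trail bytes outside 80..BF are not valid continuation bytes, which is why the other rows of the grid are empty.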

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
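
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence (two UChars) still averages two bytes per unit. The following Python sketch illustrates this with the standard codecs rather than ICU itself; the function name `utf8_bytes_per_utf16_unit` and the sample characters are assumptions chosen for illustration:

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 Supplement, a BMP symbol, and a supplementary code point.
samples = ["A", "\u00E9", "\u20AC", "\U00095A00"]
ratios = [utf8_bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The listed substitution bytes are the UTF-8 encoding of U+FFFD.
substitution_bytes = "\uFFFD".encode("utf-8")
print(substitution_bytes)  # b'\xef\xbf\xbd'
```

The ratio peaks at 3 for BMP characters from U+0800 upward, which matches the listed maximum.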

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
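
The set above excludes only the surrogate range. As a rough check using Python's strict UTF-8 codec (an analogue, not the ICU converter; `is_utf8_representable` is a hypothetical helper name), every code point except U+D800..U+DFFF can be encoded:

```python
def is_utf8_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan the full Unicode code space, U+0000..U+10FFFF.
unrepresentable = [cp for cp in range(0x110000) if not is_utf8_representable(cp)]
print(f"{unrepresentable[0]:04X}..{unrepresentable[-1]:04X}, {len(unrepresentable)} code points")
# D800..DFFF, 2048 code points
```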