International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F182AD

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񂭀
42B40
񂭁
42B41
񂭂
42B42
񂭃
42B43
񂭄
42B44
񂭅
42B45
񂭆
42B46
񂭇
42B47
񂭈
42B48
񂭉
42B49
񂭊
42B4A
񂭋
42B4B
񂭌
42B4C
񂭍
42B4D
񂭎
42B4E
񂭏
42B4F
80
90
񂭐
42B50
񂭑
42B51
񂭒
42B52
񂭓
42B53
񂭔
42B54
񂭕
42B55
񂭖
42B56
񂭗
42B57
񂭘
42B58
񂭙
42B59
񂭚
42B5A
񂭛
42B5B
񂭜
42B5C
񂭝
42B5D
񂭞
42B5E
񂭟
42B5F
90
A0
񂭠
42B60
񂭡
42B61
񂭢
42B62
񂭣
42B63
񂭤
42B64
񂭥
42B65
񂭦
42B66
񂭧
42B67
񂭨
42B68
񂭩
42B69
񂭪
42B6A
񂭫
42B6B
񂭬
42B6C
񂭭
42B6D
񂭮
42B6E
񂭯
42B6F
A0
B0
񂭰
42B70
񂭱
42B71
񂭲
42B72
񂭳
42B73
񂭴
42B74
񂭵
42B75
񂭶
42B76
񂭷
42B77
񂭸
42B78
񂭹
42B79
񂭺
42B7A
񂭻
42B7B
񂭼
42B7C
񂭽
42B7D
񂭾
42B7E
񂭿
42B7F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]