International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F180BA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񀺀
40E80
񀺁
40E81
񀺂
40E82
񀺃
40E83
񀺄
40E84
񀺅
40E85
񀺆
40E86
񀺇
40E87
񀺈
40E88
񀺉
40E89
񀺊
40E8A
񀺋
40E8B
񀺌
40E8C
񀺍
40E8D
񀺎
40E8E
񀺏
40E8F
80
90
񀺐
40E90
񀺑
40E91
񀺒
40E92
񀺓
40E93
񀺔
40E94
񀺕
40E95
񀺖
40E96
񀺗
40E97
񀺘
40E98
񀺙
40E99
񀺚
40E9A
񀺛
40E9B
񀺜
40E9C
񀺝
40E9D
񀺞
40E9E
񀺟
40E9F
90
A0
񀺠
40EA0
񀺡
40EA1
񀺢
40EA2
񀺣
40EA3
񀺤
40EA4
񀺥
40EA5
񀺦
40EA6
񀺧
40EA7
񀺨
40EA8
񀺩
40EA9
񀺪
40EAA
񀺫
40EAB
񀺬
40EAC
񀺭
40EAD
񀺮
40EAE
񀺯
40EAF
A0
B0
񀺰
40EB0
񀺱
40EB1
񀺲
40EB2
񀺳
40EB3
񀺴
40EB4
񀺵
40EB5
񀺶
40EB6
񀺷
40EB7
񀺸
40EB8
񀺹
40EB9
񀺺
40EBA
񀺻
40EBB
񀺼
40EBC
񀺽
40EBD
񀺾
40EBE
񀺿
40EBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]