International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3 80 AA. Rows give the high nibble of the final byte, columns the low nibble; each cell is the resulting code point in hex. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C0A80 C0A81 C0A82 C0A83 C0A84 C0A85 C0A86 C0A87 C0A88 C0A89 C0A8A C0A8B C0A8C C0A8D C0A8E C0A8F
90  C0A90 C0A91 C0A92 C0A93 C0A94 C0A95 C0A96 C0A97 C0A98 C0A99 C0A9A C0A9B C0A9C C0A9D C0A9E C0A9F
A0  C0AA0 C0AA1 C0AA2 C0AA3 C0AA4 C0AA5 C0AA6 C0AA7 C0AA8 C0AA9 C0AAA C0AAB C0AAC C0AAD C0AAE C0AAF
B0  C0AB0 C0AB1 C0AB2 C0AB3 C0AB4 C0AB5 C0AB6 C0AB7 C0AB8 C0AB9 C0ABA C0ABB C0ABC C0ABD C0ABE C0ABF
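
The layout above pairs the three lead bytes F3 80 AA with a final byte in the range 80 through BF. As a minimal sketch of why those sequences land on C0A80 through C0ABF, the following Python decodes a four-byte UTF-8 sequence by hand. It uses Python's built-in codec only as a cross-check, not ICU's converter, and the names `decode_four_byte`, `prefix`, and `layout` are illustrative.

```python
def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by hand and return the code point."""
    b0, b1, b2, b3 = seq
    # Lead byte 11110xxx carries 3 payload bits; each trail byte 10xxxxxx carries 6.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F380AA")  # the three bytes named in the layout

# Map every possible final byte (0x80..0xBF) to the code point it selects.
layout = {last: decode_four_byte(prefix + bytes([last])) for last in range(0x80, 0xC0)}

for last, cp in layout.items():
    # Cross-check the hand decoder against Python's built-in UTF-8 codec.
    assert chr(cp).encode("utf-8") == prefix + bytes([last])

print(f"{layout[0x80]:X} .. {layout[0xBF]:X}")  # prints "C0A80 .. C0ABF"
```

The final byte contributes only the low six bits of the code point, which is why a single page of the layout covers exactly 64 consecutive code points.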

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
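
A few of the properties above can be checked without ICU. The sketch below uses Python's standard UTF-8 codec, which is an assumption standing in for the ICU converter, and it reads "UChar" as one 16-bit UTF-16 code unit. The helper `bytes_per_utf16_unit` and the sample characters are illustrative choices.

```python
# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
substitution = "\ufffd".encode("utf-8")

# Printable ASCII 0x20..0x7E: each byte decodes to the code point of the same value.
ascii_bytes = bytes(range(0x20, 0x7F))
ascii_round_trip = [ord(ch) for ch in ascii_bytes.decode("utf-8")] == list(ascii_bytes)


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units


samples = {
    "U+0041": "A",            # 1 byte, 1 code unit
    "U+00E9": "\u00e9",       # 2 bytes, 1 code unit
    "U+20AC": "\u20ac",       # 3 bytes, 1 code unit
    "U+C0A80": "\U000C0A80",  # 4 bytes, 2 code units (a surrogate pair)
}
ratios = {name: bytes_per_utf16_unit(ch) for name, ch in samples.items()}

print(substitution, ascii_round_trip, ratios)
```

For these samples the ratio ranges from 1 to 3: a supplementary character takes four bytes but spans two UTF-16 code units, so it averages two bytes per unit rather than four.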

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
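
The set above excludes only the surrogate range U+D800 through U+DFFF. As a rough illustration, the sketch below probes the boundaries of that range with Python's strict UTF-8 codec, which is assumed here to behave like the ICU converter on this point. The helper name `utf8_encodable` and the chosen probe values are illustrative.

```python
def utf8_encodable(cp: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the excluded surrogate range, plus the last code point.
probes = (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF)
results = {f"U+{cp:04X}": utf8_encodable(cp) for cp in probes}

print(results)
```

Surrogates are reserved for UTF-16 pairs and are not characters in their own right, so a strict UTF-8 encoder rejects them while accepting every other code point up to U+10FFFF.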