International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
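
To illustrate how an alias list like this one might be used, here is a minimal Python sketch that maps any of the listed names back to the internal converter name. The helper names (`normalize`, `resolve`) are hypothetical, and the loose comparison (ignoring case and punctuation) is an assumption in the spirit of ICU's alias matching; ICU's actual rules may differ in detail.

```python
# Aliases copied from the list above; the helper names are hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and keep only ASCII letters and digits.

    This is a simplification assumed for the sketch, not ICU's exact rule.
    """
    return "".join(c for c in name.lower() if c.isascii() and c.isalnum())


_KNOWN = {normalize(alias) for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for a known alias, else None."""
    return "UTF-8" if normalize(name) in _KNOWN else None


print(resolve("IBM_1208"))  # UTF-8
print(resolve("utf8"))      # UTF-8
print(resolve("latin-1"))   # None
```

Normalizing both sides once keeps the lookup a plain set membership test, so spelling variants such as `IBM_1208` and `ibm-1208` resolve to the same converter.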


Codepage Layout

Currently showing the codepage starting with the bytes F0B8AA

Rows are the high nibble of the final byte, columns the low nibble. Each cell shows the character and, beneath it, its code point.

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70 (empty)
80    𸪀    𸪁    𸪂    𸪃    𸪄    𸪅    𸪆    𸪇    𸪈    𸪉    𸪊    𸪋    𸪌    𸪍    𸪎    𸪏
      38A80 38A81 38A82 38A83 38A84 38A85 38A86 38A87 38A88 38A89 38A8A 38A8B 38A8C 38A8D 38A8E 38A8F
90    𸪐    𸪑    𸪒    𸪓    𸪔    𸪕    𸪖    𸪗    𸪘    𸪙    𸪚    𸪛    𸪜    𸪝    𸪞    𸪟
      38A90 38A91 38A92 38A93 38A94 38A95 38A96 38A97 38A98 38A99 38A9A 38A9B 38A9C 38A9D 38A9E 38A9F
A0    𸪠    𸪡    𸪢    𸪣    𸪤    𸪥    𸪦    𸪧    𸪨    𸪩    𸪪    𸪫    𸪬    𸪭    𸪮    𸪯
      38AA0 38AA1 38AA2 38AA3 38AA4 38AA5 38AA6 38AA7 38AA8 38AA9 38AAA 38AAB 38AAC 38AAD 38AAE 38AAF
B0    𸪰    𸪱    𸪲    𸪳    𸪴    𸪵    𸪶    𸪷    𸪸    𸪹    𸪺    𸪻    𸪼    𸪽    𸪾    𸪿
      38AB0 38AB1 38AB2 38AB3 38AB4 38AB5 38AB6 38AB7 38AB8 38AB9 38ABA 38ABB 38ABC 38ABD 38ABE 38ABF
C0-F0 (empty)
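
The layout above follows directly from the UTF-8 encoding rules: after the lead bytes F0 B8 AA, only final bytes in the range 80 to BF complete a valid four-byte sequence. A minimal Python sketch that recomputes the populated cells with the standard library decoder (the function name `codepage_rows` is hypothetical):

```python
def codepage_rows(lead: bytes) -> dict:
    """Map each valid final byte to the code point of lead + final byte."""
    rows = {}
    for final in range(0x00, 0x100):
        seq = lead + bytes([final])
        try:
            # Strict decoding rejects anything that is not well-formed UTF-8.
            ch = seq.decode("utf-8")
        except UnicodeDecodeError:
            continue  # this cell is empty in the layout
        if len(ch) == 1:
            rows[final] = ord(ch)
    return rows


rows = codepage_rows(bytes.fromhex("F0B8AA"))
print(len(rows))                  # 64
print(f"{min(rows):02X}-{max(rows):02X}")  # 80-BF
print(f"{rows[0x80]:X}")          # 38A80
print(f"{rows[0xBF]:X}")          # 38ABF
```

Final bytes outside 80 to BF fail to decode because they are not continuation bytes, which is why only four of the sixteen rows are populated.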

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
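
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per character. The following Python sketch, using only the standard library, shows one way to see why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to a surrogate pair, that is, two UChars. The function name `bytes_per_utf16_unit` is hypothetical, and the `errors="replace"` behavior shown is Python's decoder, not ICU's.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


for ch in ["A", "\u00E9", "\uFFFD", "\U00038A80"]:
    print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)}")
# U+0041: 1.0
# U+00E9: 2.0
# U+FFFD: 3.0
# U+38A80: 2.0

# The substitution character bytes are the UTF-8 encoding of U+FFFD.
print("\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD")  # True

# Python's own decoder also substitutes U+FFFD for malformed input.
print(b"A\xFFB".decode("utf-8", errors="replace") == "A\uFFFDB")  # True
```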

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
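
The set above excludes only the surrogate range U+D800 to U+DFFF; every other code point up to U+10FFFF is representable. A minimal Python sketch that checks this boundary with the standard library encoder (the function name `is_representable` is hypothetical, and Python's strict UTF-8 codec stands in for the ICU converter here):

```python
def is_representable(cp: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec rejects lone surrogates.
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x38A80, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
# U+D7FF: True
# U+D800: False
# U+DFFF: False
# U+E000: True
# U+38A80: True
# U+10FFFF: True

# Size of the set: all code points minus the 2048 surrogates.
print(0x110000 - (0xDFFF - 0xD800 + 1))  # 1112064
```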