International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A4AA

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𤪀
24A80
𤪁
24A81
𤪂
24A82
𤪃
24A83
𤪄
24A84
𤪅
24A85
𤪆
24A86
𤪇
24A87
𤪈
24A88
𤪉
24A89
𤪊
24A8A
𤪋
24A8B
𤪌
24A8C
𤪍
24A8D
𤪎
24A8E
𤪏
24A8F
80
90
𤪐
24A90
𤪑
24A91
𤪒
24A92
𤪓
24A93
𤪔
24A94
𤪕
24A95
𤪖
24A96
𤪗
24A97
𤪘
24A98
𤪙
24A99
𤪚
24A9A
𤪛
24A9B
𤪜
24A9C
𤪝
24A9D
𤪞
24A9E
𤪟
24A9F
90
A0
𤪠
24AA0
𤪡
24AA1
𤪢
24AA2
𤪣
24AA3
𤪤
24AA4
𤪥
24AA5
𤪦
24AA6
𤪧
24AA7
𤪨
24AA8
𤪩
24AA9
𤪪
24AAA
𤪫
24AAB
𤪬
24AAC
𤪭
24AAD
𤪮
24AAE
𤪯
24AAF
A0
B0
𤪰
24AB0
𤪱
24AB1
𤪲
24AB2
𤪳
24AB3
𤪴
24AB4
𤪵
24AB5
𤪶
24AB6
𤪷
24AB7
𤪸
24AB8
𤪹
24AB9
𤪺
24ABA
𤪻
24ABB
𤪼
24ABC
𤪽
24ABD
𤪾
24ABE
𤪿
24ABF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]