International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F090AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𐮀
10B80
𐮁
10B81
𐮂
10B82
𐮃
10B83
𐮄
10B84
𐮅
10B85
𐮆
10B86
𐮇
10B87
𐮈
10B88
𐮉
10B89
𐮊
10B8A
𐮋
10B8B
𐮌
10B8C
𐮍
10B8D
𐮎
10B8E
𐮏
10B8F
80
90
𐮐
10B90
𐮑
10B91
𐮒
10B92
𐮓
10B93
𐮔
10B94
𐮕
10B95
𐮖
10B96
𐮗
10B97
𐮘
10B98
𐮙
10B99
𐮚
10B9A
𐮛
10B9B
𐮜
10B9C
𐮝
10B9D
𐮞
10B9E
𐮟
10B9F
90
A0
𐮠
10BA0
𐮡
10BA1
𐮢
10BA2
𐮣
10BA3
𐮤
10BA4
𐮥
10BA5
𐮦
10BA6
𐮧
10BA7
𐮨
10BA8
𐮩
10BA9
𐮪
10BAA
𐮫
10BAB
𐮬
10BAC
𐮭
10BAD
𐮮
10BAE
𐮯
10BAF
A0
B0
𐮰
10BB0
𐮱
10BB1
𐮲
10BB2
𐮳
10BB3
𐮴
10BB4
𐮵
10BB5
𐮶
10BB6
𐮷
10BB7
𐮸
10BB8
𐮹
10BB9
𐮺
10BBA
𐮻
10BBB
𐮼
10BBC
𐮽
10BBD
𐮾
10BBE
𐮿
10BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]