International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F487AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􇮀
107B80
􇮁
107B81
􇮂
107B82
􇮃
107B83
􇮄
107B84
􇮅
107B85
􇮆
107B86
􇮇
107B87
􇮈
107B88
􇮉
107B89
􇮊
107B8A
􇮋
107B8B
􇮌
107B8C
􇮍
107B8D
􇮎
107B8E
􇮏
107B8F
80
90
􇮐
107B90
􇮑
107B91
􇮒
107B92
􇮓
107B93
􇮔
107B94
􇮕
107B95
􇮖
107B96
􇮗
107B97
􇮘
107B98
􇮙
107B99
􇮚
107B9A
􇮛
107B9B
􇮜
107B9C
􇮝
107B9D
􇮞
107B9E
􇮟
107B9F
90
A0
􇮠
107BA0
􇮡
107BA1
􇮢
107BA2
􇮣
107BA3
􇮤
107BA4
􇮥
107BA5
􇮦
107BA6
􇮧
107BA7
􇮨
107BA8
􇮩
107BA9
􇮪
107BAA
􇮫
107BAB
􇮬
107BAC
􇮭
107BAD
􇮮
107BAE
􇮯
107BAF
A0
B0
􇮰
107BB0
􇮱
107BB1
􇮲
107BB2
􇮳
107BB3
􇮴
107BB4
􇮵
107BB5
􇮶
107BB6
􇮷
107BB7
􇮸
107BB8
􇮹
107BB9
􇮺
107BBA
􇮻
107BBB
􇮼
107BBC
􇮽
107BBD
􇮾
107BBE
􇮿
107BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]