International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F187AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񇮀
47B80
񇮁
47B81
񇮂
47B82
񇮃
47B83
񇮄
47B84
񇮅
47B85
񇮆
47B86
񇮇
47B87
񇮈
47B88
񇮉
47B89
񇮊
47B8A
񇮋
47B8B
񇮌
47B8C
񇮍
47B8D
񇮎
47B8E
񇮏
47B8F
80
90
񇮐
47B90
񇮑
47B91
񇮒
47B92
񇮓
47B93
񇮔
47B94
񇮕
47B95
񇮖
47B96
񇮗
47B97
񇮘
47B98
񇮙
47B99
񇮚
47B9A
񇮛
47B9B
񇮜
47B9C
񇮝
47B9D
񇮞
47B9E
񇮟
47B9F
90
A0
񇮠
47BA0
񇮡
47BA1
񇮢
47BA2
񇮣
47BA3
񇮤
47BA4
񇮥
47BA5
񇮦
47BA6
񇮧
47BA7
񇮨
47BA8
񇮩
47BA9
񇮪
47BAA
񇮫
47BAB
񇮬
47BAC
񇮭
47BAD
񇮮
47BAE
񇮯
47BAF
A0
B0
񇮰
47BB0
񇮱
47BB1
񇮲
47BB2
񇮳
47BB3
񇮴
47BB4
񇮵
47BB5
񇮶
47BB6
񇮷
47BB7
񇮸
47BB8
񇮹
47BB9
񇮺
47BBA
񇮻
47BBB
񇮼
47BBC
񇮽
47BBD
񇮾
47BBE
񇮿
47BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]