International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B4AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𴮀
34B80
𴮁
34B81
𴮂
34B82
𴮃
34B83
𴮄
34B84
𴮅
34B85
𴮆
34B86
𴮇
34B87
𴮈
34B88
𴮉
34B89
𴮊
34B8A
𴮋
34B8B
𴮌
34B8C
𴮍
34B8D
𴮎
34B8E
𴮏
34B8F
80
90
𴮐
34B90
𴮑
34B91
𴮒
34B92
𴮓
34B93
𴮔
34B94
𴮕
34B95
𴮖
34B96
𴮗
34B97
𴮘
34B98
𴮙
34B99
𴮚
34B9A
𴮛
34B9B
𴮜
34B9C
𴮝
34B9D
𴮞
34B9E
𴮟
34B9F
90
A0
𴮠
34BA0
𴮡
34BA1
𴮢
34BA2
𴮣
34BA3
𴮤
34BA4
𴮥
34BA5
𴮦
34BA6
𴮧
34BA7
𴮨
34BA8
𴮩
34BA9
𴮪
34BAA
𴮫
34BAB
𴮬
34BAC
𴮭
34BAD
𴮮
34BAE
𴮯
34BAF
A0
B0
𴮰
34BB0
𴮱
34BB1
𴮲
34BB2
𴮳
34BB3
𴮴
34BB4
𴮵
34BB5
𴮶
34BB6
𴮷
34BB7
𴮸
34BB8
𴮹
34BB9
𴮺
34BBA
𴮻
34BBB
𴮼
34BBC
𴮽
34BBD
𴮾
34BBE
𴮿
34BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]