International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A2AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󢮀
E2B80
󢮁
E2B81
󢮂
E2B82
󢮃
E2B83
󢮄
E2B84
󢮅
E2B85
󢮆
E2B86
󢮇
E2B87
󢮈
E2B88
󢮉
E2B89
󢮊
E2B8A
󢮋
E2B8B
󢮌
E2B8C
󢮍
E2B8D
󢮎
E2B8E
󢮏
E2B8F
80
90
󢮐
E2B90
󢮑
E2B91
󢮒
E2B92
󢮓
E2B93
󢮔
E2B94
󢮕
E2B95
󢮖
E2B96
󢮗
E2B97
󢮘
E2B98
󢮙
E2B99
󢮚
E2B9A
󢮛
E2B9B
󢮜
E2B9C
󢮝
E2B9D
󢮞
E2B9E
󢮟
E2B9F
90
A0
󢮠
E2BA0
󢮡
E2BA1
󢮢
E2BA2
󢮣
E2BA3
󢮤
E2BA4
󢮥
E2BA5
󢮦
E2BA6
󢮧
E2BA7
󢮨
E2BA8
󢮩
E2BA9
󢮪
E2BAA
󢮫
E2BAB
󢮬
E2BAC
󢮭
E2BAD
󢮮
E2BAE
󢮯
E2BAF
A0
B0
󢮰
E2BB0
󢮱
E2BB1
󢮲
E2BB2
󢮳
E2BB3
󢮴
E2BB4
󢮵
E2BB5
󢮶
E2BB6
󢮷
E2BB7
󢮸
E2BB8
󢮹
E2BB9
󢮺
E2BBA
󢮻
E2BBB
󢮼
E2BBC
󢮽
E2BBD
󢮾
E2BBE
󢮿
E2BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]