International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B5AE

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󵮀
F5B80
󵮁
F5B81
󵮂
F5B82
󵮃
F5B83
󵮄
F5B84
󵮅
F5B85
󵮆
F5B86
󵮇
F5B87
󵮈
F5B88
󵮉
F5B89
󵮊
F5B8A
󵮋
F5B8B
󵮌
F5B8C
󵮍
F5B8D
󵮎
F5B8E
󵮏
F5B8F
80
90
󵮐
F5B90
󵮑
F5B91
󵮒
F5B92
󵮓
F5B93
󵮔
F5B94
󵮕
F5B95
󵮖
F5B96
󵮗
F5B97
󵮘
F5B98
󵮙
F5B99
󵮚
F5B9A
󵮛
F5B9B
󵮜
F5B9C
󵮝
F5B9D
󵮞
F5B9E
󵮟
F5B9F
90
A0
󵮠
F5BA0
󵮡
F5BA1
󵮢
F5BA2
󵮣
F5BA3
󵮤
F5BA4
󵮥
F5BA5
󵮦
F5BA6
󵮧
F5BA7
󵮨
F5BA8
󵮩
F5BA9
󵮪
F5BAA
󵮫
F5BAB
󵮬
F5BAC
󵮭
F5BAD
󵮮
F5BAE
󵮯
F5BAF
A0
B0
󵮰
F5BB0
󵮱
F5BB1
󵮲
F5BB2
󵮳
F5BB3
󵮴
F5BB4
󵮵
F5BB5
󵮶
F5BB6
󵮷
F5BB7
󵮸
F5BB8
󵮹
F5BB9
󵮺
F5BBA
󵮻
F5BBB
󵮼
F5BBC
󵮽
F5BBD
󵮾
F5BBE
󵮿
F5BBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]