International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09AB7

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𚷀
1ADC0
𚷁
1ADC1
𚷂
1ADC2
𚷃
1ADC3
𚷄
1ADC4
𚷅
1ADC5
𚷆
1ADC6
𚷇
1ADC7
𚷈
1ADC8
𚷉
1ADC9
𚷊
1ADCA
𚷋
1ADCB
𚷌
1ADCC
𚷍
1ADCD
𚷎
1ADCE
𚷏
1ADCF
80
90
𚷐
1ADD0
𚷑
1ADD1
𚷒
1ADD2
𚷓
1ADD3
𚷔
1ADD4
𚷕
1ADD5
𚷖
1ADD6
𚷗
1ADD7
𚷘
1ADD8
𚷙
1ADD9
𚷚
1ADDA
𚷛
1ADDB
𚷜
1ADDC
𚷝
1ADDD
𚷞
1ADDE
𚷟
1ADDF
90
A0
𚷠
1ADE0
𚷡
1ADE1
𚷢
1ADE2
𚷣
1ADE3
𚷤
1ADE4
𚷥
1ADE5
𚷦
1ADE6
𚷧
1ADE7
𚷨
1ADE8
𚷩
1ADE9
𚷪
1ADEA
𚷫
1ADEB
𚷬
1ADEC
𚷭
1ADED
𚷮
1ADEE
𚷯
1ADEF
A0
B0
𚷰
1ADF0
𚷱
1ADF1
𚷲
1ADF2
𚷳
1ADF3
𚷴
1ADF4
𚷵
1ADF5
𚷶
1ADF6
𚷷
1ADF7
𚷸
1ADF8
𚷹
1ADF9
𚷺
1ADFA
𚷻
1ADFB
𚷼
1ADFC
𚷽
1ADFD
𚷾
1ADFE
𚷿
1ADFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]