International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F390B7

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐷀
D0DC0
󐷁
D0DC1
󐷂
D0DC2
󐷃
D0DC3
󐷄
D0DC4
󐷅
D0DC5
󐷆
D0DC6
󐷇
D0DC7
󐷈
D0DC8
󐷉
D0DC9
󐷊
D0DCA
󐷋
D0DCB
󐷌
D0DCC
󐷍
D0DCD
󐷎
D0DCE
󐷏
D0DCF
80
90
󐷐
D0DD0
󐷑
D0DD1
󐷒
D0DD2
󐷓
D0DD3
󐷔
D0DD4
󐷕
D0DD5
󐷖
D0DD6
󐷗
D0DD7
󐷘
D0DD8
󐷙
D0DD9
󐷚
D0DDA
󐷛
D0DDB
󐷜
D0DDC
󐷝
D0DDD
󐷞
D0DDE
󐷟
D0DDF
90
A0
󐷠
D0DE0
󐷡
D0DE1
󐷢
D0DE2
󐷣
D0DE3
󐷤
D0DE4
󐷥
D0DE5
󐷦
D0DE6
󐷧
D0DE7
󐷨
D0DE8
󐷩
D0DE9
󐷪
D0DEA
󐷫
D0DEB
󐷬
D0DEC
󐷭
D0DED
󐷮
D0DEE
󐷯
D0DEF
A0
B0
󐷰
D0DF0
󐷱
D0DF1
󐷲
D0DF2
󐷳
D0DF3
󐷴
D0DF4
󐷵
D0DF5
󐷶
D0DF6
󐷷
D0DF7
󐷸
D0DF8
󐷹
D0DF9
󐷺
D0DFA
󐷻
D0DFB
󐷼
D0DFC
󐷽
D0DFD
󐷾
D0DFE
󐷿
D0DFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]