International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F381B7

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁷀
C1DC0
󁷁
C1DC1
󁷂
C1DC2
󁷃
C1DC3
󁷄
C1DC4
󁷅
C1DC5
󁷆
C1DC6
󁷇
C1DC7
󁷈
C1DC8
󁷉
C1DC9
󁷊
C1DCA
󁷋
C1DCB
󁷌
C1DCC
󁷍
C1DCD
󁷎
C1DCE
󁷏
C1DCF
80
90
󁷐
C1DD0
󁷑
C1DD1
󁷒
C1DD2
󁷓
C1DD3
󁷔
C1DD4
󁷕
C1DD5
󁷖
C1DD6
󁷗
C1DD7
󁷘
C1DD8
󁷙
C1DD9
󁷚
C1DDA
󁷛
C1DDB
󁷜
C1DDC
󁷝
C1DDD
󁷞
C1DDE
󁷟
C1DDF
90
A0
󁷠
C1DE0
󁷡
C1DE1
󁷢
C1DE2
󁷣
C1DE3
󁷤
C1DE4
󁷥
C1DE5
󁷦
C1DE6
󁷧
C1DE7
󁷨
C1DE8
󁷩
C1DE9
󁷪
C1DEA
󁷫
C1DEB
󁷬
C1DEC
󁷭
C1DED
󁷮
C1DEE
󁷯
C1DEF
A0
B0
󁷰
C1DF0
󁷱
C1DF1
󁷲
C1DF2
󁷳
C1DF3
󁷴
C1DF4
󁷵
C1DF5
󁷶
C1DF6
󁷷
C1DF7
󁷸
C1DF8
󁷹
C1DF9
󁷺
C1DFA
󁷻
C1DFB
󁷼
C1DFC
󁷽
C1DFD
󁷾
C1DFE
󁷿
C1DFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]