International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A08A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𠊀
20280
𠊁
20281
𠊂
20282
𠊃
20283
𠊄
20284
𠊅
20285
𠊆
20286
𠊇
20287
𠊈
20288
𠊉
20289
𠊊
2028A
𠊋
2028B
𠊌
2028C
𠊍
2028D
𠊎
2028E
𠊏
2028F
80
90
𠊐
20290
𠊑
20291
𠊒
20292
𠊓
20293
𠊔
20294
𠊕
20295
𠊖
20296
𠊗
20297
𠊘
20298
𠊙
20299
𠊚
2029A
𠊛
2029B
𠊜
2029C
𠊝
2029D
𠊞
2029E
𠊟
2029F
90
A0
𠊠
202A0
𠊡
202A1
𠊢
202A2
𠊣
202A3
𠊤
202A4
𠊥
202A5
𠊦
202A6
𠊧
202A7
𠊨
202A8
𠊩
202A9
𠊪
202AA
𠊫
202AB
𠊬
202AC
𠊭
202AD
𠊮
202AE
𠊯
202AF
A0
B0
𠊰
202B0
𠊱
202B1
𠊲
202B2
𠊳
202B3
𠊴
202B4
𠊵
202B5
𠊶
202B6
𠊷
202B7
𠊸
202B8
𠊹
202B9
𠊺
202BA
𠊻
202BB
𠊼
202BC
𠊽
202BD
𠊾
202BE
𠊿
202BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]