International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4858A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􅊀
105280
􅊁
105281
􅊂
105282
􅊃
105283
􅊄
105284
􅊅
105285
􅊆
105286
􅊇
105287
􅊈
105288
􅊉
105289
􅊊
10528A
􅊋
10528B
􅊌
10528C
􅊍
10528D
􅊎
10528E
􅊏
10528F
80
90
􅊐
105290
􅊑
105291
􅊒
105292
􅊓
105293
􅊔
105294
􅊕
105295
􅊖
105296
􅊗
105297
􅊘
105298
􅊙
105299
􅊚
10529A
􅊛
10529B
􅊜
10529C
􅊝
10529D
􅊞
10529E
􅊟
10529F
90
A0
􅊠
1052A0
􅊡
1052A1
􅊢
1052A2
􅊣
1052A3
􅊤
1052A4
􅊥
1052A5
􅊦
1052A6
􅊧
1052A7
􅊨
1052A8
􅊩
1052A9
􅊪
1052AA
􅊫
1052AB
􅊬
1052AC
􅊭
1052AD
􅊮
1052AE
􅊯
1052AF
A0
B0
􅊰
1052B0
􅊱
1052B1
􅊲
1052B2
􅊳
1052B3
􅊴
1052B4
􅊵
1052B5
􅊶
1052B6
􅊷
1052B7
􅊸
1052B8
􅊹
1052B9
􅊺
1052BA
􅊻
1052BB
􅊼
1052BC
􅊽
1052BD
􅊾
1052BE
􅊿
1052BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]