International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09C8A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𜊀
1C280
𜊁
1C281
𜊂
1C282
𜊃
1C283
𜊄
1C284
𜊅
1C285
𜊆
1C286
𜊇
1C287
𜊈
1C288
𜊉
1C289
𜊊
1C28A
𜊋
1C28B
𜊌
1C28C
𜊍
1C28D
𜊎
1C28E
𜊏
1C28F
80
90
𜊐
1C290
𜊑
1C291
𜊒
1C292
𜊓
1C293
𜊔
1C294
𜊕
1C295
𜊖
1C296
𜊗
1C297
𜊘
1C298
𜊙
1C299
𜊚
1C29A
𜊛
1C29B
𜊜
1C29C
𜊝
1C29D
𜊞
1C29E
𜊟
1C29F
90
A0
𜊠
1C2A0
𜊡
1C2A1
𜊢
1C2A2
𜊣
1C2A3
𜊤
1C2A4
𜊥
1C2A5
𜊦
1C2A6
𜊧
1C2A7
𜊨
1C2A8
𜊩
1C2A9
𜊪
1C2AA
𜊫
1C2AB
𜊬
1C2AC
𜊭
1C2AD
𜊮
1C2AE
𜊯
1C2AF
A0
B0
𜊰
1C2B0
𜊱
1C2B1
𜊲
1C2B2
𜊳
1C2B3
𜊴
1C2B4
𜊵
1C2B5
𜊶
1C2B6
𜊷
1C2B7
𜊸
1C2B8
𜊹
1C2B9
𜊺
1C2BA
𜊻
1C2BB
𜊼
1C2BC
𜊽
1C2BD
𜊾
1C2BE
𜊿
1C2BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]