International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
WINDOWS
UTF-8 windows-65001
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F2809C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򀜀
80700
򀜁
80701
򀜂
80702
򀜃
80703
򀜄
80704
򀜅
80705
򀜆
80706
򀜇
80707
򀜈
80708
򀜉
80709
򀜊
8070A
򀜋
8070B
򀜌
8070C
򀜍
8070D
򀜎
8070E
򀜏
8070F
80
90
򀜐
80710
򀜑
80711
򀜒
80712
򀜓
80713
򀜔
80714
򀜕
80715
򀜖
80716
򀜗
80717
򀜘
80718
򀜙
80719
򀜚
8071A
򀜛
8071B
򀜜
8071C
򀜝
8071D
򀜞
8071E
򀜟
8071F
90
A0
򀜠
80720
򀜡
80721
򀜢
80722
򀜣
80723
򀜤
80724
򀜥
80725
򀜦
80726
򀜧
80727
򀜨
80728
򀜩
80729
򀜪
8072A
򀜫
8072B
򀜬
8072C
򀜭
8072D
򀜮
8072E
򀜯
8072F
A0
B0
򀜰
80730
򀜱
80731
򀜲
80732
򀜳
80733
򀜴
80734
򀜵
80735
򀜶
80736
򀜷
80737
򀜸
80738
򀜹
80739
򀜺
8073A
򀜻
8073B
򀜼
8073C
򀜽
8073D
򀜾
8073E
򀜿
8073F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]