International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F288A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򈥀
88940
򈥁
88941
򈥂
88942
򈥃
88943
򈥄
88944
򈥅
88945
򈥆
88946
򈥇
88947
򈥈
88948
򈥉
88949
򈥊
8894A
򈥋
8894B
򈥌
8894C
򈥍
8894D
򈥎
8894E
򈥏
8894F
80
90
򈥐
88950
򈥑
88951
򈥒
88952
򈥓
88953
򈥔
88954
򈥕
88955
򈥖
88956
򈥗
88957
򈥘
88958
򈥙
88959
򈥚
8895A
򈥛
8895B
򈥜
8895C
򈥝
8895D
򈥞
8895E
򈥟
8895F
90
A0
򈥠
88960
򈥡
88961
򈥢
88962
򈥣
88963
򈥤
88964
򈥥
88965
򈥦
88966
򈥧
88967
򈥨
88968
򈥩
88969
򈥪
8896A
򈥫
8896B
򈥬
8896C
򈥭
8896D
򈥮
8896E
򈥯
8896F
A0
B0
򈥰
88970
򈥱
88971
򈥲
88972
򈥳
88973
򈥴
88974
򈥵
88975
򈥶
88976
򈥷
88977
򈥸
88978
򈥹
88979
򈥺
8897A
򈥻
8897B
򈥼
8897C
򈥽
8897D
򈥾
8897E
򈥿
8897F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]