International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
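
The alias list above can be read as a many-to-one lookup table. The following is a minimal sketch of such a lookup in plain Python, not ICU's own code: the names `ALIASES`, `normalize`, and `resolve` are hypothetical, and the loose matching rule (ignore case and the separators `-`, `_`, and space) is an assumption that may differ in detail from what ICU actually does.

```python
# Aliases copied from the table above; every one maps to the same converter.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Assumed loose matching: lowercase and drop '-', '_' and spaces."""
    return "".join(c for c in name.lower() if c not in "-_ ")

# Map each normalized alias to the internal converter name.
LOOKUP = {normalize(alias): CANONICAL for alias in ALIASES}

def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))

print(resolve("IBM_1208"))
print(resolve("windows-65001"))
print(resolve("latin-1"))
```

Normalizing both the table keys and the query keeps the lookup a single dictionary access, at the cost of treating names that differ only in separators as identical.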


Codepage Layout

Currently showing the codepage starting with the bytes F2BCA5

Each cell gives the code point (hexadecimal) for the final byte formed by adding the row and column labels. Rows 00 through 70 and C0 through F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  BC940 BC941 BC942 BC943 BC944 BC945 BC946 BC947 BC948 BC949 BC94A BC94B BC94C BC94D BC94E BC94F
90  BC950 BC951 BC952 BC953 BC954 BC955 BC956 BC957 BC958 BC959 BC95A BC95B BC95C BC95D BC95E BC95F
A0  BC960 BC961 BC962 BC963 BC964 BC965 BC966 BC967 BC968 BC969 BC96A BC96B BC96C BC96D BC96E BC96F
B0  BC970 BC971 BC972 BC973 BC974 BC975 BC976 BC977 BC978 BC979 BC97A BC97B BC97C BC97D BC97E BC97F
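
The layout above pairs each final byte with a code point. As a rough check, the sketch below rebuilds the same grid using only Python's built-in UTF-8 codec (not ICU); `LEAD`, `cell_code_point`, and `manual_code_point` are illustrative names, not part of any ICU API.

```python
# Lead bytes shown on the page; the fourth (final) byte selects the cell.
LEAD = bytes.fromhex("F2BCA5")

def cell_code_point(final: int) -> int:
    """Decode LEAD + final byte as UTF-8 and return the code point."""
    if not 0x80 <= final <= 0xBF:
        raise ValueError("final byte must be a continuation byte, 0x80..0xBF")
    return ord((LEAD + bytes([final])).decode("utf-8"))

def manual_code_point(final: int) -> int:
    """Same value by bit arithmetic: 3 + 6 + 6 + 6 payload bits."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (final & 0x3F)
    )

# Print one line per populated row (80, 90, A0, B0), 16 cells each.
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{cell_code_point(row + col):05X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

Only final bytes 80 through BF are valid continuation bytes, which is consistent with the other rows of the layout being empty.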

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
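
Several of the properties above can be spot-checked with Python's standard UTF-8 codec. This is a sketch under the assumption that a UChar is one UTF-16 code unit; `bytes_per_uchar` is a hypothetical helper, and the sample characters are chosen only for illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# One sample per UTF-8 length class: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000BC940"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(min(ratios), max(ratios))

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
substitution = bytes.fromhex("EFBFBD").decode("utf-8")
print(f"U+{ord(substitution):04X}")

# ASCII compatibility: each character in 0x20..0x7E encodes to its own value.
ascii_compatible = all(
    chr(cp).encode("utf-8") == bytes([cp]) for cp in range(0x20, 0x7F)
)
print(ascii_compatible)
```

A four-byte sequence corresponds to a surrogate pair, that is two UTF-16 code units, so its ratio is 2 rather than 4. That is why the per-UChar maximum stays at 3.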

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
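
The set above excludes only the surrogate range. A small sketch using Python's strict UTF-8 codec (an illustration, not the ICU converter itself) shows the same boundary; `representable` is a hypothetical helper name.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded by the strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python rejects lone surrogates (U+D800..U+DFFF) in UTF-8.
        return False

# Probe both edges of the excluded range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", representable(cp))
```

Surrogates are reserved for UTF-16 pairs and are not scalar values, so a conforming UTF-8 encoder has nothing to emit for them.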