International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A4A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󤥀
E4940
󤥁
E4941
󤥂
E4942
󤥃
E4943
󤥄
E4944
󤥅
E4945
󤥆
E4946
󤥇
E4947
󤥈
E4948
󤥉
E4949
󤥊
E494A
󤥋
E494B
󤥌
E494C
󤥍
E494D
󤥎
E494E
󤥏
E494F
80
90
󤥐
E4950
󤥑
E4951
󤥒
E4952
󤥓
E4953
󤥔
E4954
󤥕
E4955
󤥖
E4956
󤥗
E4957
󤥘
E4958
󤥙
E4959
󤥚
E495A
󤥛
E495B
󤥜
E495C
󤥝
E495D
󤥞
E495E
󤥟
E495F
90
A0
󤥠
E4960
󤥡
E4961
󤥢
E4962
󤥣
E4963
󤥤
E4964
󤥥
E4965
󤥦
E4966
󤥧
E4967
󤥨
E4968
󤥩
E4969
󤥪
E496A
󤥫
E496B
󤥬
E496C
󤥭
E496D
󤥮
E496E
󤥯
E496F
A0
B0
󤥰
E4970
󤥱
E4971
󤥲
E4972
󤥳
E4973
󤥴
E4974
󤥵
E4975
󤥶
E4976
󤥷
E4977
󤥸
E4978
󤥹
E4979
󤥺
E497A
󤥻
E497B
󤥼
E497C
󤥽
E497D
󤥾
E497E
󤥿
E497F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]