International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0 B9 A5

Rows give the high nibble of the final byte, columns the low nibble. Each cell shows the character with its code point (hexadecimal) on the line below.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     𹥀    𹥁    𹥂    𹥃    𹥄    𹥅    𹥆    𹥇    𹥈    𹥉    𹥊    𹥋    𹥌    𹥍    𹥎    𹥏
       39940 39941 39942 39943 39944 39945 39946 39947 39948 39949 3994A 3994B 3994C 3994D 3994E 3994F
90     𹥐    𹥑    𹥒    𹥓    𹥔    𹥕    𹥖    𹥗    𹥘    𹥙    𹥚    𹥛    𹥜    𹥝    𹥞    𹥟
       39950 39951 39952 39953 39954 39955 39956 39957 39958 39959 3995A 3995B 3995C 3995D 3995E 3995F
A0     𹥠    𹥡    𹥢    𹥣    𹥤    𹥥    𹥦    𹥧    𹥨    𹥩    𹥪    𹥫    𹥬    𹥭    𹥮    𹥯
       39960 39961 39962 39963 39964 39965 39966 39967 39968 39969 3996A 3996B 3996C 3996D 3996E 3996F
B0     𹥰    𹥱    𹥲    𹥳    𹥴    𹥵    𹥶    𹥷    𹥸    𹥹    𹥺    𹥻    𹥼    𹥽    𹥾    𹥿
       39970 39971 39972 39973 39974 39975 39976 39977 39978 39979 3997A 3997B 3997C 3997D 3997E 3997F
C0-F0  (empty)
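
The layout above follows from how UTF-8 packs payload bits into a four-byte sequence: the lead byte contributes 3 bits and each continuation byte contributes 6. The following is a minimal sketch in plain Python, not ICU code. The helper name `decode_four_byte` is hypothetical, and it assumes its input is already a well-formed sequence (it does no validation). It recomputes the first and last code points of the page from the lead bytes F0 B9 A5.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point.

    Assumes the bytes already form a valid sequence; no validation is done.
    """
    # Lead byte 11110xxx carries 3 bits; each continuation byte 10xxxxxx carries 6.
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


LEAD = (0xF0, 0xB9, 0xA5)  # the three leading bytes shown on this page

first = decode_four_byte(*LEAD, 0x80)  # lowest valid final byte
last = decode_four_byte(*LEAD, 0xBF)   # highest valid final byte
print(f"{first:X} {last:X}")  # prints "39940 3997F"

# Cross-check against Python's built-in UTF-8 codec.
assert ord(bytes([*LEAD, 0x80]).decode("utf-8")) == first
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.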

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
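
The byte counts are given per UChar, that is, per UTF-16 code unit rather than per character. That is why the maximum is 3 even though a supplementary character takes 4 bytes in UTF-8: it also takes two UChars. The sketch below uses Python's standard codecs as a stand-in for ICU; the helper name `bytes_per_uchar` and the sample characters are illustrative assumptions, not part of the ICU API.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, Latin-1 supplement, a BMP symbol, and a supplementary code point.
samples = ["A", "\u00E9", "\u20AC", "\U00039940"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)  # prints [1.0, 2.0, 3.0, 2.0]

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution)  # prints b'\xef\xbf\xbd'
```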


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
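
The set above excludes only the surrogate range U+D800 to U+DFFF, which is reserved for UTF-16 and has no well-formed UTF-8 encoding. A short sketch, assuming Python's strict UTF-8 codec behaves like the converter in this respect (the helper name `is_representable` is hypothetical):

```python
def is_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800 to U+DFFF).
        return False


# Code points on both edges of the surrogate range, plus the last code point.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```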