International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09AA5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𚥀
1A940
𚥁
1A941
𚥂
1A942
𚥃
1A943
𚥄
1A944
𚥅
1A945
𚥆
1A946
𚥇
1A947
𚥈
1A948
𚥉
1A949
𚥊
1A94A
𚥋
1A94B
𚥌
1A94C
𚥍
1A94D
𚥎
1A94E
𚥏
1A94F
80
90
𚥐
1A950
𚥑
1A951
𚥒
1A952
𚥓
1A953
𚥔
1A954
𚥕
1A955
𚥖
1A956
𚥗
1A957
𚥘
1A958
𚥙
1A959
𚥚
1A95A
𚥛
1A95B
𚥜
1A95C
𚥝
1A95D
𚥞
1A95E
𚥟
1A95F
90
A0
𚥠
1A960
𚥡
1A961
𚥢
1A962
𚥣
1A963
𚥤
1A964
𚥥
1A965
𚥦
1A966
𚥧
1A967
𚥨
1A968
𚥩
1A969
𚥪
1A96A
𚥫
1A96B
𚥬
1A96C
𚥭
1A96D
𚥮
1A96E
𚥯
1A96F
A0
B0
𚥰
1A970
𚥱
1A971
𚥲
1A972
𚥳
1A973
𚥴
1A974
𚥵
1A975
𚥶
1A976
𚥷
1A977
𚥸
1A978
𚥹
1A979
𚥺
1A97A
𚥻
1A97B
𚥼
1A97C
𚥽
1A97D
𚥾
1A97E
𚥿
1A97F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]