International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B2A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𲥀
32940
𲥁
32941
𲥂
32942
𲥃
32943
𲥄
32944
𲥅
32945
𲥆
32946
𲥇
32947
𲥈
32948
𲥉
32949
𲥊
3294A
𲥋
3294B
𲥌
3294C
𲥍
3294D
𲥎
3294E
𲥏
3294F
80
90
𲥐
32950
𲥑
32951
𲥒
32952
𲥓
32953
𲥔
32954
𲥕
32955
𲥖
32956
𲥗
32957
𲥘
32958
𲥙
32959
𲥚
3295A
𲥛
3295B
𲥜
3295C
𲥝
3295D
𲥞
3295E
𲥟
3295F
90
A0
𲥠
32960
𲥡
32961
𲥢
32962
𲥣
32963
𲥤
32964
𲥥
32965
𲥦
32966
𲥧
32967
𲥨
32968
𲥩
32969
𲥪
3296A
𲥫
3296B
𲥬
3296C
𲥭
3296D
𲥮
3296E
𲥯
3296F
A0
B0
𲥰
32970
𲥱
32971
𲥲
32972
𲥳
32973
𲥴
32974
𲥵
32975
𲥶
32976
𲥷
32977
𲥸
32978
𲥹
32979
𲥺
3297A
𲥻
3297B
𲥼
3297C
𲥽
3297D
𲥾
3297E
𲥿
3297F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]