International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09DAC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𝬀
1DB00
𝬁
1DB01
𝬂
1DB02
𝬃
1DB03
𝬄
1DB04
𝬅
1DB05
𝬆
1DB06
𝬇
1DB07
𝬈
1DB08
𝬉
1DB09
𝬊
1DB0A
𝬋
1DB0B
𝬌
1DB0C
𝬍
1DB0D
𝬎
1DB0E
𝬏
1DB0F
80
90
𝬐
1DB10
𝬑
1DB11
𝬒
1DB12
𝬓
1DB13
𝬔
1DB14
𝬕
1DB15
𝬖
1DB16
𝬗
1DB17
𝬘
1DB18
𝬙
1DB19
𝬚
1DB1A
𝬛
1DB1B
𝬜
1DB1C
𝬝
1DB1D
𝬞
1DB1E
𝬟
1DB1F
90
A0
𝬠
1DB20
𝬡
1DB21
𝬢
1DB22
𝬣
1DB23
𝬤
1DB24
𝬥
1DB25
𝬦
1DB26
𝬧
1DB27
𝬨
1DB28
𝬩
1DB29
𝬪
1DB2A
𝬫
1DB2B
𝬬
1DB2C
𝬭
1DB2D
𝬮
1DB2E
𝬯
1DB2F
A0
B0
𝬰
1DB30
𝬱
1DB31
𝬲
1DB32
𝬳
1DB33
𝬴
1DB34
𝬵
1DB35
𝬶
1DB36
𝬷
1DB37
𝬸
1DB38
𝬹
1DB39
𝬺
1DB3A
𝬻
1DB3B
𝬼
1DB3C
𝬽
1DB3D
𝬾
1DB3E
𝬿
1DB3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]