International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29CB8

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򜸀
9CE00
򜸁
9CE01
򜸂
9CE02
򜸃
9CE03
򜸄
9CE04
򜸅
9CE05
򜸆
9CE06
򜸇
9CE07
򜸈
9CE08
򜸉
9CE09
򜸊
9CE0A
򜸋
9CE0B
򜸌
9CE0C
򜸍
9CE0D
򜸎
9CE0E
򜸏
9CE0F
80
90
򜸐
9CE10
򜸑
9CE11
򜸒
9CE12
򜸓
9CE13
򜸔
9CE14
򜸕
9CE15
򜸖
9CE16
򜸗
9CE17
򜸘
9CE18
򜸙
9CE19
򜸚
9CE1A
򜸛
9CE1B
򜸜
9CE1C
򜸝
9CE1D
򜸞
9CE1E
򜸟
9CE1F
90
A0
򜸠
9CE20
򜸡
9CE21
򜸢
9CE22
򜸣
9CE23
򜸤
9CE24
򜸥
9CE25
򜸦
9CE26
򜸧
9CE27
򜸨
9CE28
򜸩
9CE29
򜸪
9CE2A
򜸫
9CE2B
򜸬
9CE2C
򜸭
9CE2D
򜸮
9CE2E
򜸯
9CE2F
A0
B0
򜸰
9CE30
򜸱
9CE31
򜸲
9CE32
򜸳
9CE33
򜸴
9CE34
򜸵
9CE35
򜸶
9CE36
򜸷
9CE37
򜸸
9CE38
򜸹
9CE39
򜸺
9CE3A
򜸻
9CE3B
򜸼
9CE3C
򜸽
9CE3D
򜸾
9CE3E
򜸿
9CE3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]