International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F287B8

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򇸀
87E00
򇸁
87E01
򇸂
87E02
򇸃
87E03
򇸄
87E04
򇸅
87E05
򇸆
87E06
򇸇
87E07
򇸈
87E08
򇸉
87E09
򇸊
87E0A
򇸋
87E0B
򇸌
87E0C
򇸍
87E0D
򇸎
87E0E
򇸏
87E0F
80
90
򇸐
87E10
򇸑
87E11
򇸒
87E12
򇸓
87E13
򇸔
87E14
򇸕
87E15
򇸖
87E16
򇸗
87E17
򇸘
87E18
򇸙
87E19
򇸚
87E1A
򇸛
87E1B
򇸜
87E1C
򇸝
87E1D
򇸞
87E1E
򇸟
87E1F
90
A0
򇸠
87E20
򇸡
87E21
򇸢
87E22
򇸣
87E23
򇸤
87E24
򇸥
87E25
򇸦
87E26
򇸧
87E27
򇸨
87E28
򇸩
87E29
򇸪
87E2A
򇸫
87E2B
򇸬
87E2C
򇸭
87E2D
򇸮
87E2E
򇸯
87E2F
A0
B0
򇸰
87E30
򇸱
87E31
򇸲
87E32
򇸳
87E33
򇸴
87E34
򇸵
87E35
򇸶
87E36
򇸷
87E37
򇸸
87E38
򇸹
87E39
򇸺
87E3A
򇸻
87E3B
򇸼
87E3C
򇸽
87E3D
򇸾
87E3E
򇸿
87E3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]