International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
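
The alias list can be read as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: the `normalize` and `resolve` helpers are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is an assumed simplification.

```python
# Aliases copied from the list above; the helpers below are hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every alias maps to the single internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


if __name__ == "__main__":
    for candidate in ["IBM_1208", "windows-65001", "latin-1"]:
        print(candidate, "->", resolve(candidate))
```

Names that are not in the list, such as `latin-1`, resolve to `None` in this sketch.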


Codepage Layout

Currently showing the codepage starting with the bytes F298B8

Rows give the high nibble of the final byte and columns the low nibble. Each cell shows the Unicode code point (hex) of the character encoded by F2 98 B8 followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no characters)
80     98E00 98E01 98E02 98E03 98E04 98E05 98E06 98E07 98E08 98E09 98E0A 98E0B 98E0C 98E0D 98E0E 98E0F
90     98E10 98E11 98E12 98E13 98E14 98E15 98E16 98E17 98E18 98E19 98E1A 98E1B 98E1C 98E1D 98E1E 98E1F
A0     98E20 98E21 98E22 98E23 98E24 98E25 98E26 98E27 98E28 98E29 98E2A 98E2B 98E2C 98E2D 98E2E 98E2F
B0     98E30 98E31 98E32 98E33 98E34 98E35 98E36 98E37 98E38 98E39 98E3A 98E3B 98E3C 98E3D 98E3E 98E3F
C0-F0  (no characters)
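
The layout follows from how a four-byte UTF-8 sequence packs its bits: only final bytes 80 through BF complete the sequence that starts with F2 98 B8. The sketch below uses Python's built-in codec rather than ICU to reproduce the populated rows; the function names are illustrative.

```python
LEAD = bytes.fromhex("F298B8")  # the three leading bytes shown on this page


def decode_cell(trail):
    """Decode LEAD plus one trail byte with the built-in UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_decode(seq):
    """Unpack a four-byte UTF-8 sequence by hand: 3 + 6 + 6 + 6 payload bits."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Only rows 80, 90, A0 and B0 hold valid trail bytes.
    for row in range(0x80, 0xC0, 0x10):
        cells = [decode_cell(row + col) for col in range(16)]
        print(f"{row:02X}  " + " ".join(f"{cp:05X}" for cp in cells))
```

A final byte outside 80 through BF cannot continue the sequence, so the strict decoder raises `UnicodeDecodeError`, which corresponds to the empty rows of the grid.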

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
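
The byte counts above are per UChar, that is, per 16-bit UTF-16 code unit, which is why the maximum is 3 even though a UTF-8 sequence can be four bytes long: a four-byte sequence corresponds to two code units. The following sketch checks this with Python's standard codecs instead of ICU; the sample characters are arbitrary choices.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes listed above


def sizes(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_bytes, utf16_units


# One sample for each UTF-8 sequence length, 1 through 4 bytes.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U00098E00"]

if __name__ == "__main__":
    for ch in SAMPLES:
        utf8_bytes, utf16_units = sizes(ch)
        print(f"U+{ord(ch):04X}: {utf8_bytes} UTF-8 bytes, "
              f"{utf16_units} UTF-16 code unit(s)")
    # The substitution bytes are the UTF-8 form of U+FFFD.
    print(SUBSTITUTION.decode("utf-8") == "\uFFFD")
```

Across these samples the ratio of bytes to code units ranges from 1 (ASCII) to 3 (a three-byte sequence for a single code unit), and the four-byte sample yields 2.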


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
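
The pattern excludes only the surrogate range U+D800 through U+DFFF; every other code point is representable. A minimal sketch of that membership test, compared against Python's own strict UTF-8 encoder (not ICU), follows. The function names are illustrative.

```python
def is_representable(cp):
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """Check the same property using Python's strict UTF-8 encoder."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary code points around the excluded range.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x98E00):
        print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```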