International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name    All Aliases
UTF-8                      UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305,
                           ibm-13496, ibm-13497, ibm-17592, ibm-17593,
                           windows-65001, cp1208, x-UTF_8J,
                           unicode-1-1-utf-8, unicode-2-0-utf-8
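
The alias table can be read as a many-to-one mapping from alias to internal converter name. The following is a minimal sketch of that idea in plain Python, not ICU's own lookup: the names `UTF8_ALIASES` and `canonical_name` are hypothetical, and the matching only ignores case, whereas ICU's real alias comparison may be more lenient.

```python
# Aliases copied from the table above; the helper itself is illustrative only.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def canonical_name(name):
    """Return the internal converter name for a listed alias, else None.

    Assumption: case-insensitive comparison only (a simplification).
    """
    folded = name.casefold()
    for alias in UTF8_ALIASES:
        if alias.casefold() == folded:
            return "UTF-8"
    return None


print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```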


Codepage Layout

Currently showing the codepage starting with the bytes F2 84 9F

Rows give the high nibble of the final byte and columns the low nibble. Each cell shows the character followed by its code point in hexadecimal. Rows 00-70 and C0-F0 are empty.

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
80  򄟀 847C0  򄟁 847C1  򄟂 847C2  򄟃 847C3  򄟄 847C4  򄟅 847C5  򄟆 847C6  򄟇 847C7  򄟈 847C8  򄟉 847C9  򄟊 847CA  򄟋 847CB  򄟌 847CC  򄟍 847CD  򄟎 847CE  򄟏 847CF
90  򄟐 847D0  򄟑 847D1  򄟒 847D2  򄟓 847D3  򄟔 847D4  򄟕 847D5  򄟖 847D6  򄟗 847D7  򄟘 847D8  򄟙 847D9  򄟚 847DA  򄟛 847DB  򄟜 847DC  򄟝 847DD  򄟞 847DE  򄟟 847DF
A0  򄟠 847E0  򄟡 847E1  򄟢 847E2  򄟣 847E3  򄟤 847E4  򄟥 847E5  򄟦 847E6  򄟧 847E7  򄟨 847E8  򄟩 847E9  򄟪 847EA  򄟫 847EB  򄟬 847EC  򄟭 847ED  򄟮 847EE  򄟯 847EF
B0  򄟰 847F0  򄟱 847F1  򄟲 847F2  򄟳 847F3  򄟴 847F4  򄟵 847F5  򄟶 847F6  򄟷 847F7  򄟸 847F8  򄟹 847F9  򄟺 847FA  򄟻 847FB  򄟼 847FC  򄟽 847FD  򄟾 847FE  򄟿 847FF
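
The grid follows from how UTF-8 packs a code point into a four-byte sequence. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and the helper names (`decode_cell`, `manual_code_point`, `is_valid_final`) are made up for illustration. It decodes the lead bytes F2 84 9F with each possible final byte and checks the result against the bit arithmetic.

```python
LEAD = bytes([0xF2, 0x84, 0x9F])  # the three lead bytes of this codepage view


def decode_cell(final_byte):
    """Decode LEAD + final_byte as UTF-8 and return the code point."""
    return ord((LEAD + bytes([final_byte])).decode("utf-8"))


def manual_code_point(b0, b1, b2, b3):
    """Assemble a code point from a four-byte UTF-8 sequence by hand:
    3 payload bits from the first byte, 6 from each continuation byte."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def is_valid_final(final_byte):
    """True if LEAD + final_byte is a well-formed UTF-8 sequence."""
    try:
        decode_cell(final_byte)
        return True
    except UnicodeDecodeError:
        return False


# Final bytes 80..BF fill rows 80-B0 of the grid.
for final in (0x80, 0xBF):
    print(f"F2 84 9F {final:02X} -> U+{decode_cell(final):X}")

# Bytes outside 80..BF are not continuation bytes, so those rows stay empty.
print([f"{b:02X}" for b in (0x7F, 0x80, 0xBF, 0xC0) if is_valid_final(b)])
```

Only final bytes in the range 80 to BF carry the `10xxxxxx` continuation pattern, which is why the other rows of the grid have no entries.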

Information About This Converter
Type of converter:                       UCNV_UTF8
Minimum number of bytes per UChar:       1
Maximum number of bytes per UChar:       3
Substitution character:                  \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
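
The byte counts above are per UChar, which in ICU is a UTF-16 code unit, not per code point. The following sketch, written against Python's standard codecs rather than ICU, illustrates why the maximum is 3: a supplementary character takes 4 UTF-8 bytes but spans 2 UTF-16 code units. The helper name `bytes_per_utf16_unit` is hypothetical. The sketch also shows that the listed substitution bytes are the UTF-8 form of U+FFFD.

```python
def bytes_per_utf16_unit(ch):
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000847C0"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)

# Python's decoder substitutes the same character for malformed input
# when asked to replace instead of raising.
replaced = b"\xff".decode("utf-8", errors="replace")
print(replaced == "\ufffd")
```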

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
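
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check using Python's strict UTF-8 codec (an assumption: it stands in for the ICU converter here, and `representable` is a made-up helper name), code points on either side of that range encode, while the surrogates themselves are rejected.

```python
def representable(code_point):
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder refuses lone surrogates.
        return False


# Probe the boundaries of the excluded range and the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {representable(cp)}")
```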