International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48787

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􇇀
1071C0
􇇁
1071C1
􇇂
1071C2
􇇃
1071C3
􇇄
1071C4
􇇅
1071C5
􇇆
1071C6
􇇇
1071C7
􇇈
1071C8
􇇉
1071C9
􇇊
1071CA
􇇋
1071CB
􇇌
1071CC
􇇍
1071CD
􇇎
1071CE
􇇏
1071CF
80
90
􇇐
1071D0
􇇑
1071D1
􇇒
1071D2
􇇓
1071D3
􇇔
1071D4
􇇕
1071D5
􇇖
1071D6
􇇗
1071D7
􇇘
1071D8
􇇙
1071D9
􇇚
1071DA
􇇛
1071DB
􇇜
1071DC
􇇝
1071DD
􇇞
1071DE
􇇟
1071DF
90
A0
􇇠
1071E0
􇇡
1071E1
􇇢
1071E2
􇇣
1071E3
􇇤
1071E4
􇇥
1071E5
􇇦
1071E6
􇇧
1071E7
􇇨
1071E8
􇇩
1071E9
􇇪
1071EA
􇇫
1071EB
􇇬
1071EC
􇇭
1071ED
􇇮
1071EE
􇇯
1071EF
A0
B0
􇇰
1071F0
􇇱
1071F1
􇇲
1071F2
􇇳
1071F3
􇇴
1071F4
􇇵
1071F5
􇇶
1071F6
􇇷
1071F7
􇇸
1071F8
􇇹
1071F9
􇇺
1071FA
􇇻
1071FB
􇇼
1071FC
􇇽
1071FD
􇇾
1071FE
􇇿
1071FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]