International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AD87

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󭇀
ED1C0
󭇁
ED1C1
󭇂
ED1C2
󭇃
ED1C3
󭇄
ED1C4
󭇅
ED1C5
󭇆
ED1C6
󭇇
ED1C7
󭇈
ED1C8
󭇉
ED1C9
󭇊
ED1CA
󭇋
ED1CB
󭇌
ED1CC
󭇍
ED1CD
󭇎
ED1CE
󭇏
ED1CF
80
90
󭇐
ED1D0
󭇑
ED1D1
󭇒
ED1D2
󭇓
ED1D3
󭇔
ED1D4
󭇕
ED1D5
󭇖
ED1D6
󭇗
ED1D7
󭇘
ED1D8
󭇙
ED1D9
󭇚
ED1DA
󭇛
ED1DB
󭇜
ED1DC
󭇝
ED1DD
󭇞
ED1DE
󭇟
ED1DF
90
A0
󭇠
ED1E0
󭇡
ED1E1
󭇢
ED1E2
󭇣
ED1E3
󭇤
ED1E4
󭇥
ED1E5
󭇦
ED1E6
󭇧
ED1E7
󭇨
ED1E8
󭇩
ED1E9
󭇪
ED1EA
󭇫
ED1EB
󭇬
ED1EC
󭇭
ED1ED
󭇮
ED1EE
󭇯
ED1EF
A0
B0
󭇰
ED1F0
󭇱
ED1F1
󭇲
ED1F2
󭇳
ED1F3
󭇴
ED1F4
󭇵
ED1F5
󭇶
ED1F6
󭇷
ED1F7
󭇸
ED1F8
󭇹
ED1F9
󭇺
ED1FA
󭇻
ED1FB
󭇼
ED1FC
󭇽
ED1FD
󭇾
ED1FE
󭇿
ED1FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]