International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38B87

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󋇀
CB1C0
󋇁
CB1C1
󋇂
CB1C2
󋇃
CB1C3
󋇄
CB1C4
󋇅
CB1C5
󋇆
CB1C6
󋇇
CB1C7
󋇈
CB1C8
󋇉
CB1C9
󋇊
CB1CA
󋇋
CB1CB
󋇌
CB1CC
󋇍
CB1CD
󋇎
CB1CE
󋇏
CB1CF
80
90
󋇐
CB1D0
󋇑
CB1D1
󋇒
CB1D2
󋇓
CB1D3
󋇔
CB1D4
󋇕
CB1D5
󋇖
CB1D6
󋇗
CB1D7
󋇘
CB1D8
󋇙
CB1D9
󋇚
CB1DA
󋇛
CB1DB
󋇜
CB1DC
󋇝
CB1DD
󋇞
CB1DE
󋇟
CB1DF
90
A0
󋇠
CB1E0
󋇡
CB1E1
󋇢
CB1E2
󋇣
CB1E3
󋇤
CB1E4
󋇥
CB1E5
󋇦
CB1E6
󋇧
CB1E7
󋇨
CB1E8
󋇩
CB1E9
󋇪
CB1EA
󋇫
CB1EB
󋇬
CB1EC
󋇭
CB1ED
󋇮
CB1EE
󋇯
CB1EF
A0
B0
󋇰
CB1F0
󋇱
CB1F1
󋇲
CB1F2
󋇳
CB1F3
󋇴
CB1F4
󋇵
CB1F5
󋇶
CB1F6
󋇷
CB1F7
󋇸
CB1F8
󋇹
CB1F9
󋇺
CB1FA
󋇻
CB1FB
󋇼
CB1FC
󋇽
CB1FD
󋇾
CB1FE
󋇿
CB1FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]