International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
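
The alias list above can be treated as a lookup table from alias to internal converter name. The following is a minimal sketch of such a lookup; the `resolve` helper and its plain case-insensitive comparison are assumptions made for illustration and are not ICU's own name-matching rules.

```python
from typing import Optional

# Aliases copied from the list above; they all map to one internal converter.
CANONICAL_NAME = "UTF-8"
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def resolve(name: str) -> Optional[str]:
    """Return the internal converter name for an alias, or None if unknown.

    Hypothetical helper: matching is a simple case-insensitive comparison,
    which is an assumption and may be stricter than what ICU accepts.
    """
    wanted = name.casefold()
    for alias in UTF8_ALIASES:
        if alias.casefold() == wanted:
            return CANONICAL_NAME
    return None


if __name__ == "__main__":
    for candidate in ("IBM-1208", "cp1208", "latin-1"):
        print(candidate, "->", resolve(candidate))
```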


Codepage Layout

Currently showing the codepage starting with the bytes F3 A4 87.

Rows give the high nibble and columns the low nibble of the fourth byte; each cell is the resulting code point in hexadecimal. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  E41C0 E41C1 E41C2 E41C3 E41C4 E41C5 E41C6 E41C7 E41C8 E41C9 E41CA E41CB E41CC E41CD E41CE E41CF
90  E41D0 E41D1 E41D2 E41D3 E41D4 E41D5 E41D6 E41D7 E41D8 E41D9 E41DA E41DB E41DC E41DD E41DE E41DF
A0  E41E0 E41E1 E41E2 E41E3 E41E4 E41E5 E41E6 E41E7 E41E8 E41E9 E41EA E41EB E41EC E41ED E41EE E41EF
B0  E41F0 E41F1 E41F2 E41F3 E41F4 E41F5 E41F6 E41F7 E41F8 E41F9 E41FA E41FB E41FC E41FD E41FE E41FF
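
The layout above can be reproduced by decoding each four-byte sequence that starts with F3 A4 87. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and the `decode_row` helper name is hypothetical; it only illustrates why trailing bytes 80-BF map to code points E41C0-E41FF while every other trailing byte leaves its cell empty.

```python
from typing import List, Optional

LEAD = bytes.fromhex("F3A487")  # the three leading bytes shown on this page


def decode_cell(lead: bytes, last: int) -> Optional[int]:
    """Decode lead + last as UTF-8; return the code point or None if invalid."""
    try:
        return ord((lead + bytes([last])).decode("utf-8"))
    except UnicodeDecodeError:
        # Not a valid continuation byte, so the cell stays empty.
        return None


def decode_row(lead: bytes, high_nibble: int) -> List[Optional[int]]:
    """Return the 16 cells of one table row (high nibble of the last byte)."""
    return [decode_cell(lead, (high_nibble << 4) | low) for low in range(16)]


if __name__ == "__main__":
    for high in range(16):
        cells = decode_row(LEAD, high)
        text = " ".join("%05X" % cp if cp is not None else "  -  " for cp in cells)
        print("%X0  %s" % (high, text))
```

Only rows 80 through B0 produce values, because a UTF-8 continuation byte must have the bit pattern 10xxxxxx.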

Information About This Converter
Type of converter:                        UCNV_UTF8
Minimum number of bytes per UChar:        1
Maximum number of bytes per UChar:        3
Substitution character:                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?          TRUE
Is ASCII [\u0020-\u007E] ambiguous?       FALSE
Contains ambiguous aliases?               FALSE
Always generates Unicode NFC?             UNKNOWN
Contains BiDi characters?                 TRUE
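
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a supplementary character (4 UTF-8 bytes, 2 UChars) still averages 2 bytes per UChar and the maximum of 3 comes from the Basic Multilingual Plane. The sketch below checks this with Python's built-in codecs, not ICU; the `bytes_per_uchar` helper is a hypothetical name introduced for illustration. It also shows that the substitution bytes \xEF\xBF\xBD are the UTF-8 encoding of U+FFFD.

```python
from typing import Tuple

# Substitution bytes as listed in the converter information above.
SUBSTITUTION = b"\xEF\xBF\xBD"


def bytes_per_uchar(ch: str) -> Tuple[int, int]:
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


if __name__ == "__main__":
    for ch in ("A", "\u20AC", "\U000E41C0"):
        utf8_len, units = bytes_per_uchar(ch)
        print("U+%04X: %d bytes / %d UChar(s)" % (ord(ch), utf8_len, units))
    # The substitution bytes decode to the replacement character U+FFFD.
    print("U+%04X" % ord(SUBSTITUTION.decode("utf-8")))
```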


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
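
The set above reads as "every Unicode code point except the surrogate range D800-DFFF". A minimal sketch of that membership test follows; the `is_representable` and `encodes_in_utf8` names are hypothetical, and the cross-check relies on Python's strict UTF-8 codec, which is assumed here to agree with the set shown rather than being ICU's implementation.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp: int) -> bool:
    """Cross-check: does Python's strict UTF-8 codec accept this code point?"""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates under the default strict error handler.
        return False


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE41C0, 0x10FFFF):
        print("U+%04X" % cp, is_representable(cp), encodes_in_utf8(cp))
```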