International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
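
To illustrate how an alias list like this one can be used, here is a minimal sketch of a name lookup. The alias strings come from the list above; the `normalize` and `resolve` helpers and the matching rule (ignore case and punctuation) are assumptions made for this example, and ICU's own matching rules may differ in detail.

```python
import re

# Aliases copied from the list above. The lookup scheme is hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop everything except ASCII letters and digits."""
    return re.sub(r"[^a-z0-9]", "", name.lower())


# Map each normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this sketch, `resolve("IBM_1208")` and `resolve("cp1208")` both return `"UTF-8"`, while a name that is not in the list returns `None`.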


Codepage Layout

Currently showing the codepage starting with the bytes F2A487

      00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00 to 70: (no characters)
80    򤇀     򤇁     򤇂     򤇃     򤇄     򤇅     򤇆     򤇇     򤇈     򤇉     򤇊     򤇋     򤇌     򤇍     򤇎     򤇏
      A41C0 A41C1 A41C2 A41C3 A41C4 A41C5 A41C6 A41C7 A41C8 A41C9 A41CA A41CB A41CC A41CD A41CE A41CF
90    򤇐     򤇑     򤇒     򤇓     򤇔     򤇕     򤇖     򤇗     򤇘     򤇙     򤇚     򤇛     򤇜     򤇝     򤇞     򤇟
      A41D0 A41D1 A41D2 A41D3 A41D4 A41D5 A41D6 A41D7 A41D8 A41D9 A41DA A41DB A41DC A41DD A41DE A41DF
A0    򤇠     򤇡     򤇢     򤇣     򤇤     򤇥     򤇦     򤇧     򤇨     򤇩     򤇪     򤇫     򤇬     򤇭     򤇮     򤇯
      A41E0 A41E1 A41E2 A41E3 A41E4 A41E5 A41E6 A41E7 A41E8 A41E9 A41EA A41EB A41EC A41ED A41EE A41EF
B0    򤇰     򤇱     򤇲     򤇳     򤇴     򤇵     򤇶     򤇷     򤇸     򤇹     򤇺     򤇻     򤇼     򤇽     򤇾     򤇿
      A41F0 A41F1 A41F2 A41F3 A41F4 A41F5 A41F6 A41F7 A41F8 A41F9 A41FA A41FB A41FC A41FD A41FE A41FF
C0 to F0: (no characters)
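
The layout can be reproduced with a short sketch that appends each possible final byte to the lead bytes F2 A4 87 and decodes the result. It uses Python's built-in strict UTF-8 decoder rather than ICU, and the names `PREFIX`, `decode_cell`, and `row` are hypothetical helpers for this example. Only final bytes 80 through BF are valid continuation bytes, which is why the other rows are empty.

```python
# Lead bytes of the codepage section shown above.
PREFIX = bytes.fromhex("F2A487")


def decode_cell(last_byte: int):
    """Return the code point for PREFIX + last_byte, or None if invalid UTF-8."""
    try:
        text = (PREFIX + bytes([last_byte])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return ord(text)


def row(high_nibble: int):
    """Return the 16 cells of one table row, e.g. row(0x80)."""
    return [decode_cell(high_nibble | low) for low in range(16)]


if __name__ == "__main__":
    # Print the table: one line per row, empty cells left blank.
    for high in range(0x00, 0x100, 0x10):
        cells = ["" if cp is None else f"{cp:05X}" for cp in row(high)]
        print(f"{high:02X}", " ".join(cells))
```

Under these assumptions, `decode_cell(0x80)` yields 0xA41C0 and `decode_cell(0xBF)` yields 0xA41FF, matching the first and last populated cells.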

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
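
The byte counts above are stated per UChar, which this sketch reads as one 16-bit UTF-16 code unit; that reading is an assumption of the example, not something the table states. Under it, a 4-byte UTF-8 sequence corresponds to two UChars (a surrogate pair), so the ratio never exceeds 3. The helper name `utf8_bytes_per_uchar` is hypothetical.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """Return UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
SUBSTITUTION = "\uFFFD".encode("utf-8")
```

For example, `utf8_bytes_per_uchar("A")` is 1.0, `utf8_bytes_per_uchar("\uFFFD")` is 3.0, and a supplementary character such as U+A41C0 gives 2.0 (four bytes over two code units).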

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
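
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough cross-check, the sketch below compares that rule with Python's built-in strict UTF-8 encoder, which is a stand-in here and not the ICU converter; `is_representable` and `python_can_encode` are hypothetical names.

```python
def is_representable(code_point: int) -> bool:
    """Membership test for the set [^\\uD800-\\uDFFF]."""
    return not (0xD800 <= code_point <= 0xDFFF)


def python_can_encode(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


# Sample code points around the surrogate boundaries and at the range ends.
SAMPLES = [0x0000, 0x007F, 0xD7FF, 0xD800, 0xDBFF, 0xDFFF, 0xE000, 0xA41C0, 0x10FFFF]
```

On these samples the two functions agree: every code point is accepted except the three that fall inside the surrogate range.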