International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3BF83

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󿃀
FF0C0
󿃁
FF0C1
󿃂
FF0C2
󿃃
FF0C3
󿃄
FF0C4
󿃅
FF0C5
󿃆
FF0C6
󿃇
FF0C7
󿃈
FF0C8
󿃉
FF0C9
󿃊
FF0CA
󿃋
FF0CB
󿃌
FF0CC
󿃍
FF0CD
󿃎
FF0CE
󿃏
FF0CF
80
90
󿃐
FF0D0
󿃑
FF0D1
󿃒
FF0D2
󿃓
FF0D3
󿃔
FF0D4
󿃕
FF0D5
󿃖
FF0D6
󿃗
FF0D7
󿃘
FF0D8
󿃙
FF0D9
󿃚
FF0DA
󿃛
FF0DB
󿃜
FF0DC
󿃝
FF0DD
󿃞
FF0DE
󿃟
FF0DF
90
A0
󿃠
FF0E0
󿃡
FF0E1
󿃢
FF0E2
󿃣
FF0E3
󿃤
FF0E4
󿃥
FF0E5
󿃦
FF0E6
󿃧
FF0E7
󿃨
FF0E8
󿃩
FF0E9
󿃪
FF0EA
󿃫
FF0EB
󿃬
FF0EC
󿃭
FF0ED
󿃮
FF0EE
󿃯
FF0EF
A0
B0
󿃰
FF0F0
󿃱
FF0F1
󿃲
FF0F2
󿃳
FF0F3
󿃴
FF0F4
󿃵
FF0F5
󿃶
FF0F6
󿃷
FF0F7
󿃸
FF0F8
󿃹
FF0F9
󿃺
FF0FA
󿃻
FF0FB
󿃼
FF0FC
󿃽
FF0FD
󿃾
FF0FE
󿃿
FF0FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]