International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39B83

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󛃀
DB0C0
󛃁
DB0C1
󛃂
DB0C2
󛃃
DB0C3
󛃄
DB0C4
󛃅
DB0C5
󛃆
DB0C6
󛃇
DB0C7
󛃈
DB0C8
󛃉
DB0C9
󛃊
DB0CA
󛃋
DB0CB
󛃌
DB0CC
󛃍
DB0CD
󛃎
DB0CE
󛃏
DB0CF
80
90
󛃐
DB0D0
󛃑
DB0D1
󛃒
DB0D2
󛃓
DB0D3
󛃔
DB0D4
󛃕
DB0D5
󛃖
DB0D6
󛃗
DB0D7
󛃘
DB0D8
󛃙
DB0D9
󛃚
DB0DA
󛃛
DB0DB
󛃜
DB0DC
󛃝
DB0DD
󛃞
DB0DE
󛃟
DB0DF
90
A0
󛃠
DB0E0
󛃡
DB0E1
󛃢
DB0E2
󛃣
DB0E3
󛃤
DB0E4
󛃥
DB0E5
󛃦
DB0E6
󛃧
DB0E7
󛃨
DB0E8
󛃩
DB0E9
󛃪
DB0EA
󛃫
DB0EB
󛃬
DB0EC
󛃭
DB0ED
󛃮
DB0EE
󛃯
DB0EF
A0
B0
󛃰
DB0F0
󛃱
DB0F1
󛃲
DB0F2
󛃳
DB0F3
󛃴
DB0F4
󛃵
DB0F5
󛃶
DB0F6
󛃷
DB0F7
󛃸
DB0F8
󛃹
DB0F9
󛃺
DB0FA
󛃻
DB0FB
󛃼
DB0FC
󛃽
DB0FD
󛃾
DB0FE
󛃿
DB0FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]