International Components for Unicode


UTF-8


List of Converter Aliases
Internal converter name: UTF-8

All aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
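
As a rough illustration of how the aliases listed above resolve to one internal converter name, here is a minimal Python sketch. The helper names (`_fold`, `canonical_converter_name`) are hypothetical, and the folding rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption made for this example, not a statement of ICU's exact matching algorithm.

```python
# Aliases copied from the list above. Everything else here is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def _fold(name: str) -> str:
    """Assumed folding rule: lowercase, then drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every folded alias to the internal converter name.
_CANONICAL = {_fold(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_converter_name(name: str):
    """Return 'UTF-8' if `name` is one of the listed aliases, else None."""
    return _CANONICAL.get(_fold(name))


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows-65001", "latin-1"):
        print(candidate, "->", canonical_converter_name(candidate))
```

A dictionary keyed on the folded name keeps the lookup constant-time and makes it easy to add other converters later.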


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with the bytes F4 89 83.

Rows give the high nibble of the final byte and columns give its low nibble. Each cell is the code point (hexadecimal) of the character at that position. Rows 00 through 70 and C0 through F0 are empty.

     00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80   1090C0 1090C1 1090C2 1090C3 1090C4 1090C5 1090C6 1090C7 1090C8 1090C9 1090CA 1090CB 1090CC 1090CD 1090CE 1090CF
90   1090D0 1090D1 1090D2 1090D3 1090D4 1090D5 1090D6 1090D7 1090D8 1090D9 1090DA 1090DB 1090DC 1090DD 1090DE 1090DF
A0   1090E0 1090E1 1090E2 1090E3 1090E4 1090E5 1090E6 1090E7 1090E8 1090E9 1090EA 1090EB 1090EC 1090ED 1090EE 1090EF
B0   1090F0 1090F1 1090F2 1090F3 1090F4 1090F5 1090F6 1090F7 1090F8 1090F9 1090FA 1090FB 1090FC 1090FD 1090FE 1090FF
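
The code points in the grid above follow directly from the four-byte UTF-8 bit layout (`11110xxx 10xxxxxx 10xxxxxx 10xxxxxx`). The sketch below uses Python's built-in `utf-8` codec rather than ICU, and the function names are made up for this example. It decodes the lead bytes F4 89 83 plus each possible trail byte, then cross-checks the result with plain bit arithmetic.

```python
LEAD = bytes.fromhex("F48983")  # the three lead bytes shown on this page


def decode_cell(trail: int) -> int:
    """Decode LEAD + trail with the standard codec and return the code point."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(trail: int) -> int:
    """Compute the same code point from the 4-byte UTF-8 bit layout."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


if __name__ == "__main__":
    # Only trail bytes 0x80..0xBF are valid continuation bytes,
    # which is why the other rows of the grid are empty.
    for row in range(0x80, 0xC0, 0x10):
        cells = [f"{decode_cell(row + col):06X}" for col in range(16)]
        print(f"{row:02X}  " + " ".join(cells))
```

Trail bytes outside 80 through BF are not valid continuation bytes, so `decode_cell` raises `UnicodeDecodeError` for them.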

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
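
Several of these properties can be checked with a short script. The sketch below uses Python's built-in codecs rather than ICU, and the helper name is hypothetical. It relies on the fact that an ICU `UChar` is a 16-bit UTF-16 code unit. That is why the maximum is 3 bytes per UChar: a supplementary character takes 4 bytes in UTF-8 but occupies 2 UChars.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2
    return n_bytes / n_units


# The substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")

# ASCII compatibility: every character in [\x20-\x7E] encodes to the same single byte.
ASCII_COMPATIBLE = all(
    chr(cp).encode("utf-8") == bytes([cp]) for cp in range(0x20, 0x7F)
)

if __name__ == "__main__":
    print(utf8_bytes_per_utf16_unit("A"))           # ASCII character
    print(utf8_bytes_per_utf16_unit("\uffff"))      # highest BMP code point
    print(utf8_bytes_per_utf16_unit("\U001090C0"))  # supplementary character
    print(SUBSTITUTION, ASCII_COMPATIBLE)
```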

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
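
The set above covers every code point except the surrogates U+D800 through U+DFFF. As a minimal sketch, again using Python's standard library rather than ICU and a hypothetical helper name, the same set can be written as a regular expression and compared against what the built-in `utf-8` codec accepts.

```python
import re

# Same set as above: any single code point that is not a surrogate.
REPRESENTABLE = re.compile(r"[^\uD800-\uDFFF]")


def is_representable(text: str) -> bool:
    """True if every code point in `text` falls inside the set."""
    return all(REPRESENTABLE.fullmatch(ch) is not None for ch in text)


def encodes_as_utf8(text: str) -> bool:
    """True if Python's strict utf-8 codec can encode `text`."""
    try:
        text.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for sample in ("A", "\uffff", "\U0010ffff", "\ud800", "\udfff"):
        print(f"U+{ord(sample):04X}", is_representable(sample), encodes_as_utf8(sample))
```

For each sample the two checks agree: lone surrogates fall outside the set and are rejected by the strict codec.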