International Components for Unicode


UTF-8

List of Converter Aliases

Internal converter name: UTF-8

All aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48683

Rows give the high nibble of the final byte, columns the low nibble. Each cell is the code point (hex) of the character at that position.

          00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00 to 70  (empty)
80        1060C0 1060C1 1060C2 1060C3 1060C4 1060C5 1060C6 1060C7 1060C8 1060C9 1060CA 1060CB 1060CC 1060CD 1060CE 1060CF
90        1060D0 1060D1 1060D2 1060D3 1060D4 1060D5 1060D6 1060D7 1060D8 1060D9 1060DA 1060DB 1060DC 1060DD 1060DE 1060DF
A0        1060E0 1060E1 1060E2 1060E3 1060E4 1060E5 1060E6 1060E7 1060E8 1060E9 1060EA 1060EB 1060EC 1060ED 1060EE 1060EF
B0        1060F0 1060F1 1060F2 1060F3 1060F4 1060F5 1060F6 1060F7 1060F8 1060F9 1060FA 1060FB 1060FC 1060FD 1060FE 1060FF
C0 to F0  (empty)
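The layout above can be reproduced with a short sketch. It uses Python's built-in UTF-8 codec as a stand-in for the ICU converter (an assumption; the page itself is generated by ICU4C), and the helper names `code_point_for` and `manual_code_point` are hypothetical. It shows why only final bytes 80 through BF produce a character after the lead bytes F4 86 83, and how the code point falls out of the payload bits.

```python
from typing import Optional

# Lead bytes shown on the page: F4 86 83.
LEAD = bytes.fromhex("F48683")


def code_point_for(final_byte: int) -> Optional[int]:
    """Decode LEAD + final_byte; return the code point, or None if invalid."""
    try:
        text = (LEAD + bytes([final_byte])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return ord(text)


def manual_code_point(final_byte: int) -> int:
    """Same result by hand, valid only for continuation bytes 0x80..0xBF.

    A 4-byte sequence carries 3 + 6 + 6 + 6 payload bits.
    """
    return (
        ((0xF4 & 0x07) << 18)
        | ((0x86 & 0x3F) << 12)
        | ((0x83 & 0x3F) << 6)
        | (final_byte & 0x3F)
    )


if __name__ == "__main__":
    for row in range(0x00, 0x100, 0x10):
        cells = [code_point_for(row + col) for col in range(16)]
        shown = " ".join("%06X" % cp if cp is not None else "......" for cp in cells)
        print("%02X  %s" % (row, shown))
```

Rows 00 to 70 and C0 to F0 print as dots because those byte values cannot complete the four-byte sequence; only continuation bytes in the range 80 to BF can.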

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
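The minimum and maximum byte counts are stated per UChar, which in ICU is a 16-bit UTF-16 code unit, not a full code point. A minimal sketch, again using Python's codecs as a stand-in for ICU and hypothetical helper names, shows why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence corresponds to two UChars (a surrogate pair), so it costs only 2 bytes per UChar.

```python
def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units (ICU UChars) needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count."""
    return len(ch.encode("utf-8")) / utf16_units(ch)


# ASCII, a 2-byte character, the substitution character, and a plane-16 character.
SAMPLES = ["A", "\u00e9", "\ufffd", "\U001060C0"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(
            "U+%04X: %d UTF-8 bytes, %d UChar(s), %.1f bytes per UChar"
            % (ord(ch), len(ch.encode("utf-8")), utf16_units(ch), bytes_per_uchar(ch))
        )
    # The substitution character bytes EF BF BD are U+FFFD REPLACEMENT CHARACTER.
    print(b"\xef\xbf\xbd".decode("utf-8") == "\ufffd")
```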

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
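The set above says that every code point except the surrogates U+D800 to U+DFFF is representable. As a rough check, the sketch below probes each code point with Python's strict UTF-8 encoder. That encoder is assumed to agree with the ICU converter on this point, and the names `representable` and `surrogate_gap` are hypothetical.

```python
from typing import Tuple


def representable(cp: int) -> bool:
    """True if the code point can be encoded by the strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


def surrogate_gap() -> Tuple[int, int, int]:
    """Scan U+0000..U+10FFFF; return (first, last, count) of rejected code points."""
    rejected = [cp for cp in range(0x110000) if not representable(cp)]
    return rejected[0], rejected[-1], len(rejected)


if __name__ == "__main__":
    first, last, count = surrogate_gap()
    print("rejected: U+%04X..U+%04X (%d code points)" % (first, last, count))
```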