International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48583

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􅃀
1050C0
􅃁
1050C1
􅃂
1050C2
􅃃
1050C3
􅃄
1050C4
􅃅
1050C5
􅃆
1050C6
􅃇
1050C7
􅃈
1050C8
􅃉
1050C9
􅃊
1050CA
􅃋
1050CB
􅃌
1050CC
􅃍
1050CD
􅃎
1050CE
􅃏
1050CF
80
90
􅃐
1050D0
􅃑
1050D1
􅃒
1050D2
􅃓
1050D3
􅃔
1050D4
􅃕
1050D5
􅃖
1050D6
􅃗
1050D7
􅃘
1050D8
􅃙
1050D9
􅃚
1050DA
􅃛
1050DB
􅃜
1050DC
􅃝
1050DD
􅃞
1050DE
􅃟
1050DF
90
A0
􅃠
1050E0
􅃡
1050E1
􅃢
1050E2
􅃣
1050E3
􅃤
1050E4
􅃥
1050E5
􅃦
1050E6
􅃧
1050E7
􅃨
1050E8
􅃩
1050E9
􅃪
1050EA
􅃫
1050EB
􅃬
1050EC
􅃭
1050ED
􅃮
1050EE
􅃯
1050EF
A0
B0
􅃰
1050F0
􅃱
1050F1
􅃲
1050F2
􅃳
1050F3
􅃴
1050F4
􅃵
1050F5
􅃶
1050F6
􅃷
1050F7
􅃸
1050F8
􅃹
1050F9
􅃺
1050FA
􅃻
1050FB
􅃼
1050FC
􅃽
1050FD
􅃾
1050FE
􅃿
1050FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]