International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
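
An alias list like this is typically consulted through a name lookup. The following is a minimal sketch of such a lookup, not ICU's implementation: the `normalize` rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration, and `canonical_name` is a hypothetical helper.

```python
import re

# Aliases copied from the list above; all resolve to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    # Assumed matching rule: case-insensitive, ignoring '-', '_' and spaces.
    return re.sub(r"[-_ ]", "", name).lower()


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("IBM-1208"))
print(canonical_name("latin-1"))
```

Under the assumed rule, spellings such as `utf8` or `X-UTF_8J` resolve to the same entry as the listed forms, while names outside the list return `None`.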


Codepage Layout

Currently showing the codepage starting with the bytes F2A183

Row label plus column label gives the fourth byte; each cell is the code point (hexadecimal) of the resulting character. Rows without entries contain no valid fourth byte.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  A10C0 A10C1 A10C2 A10C3 A10C4 A10C5 A10C6 A10C7 A10C8 A10C9 A10CA A10CB A10CC A10CD A10CE A10CF
90  A10D0 A10D1 A10D2 A10D3 A10D4 A10D5 A10D6 A10D7 A10D8 A10D9 A10DA A10DB A10DC A10DD A10DE A10DF
A0  A10E0 A10E1 A10E2 A10E3 A10E4 A10E5 A10E6 A10E7 A10E8 A10E9 A10EA A10EB A10EC A10ED A10EE A10EF
B0  A10F0 A10F1 A10F2 A10F3 A10F4 A10F5 A10F6 A10F7 A10F8 A10F9 A10FA A10FB A10FC A10FD A10FE A10FF
C0
D0
E0
F0
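
The layout can be reproduced by hand from the standard UTF-8 bit pattern for four-byte sequences. The sketch below assembles the code point from the lead bytes F2 A1 83 plus a fourth byte and cross-checks the result against Python's built-in decoder; `decode_manually` and `layout_row` are illustrative names, not part of ICU.

```python
# Lead bytes of the page shown above.
LEAD = bytes.fromhex("F2A183")


def decode_manually(seq):
    """Assemble the code point of a 4-byte UTF-8 sequence (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trailing byte 10xxxxxx
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_row(row_start):
    """Return the 16 code points (hex strings) for one row of the layout."""
    return [
        format(decode_manually(LEAD + bytes([row_start + col])), "X")
        for col in range(16)
    ]


first = LEAD + b"\x80"
# The manual result agrees with the built-in UTF-8 decoder.
assert decode_manually(first) == ord(first.decode("utf-8")) == 0xA10C0

for row in (0x80, 0x90, 0xA0, 0xB0):
    print(format(row, "02X"), " ".join(layout_row(row)))
```

Only rows 80 through B0 are populated because a trailing byte must match the pattern 10xxxxxx, which limits it to the range 80 to BF.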

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
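
Several of these properties can be checked with a few lines of Python. The sketch below assumes that "UChar" means one UTF-16 code unit, and it uses Python's `replace` error handler as a stand-in for a substitution callback; neither the helper name `utf8_and_utf16_lengths` nor the handler comes from ICU.

```python
# Substitution bytes reported in the table above.
SUBSTITUTION = b"\xEF\xBF\xBD"


def utf8_and_utf16_lengths(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"

# Python's "replace" handler (a stand-in, not ICU's callback) emits U+FFFD
# for a byte that can never occur in UTF-8.
assert b"\xFF".decode("utf-8", errors="replace") == "\uFFFD"

# ASCII compatibility: bytes 0x20..0x7E decode to the same characters.
printable = bytes(range(0x20, 0x7F))
assert printable.decode("utf-8") == printable.decode("ascii")

# Byte counts per character: ASCII, a BMP character, a supplementary character.
for ch in ("A", "\u20AC", "\U000A10C0"):
    print(f"U+{ord(ch):04X}", utf8_and_utf16_lengths(ch))
```

Under that assumption the reported maximum of 3 is consistent with four-byte sequences: a supplementary character takes four UTF-8 bytes but also two UTF-16 code units, so the ratio per code unit never exceeds 3.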


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
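
The set above excludes only the surrogate range D800 to DFFF. As a rough illustration, the sketch below compares that membership rule with the behavior of Python's strict UTF-8 encoder; both function names are hypothetical, and Python's codec is used as a reference point rather than as a model of ICU's converter.

```python
def is_representable(code_point):
    # Membership test for the set above: any Unicode code point
    # except the surrogates D800..DFFF.
    in_unicode_range = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_unicode_range and not is_surrogate


def encodes_in_utf8(code_point):
    # Ask Python's strict UTF-8 encoder whether the code point is encodable.
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate range plus the ends of the code space.
samples = (0x0000, 0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA10C0, 0x10FFFF)
for cp in samples:
    print(f"U+{cp:04X}", is_representable(cp), encodes_in_utf8(cp))
```

For every sampled value the two checks agree, including the code points immediately before and after the surrogate range.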