International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38783

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󇃀
C70C0
󇃁
C70C1
󇃂
C70C2
󇃃
C70C3
󇃄
C70C4
󇃅
C70C5
󇃆
C70C6
󇃇
C70C7
󇃈
C70C8
󇃉
C70C9
󇃊
C70CA
󇃋
C70CB
󇃌
C70CC
󇃍
C70CD
󇃎
C70CE
󇃏
C70CF
80
90
󇃐
C70D0
󇃑
C70D1
󇃒
C70D2
󇃓
C70D3
󇃔
C70D4
󇃕
C70D5
󇃖
C70D6
󇃗
C70D7
󇃘
C70D8
󇃙
C70D9
󇃚
C70DA
󇃛
C70DB
󇃜
C70DC
󇃝
C70DD
󇃞
C70DE
󇃟
C70DF
90
A0
󇃠
C70E0
󇃡
C70E1
󇃢
C70E2
󇃣
C70E3
󇃤
C70E4
󇃥
C70E5
󇃦
C70E6
󇃧
C70E7
󇃨
C70E8
󇃩
C70E9
󇃪
C70EA
󇃫
C70EB
󇃬
C70EC
󇃭
C70ED
󇃮
C70EE
󇃯
C70EF
A0
B0
󇃰
C70F0
󇃱
C70F1
󇃲
C70F2
󇃳
C70F3
󇃴
C70F4
󇃵
C70F5
󇃶
C70F6
󇃷
C70F7
󇃸
C70F8
󇃹
C70F9
󇃺
C70FA
󇃻
C70FB
󇃼
C70FC
󇃽
C70FD
󇃾
C70FE
󇃿
C70FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]