International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2878B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򇋀
872C0
򇋁
872C1
򇋂
872C2
򇋃
872C3
򇋄
872C4
򇋅
872C5
򇋆
872C6
򇋇
872C7
򇋈
872C8
򇋉
872C9
򇋊
872CA
򇋋
872CB
򇋌
872CC
򇋍
872CD
򇋎
872CE
򇋏
872CF
80
90
򇋐
872D0
򇋑
872D1
򇋒
872D2
򇋓
872D3
򇋔
872D4
򇋕
872D5
򇋖
872D6
򇋗
872D7
򇋘
872D8
򇋙
872D9
򇋚
872DA
򇋛
872DB
򇋜
872DC
򇋝
872DD
򇋞
872DE
򇋟
872DF
90
A0
򇋠
872E0
򇋡
872E1
򇋢
872E2
򇋣
872E3
򇋤
872E4
򇋥
872E5
򇋦
872E6
򇋧
872E7
򇋨
872E8
򇋩
872E9
򇋪
872EA
򇋫
872EB
򇋬
872EC
򇋭
872ED
򇋮
872EE
򇋯
872EF
A0
B0
򇋰
872F0
򇋱
872F1
򇋲
872F2
򇋳
872F3
򇋴
872F4
򇋵
872F5
򇋶
872F6
򇋷
872F7
򇋸
872F8
򇋹
872F9
򇋺
872FA
򇋻
872FB
򇋼
872FC
򇋽
872FD
򇋾
872FE
򇋿
872FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]