International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
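
As a rough illustration of how the aliases listed above can be resolved to the internal converter name, here is a minimal sketch. The alias strings are copied from the list; the `normalize` and `canonical_name` helpers are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is a simplified assumption, not ICU's actual lookup code.

```python
# Aliases copied from the list above; the lookup scheme is an assumption.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (simplified loose match)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return 'UTF-8' if name matches a listed alias, otherwise None."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM-1208"))
print(canonical_name("x-utf_8j"))
print(canonical_name("latin-1"))
```

With this table, `IBM-1208` and `x-utf_8j` both resolve to `UTF-8`, while a name that is not in the list returns `None`.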


Codepage Layout

Currently showing the codepage starting with the bytes F2B78B. Each row is the high nibble of the final byte, each column the low nibble, and each cell the code point of the resulting character. Rows 00 through 70 and C0 through F0 contain no characters.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     B72C0 B72C1 B72C2 B72C3 B72C4 B72C5 B72C6 B72C7 B72C8 B72C9 B72CA B72CB B72CC B72CD B72CE B72CF
90     B72D0 B72D1 B72D2 B72D3 B72D4 B72D5 B72D6 B72D7 B72D8 B72D9 B72DA B72DB B72DC B72DD B72DE B72DF
A0     B72E0 B72E1 B72E2 B72E3 B72E4 B72E5 B72E6 B72E7 B72E8 B72E9 B72EA B72EB B72EC B72ED B72EE B72EF
B0     B72F0 B72F1 B72F2 B72F3 B72F4 B72F5 B72F6 B72F7 B72F8 B72F9 B72FA B72FB B72FC B72FD B72FE B72FF
C0-F0  (empty)
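
The layout above follows from the standard four-byte UTF-8 bit pattern: the lead bytes F2 B7 8B fix the upper bits of the code point, and the final byte (80 through BF) supplies the low six bits. The sketch below decodes one cell by hand and cross-checks it against Python's built-in decoder; `decode_cell` is a hypothetical helper name used only for illustration.

```python
LEAD = bytes.fromhex("F2B78B")  # the three lead bytes shown on this page


def decode_cell(trail: int) -> int:
    """Decode LEAD plus one trailing byte by hand; return the code point."""
    if not 0x80 <= trail <= 0xBF:
        raise ValueError("trailing byte must be in the range 80..BF")
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the 11110xxx byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each 10xxxxxx byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


first = decode_cell(0x80)
last = decode_cell(0xBF)

# Cross-check the hand decoding against Python's own UTF-8 decoder.
assert chr(first) == (LEAD + b"\x80").decode("utf-8")
assert chr(last) == (LEAD + b"\xBF").decode("utf-8")

print(f"{first:X}..{last:X}")
```

Bytes outside 80 through BF cannot follow these lead bytes in well-formed UTF-8, which is why the other rows of the layout are empty.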

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
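
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit rather than a whole code point. That is why the maximum is 3 even though a supplementary character takes 4 bytes in UTF-8: it also occupies two UChars. The sketch below checks these figures with Python's own codecs, not with ICU; `bytes_per_uchar` is a hypothetical helper.

```python
def bytes_per_uchar(text: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for the text."""
    utf8_len = len(text.encode("utf-8"))
    utf16_units = len(text.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_uchar("A"))           # ASCII: 1 byte, 1 code unit
print(bytes_per_uchar("\uFFFD"))      # BMP: 3 bytes, 1 code unit
print(bytes_per_uchar("\U000B72C0"))  # supplementary: 4 bytes, 2 code units

# The substitution character bytes are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"
print(hex(ord(SUBSTITUTION.decode("utf-8"))))

# ASCII compatibility: bytes 20..7E decode to the same code points.
ascii_compatible = all(
    bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F)
)
print(ascii_compatible)
```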

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
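
The set above reads as "every code point except the surrogates D800 through DFFF". A minimal sketch of that membership test follows, compared against Python's strict UTF-8 encoder, which also rejects lone surrogates. Both function names are hypothetical, and the comparison relies on Python's behavior rather than on ICU's.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a code point in the set [^\\uD800-\\uDFFF]."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries of the excluded range and both ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB72C0, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}", is_representable(cp))
```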