International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F3988B

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󘋀
D82C0
󘋁
D82C1
󘋂
D82C2
󘋃
D82C3
󘋄
D82C4
󘋅
D82C5
󘋆
D82C6
󘋇
D82C7
󘋈
D82C8
󘋉
D82C9
󘋊
D82CA
󘋋
D82CB
󘋌
D82CC
󘋍
D82CD
󘋎
D82CE
󘋏
D82CF
80
90
󘋐
D82D0
󘋑
D82D1
󘋒
D82D2
󘋓
D82D3
󘋔
D82D4
󘋕
D82D5
󘋖
D82D6
󘋗
D82D7
󘋘
D82D8
󘋙
D82D9
󘋚
D82DA
󘋛
D82DB
󘋜
D82DC
󘋝
D82DD
󘋞
D82DE
󘋟
D82DF
90
A0
󘋠
D82E0
󘋡
D82E1
󘋢
D82E2
󘋣
D82E3
󘋤
D82E4
󘋥
D82E5
󘋦
D82E6
󘋧
D82E7
󘋨
D82E8
󘋩
D82E9
󘋪
D82EA
󘋫
D82EB
󘋬
D82EC
󘋭
D82ED
󘋮
D82EE
󘋯
D82EF
A0
B0
󘋰
D82F0
󘋱
D82F1
󘋲
D82F2
󘋳
D82F3
󘋴
D82F4
󘋵
D82F5
󘋶
D82F6
󘋷
D82F7
󘋸
D82F8
󘋹
D82F9
󘋺
D82FA
󘋻
D82FB
󘋼
D82FC
󘋽
D82FD
󘋾
D82FE
󘋿
D82FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]