International Components for Unicode

UTF-8

List of Converter Aliases

Internal converter name:  UTF-8
IBM aliases:              ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593
IANA alias:               UTF-8
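
The alias list above can be read as a many-to-one lookup from alias to internal converter name. A minimal sketch in Python, assuming a simple normalisation (lower-casing and dropping '-', '_' and spaces); the names `ALIASES`, `normalize` and `resolve` are hypothetical and not part of ICU, whose own matching rules may differ:

```python
# Alias table copied from the list above; the dictionary layout is an assumption.
ALIASES = {
    "UTF-8": [
        "UTF-8",
        "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
        "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    ],
}


def normalize(name: str) -> str:
    """Lower-case the name and drop '-', '_' and spaces (assumed rule)."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Reverse index: normalized alias -> internal converter name.
LOOKUP = {
    normalize(alias): canonical
    for canonical, aliases in ALIASES.items()
    for alias in aliases
}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


print(resolve("IBM_1208"))
print(resolve("latin1"))
```

With the table above, `resolve("IBM_1208")` and `resolve("utf8")` both map to `UTF-8`, while an alias that is not listed yields `None`.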


Codepage Layout

Currently showing the code page for sequences starting with the bytes F2 A7 97

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the code point (hexadecimal) encoded by F2 A7 97 followed by that byte. Rows 00 to 70 and C0 to F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  A75C0 A75C1 A75C2 A75C3 A75C4 A75C5 A75C6 A75C7 A75C8 A75C9 A75CA A75CB A75CC A75CD A75CE A75CF
90  A75D0 A75D1 A75D2 A75D3 A75D4 A75D5 A75D6 A75D7 A75D8 A75D9 A75DA A75DB A75DC A75DD A75DE A75DF
A0  A75E0 A75E1 A75E2 A75E3 A75E4 A75E5 A75E6 A75E7 A75E8 A75E9 A75EA A75EB A75EC A75ED A75EE A75EF
B0  A75F0 A75F1 A75F2 A75F3 A75F4 A75F5 A75F6 A75F7 A75F8 A75F9 A75FA A75FB A75FC A75FD A75FE A75FF
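
The code points in the layout follow from the UTF-8 bit pattern for four-byte sequences: 3 payload bits from the lead byte and 6 from each trail byte. A minimal sketch in Python, where the helper name `decode_four_byte` is hypothetical and not part of ICU, combining the bytes F2 A7 97 with the first and last possible final byte:

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine a 4-byte UTF-8 sequence into a code point.

    Only the bit patterns are checked here; overlong forms and values
    above U+10FFFF are not rejected in this sketch.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("lead byte is not of the form 11110xxx")
    for trail in (b1, b2, b3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trail bytes must be in the range 80-BF")
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


first = decode_four_byte(0xF2, 0xA7, 0x97, 0x80)
last = decode_four_byte(0xF2, 0xA7, 0x97, 0xBF)
print(f"{first:X} {last:X}")  # A75C0 A75FF

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes([0xF2, 0xA7, 0x97, 0x80]).decode("utf-8") == chr(first)
```

Only final bytes 80 to BF pass the trail-byte check, which matches the four populated rows of the layout.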

Information About This Converter

Type of converter:                      UCNV_UTF8
Minimum number of bytes per UChar:      1
Maximum number of bytes per UChar:      3
Substitution character:                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?        TRUE
Is ASCII [\u0020-\u007E] ambiguous?     FALSE
Contains ambiguous aliases?             FALSE
Always generates Unicode NFC?           UNKNOWN
Contains BiDi characters?               TRUE
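
The byte counts above are per UChar, that is, per UTF-16 code unit rather than per code point. A short sketch in Python (the helper `utf8_bytes_per_uchar` is hypothetical, and the sample characters are chosen here for illustration) showing why the maximum is 3 even though a UTF-8 sequence can be 4 bytes long, and that the substitution bytes are the encoding of U+FFFD:

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2
    return utf8_len / uchar_count


# One sample each for 1-, 2-, 3- and 4-byte UTF-8 sequences.
samples = ["A", "\u00E9", "\u20AC", "\U000A75C0"]
ratios = [utf8_bytes_per_uchar(ch) for ch in samples]
for ch, ratio in zip(samples, ratios):
    print(f"U+{ord(ch):04X}: {ratio} bytes per UChar")

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution)
```

A supplementary character such as U+A75C0 takes 4 bytes in UTF-8 but 2 UChars in UTF-16, so it contributes 2 bytes per UChar; the 3-byte case comes from BMP characters such as U+20AC.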

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
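
The set above excludes only the surrogate range. A minimal sketch in Python, with the hypothetical helpers `is_representable` and `encodes_in_python`, that tests membership directly and compares the result with the behaviour of Python's own UTF-8 encoder (used here as a stand-in, not as ICU itself):

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is in the set: any value up to U+10FFFF
    except the surrogates U+D800 to U+DFFF."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    return not 0xD800 <= code_point <= 0xDFFF


def encodes_in_python(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


boundaries = [0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xA75C0, 0x10FFFF]
for cp in boundaries:
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

The boundary values on either side of the surrogate range (U+D7FF and U+E000) are representable, while U+D800 and U+DFFF are not.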