International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28897

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  885C0 885C1 885C2 885C3 885C4 885C5 885C6 885C7 885C8 885C9 885CA 885CB 885CC 885CD 885CE 885CF
90  885D0 885D1 885D2 885D3 885D4 885D5 885D6 885D7 885D8 885D9 885DA 885DB 885DC 885DD 885DE 885DF
A0  885E0 885E1 885E2 885E3 885E4 885E5 885E6 885E7 885E8 885E9 885EA 885EB 885EC 885ED 885EE 885EF
B0  885F0 885F1 885F2 885F3 885F4 885F5 885F6 885F7 885F8 885F9 885FA 885FB 885FC 885FD 885FE 885FF
C0
D0
E0
F0
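
The layout above can be reproduced by hand. The following is a minimal sketch, not the ICU implementation: the helper name decode_utf8_4byte is hypothetical, and it only checks the bit patterns of a 4-byte sequence (it does not reject overlong forms or values above U+10FFFF). It shows why only final bytes 80 through BF produce entries after the prefix F2 88 97, and why they map to 885C0 through 885FF.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by bit manipulation.

    Hypothetical helper for illustration; it checks only the lead-byte
    pattern 11110xxx and the trail-byte pattern 10xxxxxx.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a 4-byte UTF-8 lead pattern")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail byte outside 80..BF")
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F28897")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:05X}..{last:05X}")

# Cross-check against Python's own UTF-8 encoder.
assert chr(first).encode("utf-8") == prefix + b"\x80"
```

Final bytes in rows 00 through 70 and C0 through F0 do not match the 10xxxxxx trail pattern, which is consistent with those rows being empty in the table.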

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
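
The byte counts and the substitution character can be checked with a short sketch. It assumes that "UChar" means one UTF-16 code unit, as in ICU, and uses Python's built-in codecs rather than ICU itself; the function name and the sample characters are chosen for illustration only.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Return UTF-8 byte count divided by UTF-16 code unit count for one character."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return n_bytes / n_units


# Samples: ASCII, 2-byte, 3-byte (BMP), and a supplementary code point.
samples = ["A", "\u00E9", "\u20AC", "\U000885C0"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)

# U+FFFD REPLACEMENT CHARACTER is the substitution character listed above.
substitution = "\uFFFD".encode("utf-8")
print(substitution.hex().upper())
```

A supplementary code point takes 4 bytes in UTF-8 but 2 code units in UTF-16, so it averages 2 bytes per UChar; the maximum of 3 comes from BMP characters such as U+20AC.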

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
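
The set above excludes only the surrogate range. As a small sketch using Python's strict UTF-8 codec (not ICU), with a hypothetical helper name, the boundary can be probed like this:

```python
def is_representable(cp: int) -> bool:
    """Return True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for surrogate code points U+D800..U+DFFF.
        return False


# Probe just outside and just inside the surrogate range, plus the extremes.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
```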