International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
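
To illustrate how an alias list like this one can be used, here is a minimal Python sketch that maps any listed alias back to the internal converter name. The helper names (`normalize`, `canonical_name`) are hypothetical, and the matching rule (ignore case, hyphens, underscores and spaces) is an assumption loosely modeled on loose name matching; it is not ICU's own implementation.

```python
import re

# Alias list copied from the table above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose-matching rule: ignore case, '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name).lower()


# Lookup table from normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return "UTF-8" for any listed alias, else None (hypothetical helper)."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM-1208"))  # UTF-8
print(canonical_name("utf_8"))     # UTF-8
print(canonical_name("latin-1"))   # None
```

A real application would call the converter library's own alias functions; the dictionary here only mirrors the table on this page.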


Codepage Layout

Currently showing the codepage starting with the bytes F190A7

Row and column labels give the high and low hexadecimal digit of the fourth byte. Each cell gives the Unicode code point, in hexadecimal, of the character encoded by F1 90 A7 followed by that byte. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  509C0 509C1 509C2 509C3 509C4 509C5 509C6 509C7 509C8 509C9 509CA 509CB 509CC 509CD 509CE 509CF
90  509D0 509D1 509D2 509D3 509D4 509D5 509D6 509D7 509D8 509D9 509DA 509DB 509DC 509DD 509DE 509DF
A0  509E0 509E1 509E2 509E3 509E4 509E5 509E6 509E7 509E8 509E9 509EA 509EB 509EC 509ED 509EE 509EF
B0  509F0 509F1 509F2 509F3 509F4 509F5 509F6 509F7 509F8 509F9 509FA 509FB 509FC 509FD 509FE 509FF
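
The mapping in the layout above can be checked with a short Python sketch. It decodes the lead bytes F1 90 A7 plus one trailing byte, once with Python's built-in UTF-8 codec and once by hand from the standard 4-byte UTF-8 bit layout. The function names are illustrative only.

```python
LEAD = bytes([0xF1, 0x90, 0xA7])  # the three lead bytes named above


def decode_cell(trail: int) -> int:
    """Return the code point for LEAD + trail using Python's UTF-8 decoder."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(trail: int) -> int:
    """Same result from the 4-byte layout: 3 + 6 + 6 + 6 payload bits."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


for trail in (0x80, 0x9F, 0xBF):
    print(f"{trail:02X} -> U+{decode_cell(trail):05X}")
# 80 -> U+509C0
# 9F -> U+509DF
# BF -> U+509FF
```

Only trailing bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty; `decode_cell(0x41)`, for example, raises `UnicodeDecodeError`.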

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
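
Several of these properties can be spot-checked in Python. The sketch below assumes that "per UChar" means "per UTF-16 code unit" (a UChar in ICU is a 16-bit code unit), so a 4-byte supplementary character, which takes two code units, averages 2 bytes per UChar and the maximum of 3 comes from the Basic Multilingual Plane. The helper name is hypothetical.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code-unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# Substitution character: U+FFFD REPLACEMENT CHARACTER.
print("\ufffd".encode("utf-8"))  # b'\xef\xbf\xbd'

# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
for ch in ("A", "\u00e9", "\u20ac", "\U000509C0"):
    print(f"U+{ord(ch):04X}", len(ch.encode("utf-8")), bytes_per_utf16_unit(ch))

# ASCII compatibility: bytes 0x20-0x7E decode to the same code points.
ascii_compatible = all(
    bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F)
)
print(ascii_compatible)  # True
```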

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
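
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, this Python sketch probes code points on both sides of that range with Python's strict UTF-8 codec; `is_representable` is a hypothetical helper, and Python's codec stands in for the ICU converter here.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp))
# U+D7FF True
# U+D800 False
# U+DFFF False
# U+E000 True
# U+10FFFF True
```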