International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage for byte sequences that start with the bytes F3 80 A7

Row label plus column label gives the final byte; each cell is the code point (hexadecimal) shown for that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     C09C0 C09C1 C09C2 C09C3 C09C4 C09C5 C09C6 C09C7 C09C8 C09C9 C09CA C09CB C09CC C09CD C09CE C09CF
90     C09D0 C09D1 C09D2 C09D3 C09D4 C09D5 C09D6 C09D7 C09D8 C09D9 C09DA C09DB C09DC C09DD C09DE C09DF
A0     C09E0 C09E1 C09E2 C09E3 C09E4 C09E5 C09E6 C09E7 C09E8 C09E9 C09EA C09EB C09EC C09ED C09EE C09EF
B0     C09F0 C09F1 C09F2 C09F3 C09F4 C09F5 C09F6 C09F7 C09F8 C09F9 C09FA C09FB C09FC C09FD C09FE C09FF
C0-F0  (empty)
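
The layout above can be reproduced by appending each possible final byte to the three-byte prefix and decoding the result as UTF-8. The sketch below uses only Python's built-in UTF-8 codec rather than ICU; `PREFIX`, `decode_cell`, and `is_mapped` are names invented for this illustration.

```python
# Three-byte prefix named in the caption above.
PREFIX = bytes.fromhex("F380A7")


def decode_cell(final_byte: int) -> int:
    """Decode PREFIX + final_byte as UTF-8 and return the code point."""
    return ord((PREFIX + bytes([final_byte])).decode("utf-8"))


def is_mapped(final_byte: int) -> bool:
    """True if PREFIX + final_byte is a well-formed UTF-8 sequence."""
    try:
        decode_cell(final_byte)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    # Print one row per high nibble, skipping rows with no mapped cell.
    for row in range(0x00, 0x100, 0x10):
        cells = [
            f"{decode_cell(row + col):05X}" if is_mapped(row + col) else "     "
            for col in range(16)
        ]
        if any(cell.strip() for cell in cells):
            print(f"{row:02X}  " + " ".join(cells))
```

Only final bytes in the range 80 to BF are valid continuation bytes, which is why the other rows of the layout are empty.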

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
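
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. A supplementary character takes four UTF-8 bytes but also two UChars, so the per-UChar maximum stays at 3. The sketch below checks these figures with Python's built-in codecs, not with ICU; `bytes_per_uchar` is a hypothetical helper written for this illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution bytes listed above are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")

if __name__ == "__main__":
    for ch in ("A", "\u00E9", "\uFFFD", "\U000C09C0"):
        print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
    print(f"substitution character: U+{ord(SUBSTITUTION):04X}")
```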


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
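
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a rough check under the assumption that Python's strict UTF-8 codec applies the same rule, the sketch below tests the boundaries of that range; `representable` is a name made up for this example.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, which UTF-8 does not allow.
        return False


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {representable(cp)}")
```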