International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F388A7

Rows give the high nibble of the final byte and columns give the low nibble. Each cell lists the code point (hexadecimal) that the complete four-byte sequence decodes to. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C89C0 C89C1 C89C2 C89C3 C89C4 C89C5 C89C6 C89C7 C89C8 C89C9 C89CA C89CB C89CC C89CD C89CE C89CF
90  C89D0 C89D1 C89D2 C89D3 C89D4 C89D5 C89D6 C89D7 C89D8 C89D9 C89DA C89DB C89DC C89DD C89DE C89DF
A0  C89E0 C89E1 C89E2 C89E3 C89E4 C89E5 C89E6 C89E7 C89E8 C89E9 C89EA C89EB C89EC C89ED C89EE C89EF
B0  C89F0 C89F1 C89F2 C89F3 C89F4 C89F5 C89F6 C89F7 C89F8 C89F9 C89FA C89FB C89FC C89FD C89FE C89FF
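
The layout above can be reproduced with a short sketch. It uses Python's built-in UTF-8 codec rather than ICU, on the assumption that both agree for these well-formed sequences; `LEAD`, `decode_cell`, and `is_valid_trail` are illustrative names, not part of any ICU API.

```python
LEAD = bytes.fromhex("F388A7")  # the three-byte prefix shown in the layout


def decode_cell(trail: int) -> int:
    """Return the code point for LEAD followed by one final byte."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False


# Collect every final byte that completes a valid sequence.
valid = [t for t in range(0x100) if is_valid_trail(t)]

print(f"valid final bytes: {valid[0]:02X}..{valid[-1]:02X}")
print(f"code points: U+{decode_cell(0x80):X}..U+{decode_cell(0xBF):X}")
```

Only the continuation range 80..BF is populated because the fourth byte of a UTF-8 sequence must have the bit pattern 10xxxxxx; its low six bits supply the last six bits of the code point.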

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
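
The byte counts above are per UChar (a 16-bit UTF-16 code unit), not per character, which is why the maximum is 3 rather than 4. The following sketch illustrates this with Python's built-in codecs instead of ICU; `bytes_per_uchar` and the sample characters are chosen here for illustration only.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample from each UTF-8 length class (1, 2, 3, and 4 bytes).
samples = ["A", "\u00E9", "\u20AC", "\U000C89C0"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\uFFFD".encode("utf-8")
print(substitution)
```

A supplementary character such as U+C89C0 takes four bytes in UTF-8 but two UChars in UTF-16, so it contributes two bytes per UChar; the three-byte BMP characters set the maximum.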

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
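
The set above excludes only the surrogate range. As a rough check, the sketch below probes the boundaries with Python's strict UTF-8 codec, which is assumed here to match the ICU converter on this point; `representable` is a hypothetical helper name.

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Code points just outside and just inside the surrogate range,
# plus the first and last code points of the Unicode code space.
boundaries = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
for cp in boundaries:
    print(f"U+{cp:04X}: {representable(cp)}")
```

Surrogates are reserved for UTF-16 pairing and have no well-formed UTF-8 encoding, so U+D800 through U+DFFF are the only code points rejected.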