International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3838F

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃏀
C33C0
󃏁
C33C1
󃏂
C33C2
󃏃
C33C3
󃏄
C33C4
󃏅
C33C5
󃏆
C33C6
󃏇
C33C7
󃏈
C33C8
󃏉
C33C9
󃏊
C33CA
󃏋
C33CB
󃏌
C33CC
󃏍
C33CD
󃏎
C33CE
󃏏
C33CF
80
90
󃏐
C33D0
󃏑
C33D1
󃏒
C33D2
󃏓
C33D3
󃏔
C33D4
󃏕
C33D5
󃏖
C33D6
󃏗
C33D7
󃏘
C33D8
󃏙
C33D9
󃏚
C33DA
󃏛
C33DB
󃏜
C33DC
󃏝
C33DD
󃏞
C33DE
󃏟
C33DF
90
A0
󃏠
C33E0
󃏡
C33E1
󃏢
C33E2
󃏣
C33E3
󃏤
C33E4
󃏥
C33E5
󃏦
C33E6
󃏧
C33E7
󃏨
C33E8
󃏩
C33E9
󃏪
C33EA
󃏫
C33EB
󃏬
C33EC
󃏭
C33ED
󃏮
C33EE
󃏯
C33EF
A0
B0
󃏰
C33F0
󃏱
C33F1
󃏲
C33F2
󃏳
C33F3
󃏴
C33F4
󃏵
C33F5
󃏶
C33F6
󃏷
C33F7
󃏸
C33F8
󃏹
C33F9
󃏺
C33FA
󃏻
C33FB
󃏼
C33FC
󃏽
C33FD
󃏾
C33FE
󃏿
C33FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]