International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
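
As an illustration of how an alias list like this one might be used, the following minimal sketch resolves a name to the internal converter name. It assumes a loose comparison that ignores case and any character that is not a letter or digit; this is a simplification for illustration, not ICU's actual matching code, and the names `ALIASES`, `_key`, and `resolve` are hypothetical.

```python
# Alias list copied from the table above; everything else is illustrative.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def _key(name):
    """Fold a name for loose comparison: lowercase, letters and digits only."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


# Map every folded alias to the internal converter name.
_TABLE = {_key(alias): CANONICAL for alias in ALIASES}


def resolve(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return _TABLE.get(_key(name))


print(resolve("utf8"))        # UTF-8
print(resolve("IBM_1208"))    # UTF-8
print(resolve("latin-1"))     # None
```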


Codepage Layout

Currently showing the codepage starting with the bytes F3 A3 8F

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hexadecimal) that the 4-byte sequence maps to. Rows 00-70 and C0-F0 contain no mappings.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  E33C0 E33C1 E33C2 E33C3 E33C4 E33C5 E33C6 E33C7 E33C8 E33C9 E33CA E33CB E33CC E33CD E33CE E33CF
90  E33D0 E33D1 E33D2 E33D3 E33D4 E33D5 E33D6 E33D7 E33D8 E33D9 E33DA E33DB E33DC E33DD E33DE E33DF
A0  E33E0 E33E1 E33E2 E33E3 E33E4 E33E5 E33E6 E33E7 E33E8 E33E9 E33EA E33EB E33EC E33ED E33EE E33EF
B0  E33F0 E33F1 E33F2 E33F3 E33F4 E33F5 E33F6 E33F7 E33F8 E33F9 E33FA E33FB E33FC E33FD E33FE E33FF
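
The mapping in this layout follows from the standard UTF-8 bit layout for 4-byte sequences: 3 payload bits in the lead byte and 6 in each continuation byte. The sketch below recomputes the first and last code points for the lead bytes F3 A3 8F and cross-checks one against Python's built-in decoder. The helper name `decode_utf8_4byte` is hypothetical, and it does no validation of its inputs.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    Assumes the bytes already form a well-formed 4-byte sequence.
    """
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


lead = (0xF3, 0xA3, 0x8F)

# Continuation bytes are limited to 0x80..0xBF, which is why only
# rows 80 through B0 of the layout are populated.
first = decode_utf8_4byte(*lead, 0x80)
last = decode_utf8_4byte(*lead, 0xBF)
print(f"U+{first:X}..U+{last:X}")  # U+E33C0..U+E33FF

# Cross-check against the built-in UTF-8 decoder.
assert bytes([*lead, 0x80]).decode("utf-8") == chr(first)
```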

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
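
The byte counts above are per UChar, which in ICU is a UTF-16 code unit, not a whole code point. A small sketch using only Python's built-in codecs (not ICU itself) shows why the maximum is 3 rather than 4: a supplementary character takes 4 UTF-8 bytes but occupies 2 UTF-16 code units. It also checks that the listed substitution bytes are the UTF-8 form of U+FFFD. The helper name `bytes_per_utf16_unit` and the sample characters are illustrative choices.

```python
def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 range, rest of the BMP, and a supplementary code point.
samples = ["A", "\u00e9", "\u20ac", "\U000E33C0"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The substitution character \xEF\xBF\xBD is U+FFFD REPLACEMENT CHARACTER.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```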

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
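
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below compares that rule with the behavior of Python's own UTF-8 encoder, which also rejects lone surrogates. This illustrates the set notation and does not call ICU; the function names are hypothetical.

```python
def is_representable(cp):
    """Membership test for the set: any code point except U+D800..U+DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_as_utf8(cp):
    """True if Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate range, plus the extremes.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE33C0, 0x10FFFF):
    assert is_representable(cp) == encodes_as_utf8(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```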