International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
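
The alias list above can be turned into a small lookup. The following is a minimal sketch, not ICU's API: `UTF8_ALIASES`, `normalize`, and `resolve` are hypothetical names, and the comparison rule (ignore case and punctuation) is a simplifying assumption rather than ICU's exact matching algorithm.

```python
import re

# Aliases copied from the table above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Build a comparison key: lowercase, ASCII letters and digits only.

    This rule is an assumption made for the sketch.
    """
    return re.sub(r"[^0-9a-z]", "", name.lower())


# Precompute the keys once so that lookups are a set membership test.
ALIAS_KEYS = {normalize(alias) for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return "UTF-8" if the name matches a listed alias, else None."""
    return "UTF-8" if normalize(name) in ALIAS_KEYS else None


print(resolve("IBM_1208"))
print(resolve("latin-1"))
```

With this rule, `IBM_1208` and `x-utf_8j` both resolve to `UTF-8`, while a name that is not in the table resolves to `None`.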


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F3 A0 8F. The row label plus the column label gives the final byte of each sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no characters)
80     E03C0 E03C1 E03C2 E03C3 E03C4 E03C5 E03C6 E03C7 E03C8 E03C9 E03CA E03CB E03CC E03CD E03CE E03CF
90     E03D0 E03D1 E03D2 E03D3 E03D4 E03D5 E03D6 E03D7 E03D8 E03D9 E03DA E03DB E03DC E03DD E03DE E03DF
A0     E03E0 E03E1 E03E2 E03E3 E03E4 E03E5 E03E6 E03E7 E03E8 E03E9 E03EA E03EB E03EC E03ED E03EE E03EF
B0     E03F0 E03F1 E03F2 E03F3 E03F4 E03F5 E03F6 E03F7 E03F8 E03F9 E03FA E03FB E03FC E03FD E03FE E03FF
C0-F0  (no characters)
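
The layout above can be reproduced by decoding the prefix F3 A0 8F followed by each possible final byte. This is a minimal sketch that uses Python's built-in UTF-8 codec rather than ICU; `PREFIX` and `decode_cell` are hypothetical names introduced for illustration.

```python
PREFIX = bytes([0xF3, 0xA0, 0x8F])  # lead bytes named in the layout heading


def decode_cell(final_byte: int):
    """Return the code point for PREFIX + final_byte, or None if invalid."""
    try:
        text = (PREFIX + bytes([final_byte])).decode("utf-8")
    except UnicodeDecodeError:
        # Final bytes outside 80..BF are not valid continuation bytes.
        return None
    return ord(text)


# Print one row per high nibble, mirroring the grid on the page.
for row in range(0x00, 0x100, 0x10):
    cells = [decode_cell(row + col) for col in range(16)]
    labels = ["%05X" % cp if cp is not None else "-----" for cp in cells]
    print("%02X  %s" % (row, " ".join(labels)))
```

Only rows 80 through B0 produce code points, because a UTF-8 continuation byte must lie in the range 80..BF.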

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
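
Several of the properties above can be checked with a short script. This sketch assumes that "UChar" means one UTF-16 code unit, and it uses Python's built-in codecs instead of ICU; `bytes_per_utf16_unit`, `samples`, and `SUBSTITUTION` are hypothetical names.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 length: 1, 2, 3, and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000E03C0"]
ratios = [bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")
print(SUBSTITUTION)

# ASCII compatibility: each byte in 20..7E encodes the same code point.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)
print(ascii_compatible)
```

A four-byte sequence such as the one for U+E03C0 corresponds to two UTF-16 code units, so it contributes 2 bytes per unit. The maximum of 3 comes from three-byte sequences such as the one for U+20AC.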

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
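
The set above excludes only the surrogate range. As a minimal sketch, Python's strict UTF-8 codec can be used to check the same boundary; `is_representable` is a hypothetical helper, not part of ICU.

```python
def is_representable(code_point: int) -> bool:
    """Report whether a code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        # Python's strict UTF-8 codec rejects surrogates D800..DFFF.
        return False
    return True


# Probe the code points on both sides of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print("%06X" % cp, is_representable(cp))
```

Here U+D800 and U+DFFF are rejected, while their neighbours U+D7FF and U+E000 and the last code point U+10FFFF are accepted.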