International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
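
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's API: the names `UTF8_ALIASES`, `ALIAS_TO_CANONICAL`, and `canonical_name` are hypothetical, and the case-insensitive matching is a simplifying assumption made for this example.

```python
# Hypothetical alias table copied from the list above; this is not ICU's API.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption for this sketch: aliases are matched case-insensitively.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(name.lower())


if __name__ == "__main__":
    for candidate in ("CP1208", "windows-65001", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

Names that are not in the table, such as `latin-1` here, come back as `None`, so the caller decides how to handle them.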


Codepage Layout

Currently showing the codepage starting with the bytes F39E96

Rows give the high nibble and columns the low nibble of the final byte; each cell is the resulting Unicode code point in hex.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     DE580 DE581 DE582 DE583 DE584 DE585 DE586 DE587 DE588 DE589 DE58A DE58B DE58C DE58D DE58E DE58F
90     DE590 DE591 DE592 DE593 DE594 DE595 DE596 DE597 DE598 DE599 DE59A DE59B DE59C DE59D DE59E DE59F
A0     DE5A0 DE5A1 DE5A2 DE5A3 DE5A4 DE5A5 DE5A6 DE5A7 DE5A8 DE5A9 DE5AA DE5AB DE5AC DE5AD DE5AE DE5AF
B0     DE5B0 DE5B1 DE5B2 DE5B3 DE5B4 DE5B5 DE5B6 DE5B7 DE5B8 DE5B9 DE5BA DE5BB DE5BC DE5BD DE5BE DE5BF
C0-F0  (empty)
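
The layout above follows from the UTF-8 bit pattern for four-byte sequences: the lead bytes F3 9E 96 fix the upper bits of the code point, and a final byte in the range 80 to BF supplies the low six bits. The sketch below checks this with Python's built-in UTF-8 codec rather than ICU; the helper names `code_point` and `manual_code_point` are hypothetical.

```python
LEAD = bytes.fromhex("F39E96")  # the three lead bytes shown on this page


def code_point(trail):
    """Decode LEAD plus one final byte using Python's UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(trail):
    """Assemble the code point by hand from the four-byte UTF-8 bit layout."""
    b0, b1, b2 = LEAD
    return (
        ((b0 & 0x07) << 18)      # 3 payload bits from the first byte
        | ((b1 & 0x3F) << 12)    # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (trail & 0x3F)
    )


if __name__ == "__main__":
    for trail in (0x80, 0x8F, 0xBF):
        print(f"{trail:02X} -> {code_point(trail):X}")
    # prints:
    # 80 -> DE580
    # 8F -> DE58F
    # BF -> DE5BF
```

Final bytes outside 80 to BF are not valid continuation bytes, which is why the other rows of the layout are empty; in this sketch `code_point(0x41)` raises `UnicodeDecodeError`.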

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
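
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. A supplementary character takes four UTF-8 bytes but also two UChars, so the per-UChar maximum is 3 rather than 4. The following Python sketch illustrates the arithmetic and confirms that the substitution bytes EF BF BD encode U+FFFD; it uses Python's codecs, not ICU, and `bytes_per_uchar` is a hypothetical helper.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


# One sample each for 1-, 2-, 3-, and 4-byte UTF-8 sequences.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U000DE580"]

# The substitution character listed above, decoded from its UTF-8 bytes.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
    print(f"substitution character: U+{ord(SUBSTITUTION):04X}")
```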

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
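
The set above says that every code point except the surrogate range U+D800 to U+DFFF is representable. A minimal Python sketch of that membership test follows, compared against Python's own strict UTF-8 encoder, which is assumed here as a stand-in for the converter; both function names are hypothetical.

```python
def is_representable(cp):
    """Membership test for the set [^\\uD800-\\uDFFF]."""
    return not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """Check whether Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: set={is_representable(cp)} encoder={python_can_encode(cp)}")
```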