International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38A96

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󊖀
CA580
󊖁
CA581
󊖂
CA582
󊖃
CA583
󊖄
CA584
󊖅
CA585
󊖆
CA586
󊖇
CA587
󊖈
CA588
󊖉
CA589
󊖊
CA58A
󊖋
CA58B
󊖌
CA58C
󊖍
CA58D
󊖎
CA58E
󊖏
CA58F
80
90
󊖐
CA590
󊖑
CA591
󊖒
CA592
󊖓
CA593
󊖔
CA594
󊖕
CA595
󊖖
CA596
󊖗
CA597
󊖘
CA598
󊖙
CA599
󊖚
CA59A
󊖛
CA59B
󊖜
CA59C
󊖝
CA59D
󊖞
CA59E
󊖟
CA59F
90
A0
󊖠
CA5A0
󊖡
CA5A1
󊖢
CA5A2
󊖣
CA5A3
󊖤
CA5A4
󊖥
CA5A5
󊖦
CA5A6
󊖧
CA5A7
󊖨
CA5A8
󊖩
CA5A9
󊖪
CA5AA
󊖫
CA5AB
󊖬
CA5AC
󊖭
CA5AD
󊖮
CA5AE
󊖯
CA5AF
A0
B0
󊖰
CA5B0
󊖱
CA5B1
󊖲
CA5B2
󊖳
CA5B3
󊖴
CA5B4
󊖵
CA5B5
󊖶
CA5B6
󊖷
CA5B7
󊖸
CA5B8
󊖹
CA5B9
󊖺
CA5BA
󊖻
CA5BB
󊖼
CA5BC
󊖽
CA5BD
󊖾
CA5BE
󊖿
CA5BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]