International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2AE96

Each cell is the code point (hex) encoded by F2 AE 96 followed by the trailing byte (row + column).

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  AE580 AE581 AE582 AE583 AE584 AE585 AE586 AE587 AE588 AE589 AE58A AE58B AE58C AE58D AE58E AE58F
90  AE590 AE591 AE592 AE593 AE594 AE595 AE596 AE597 AE598 AE599 AE59A AE59B AE59C AE59D AE59E AE59F
A0  AE5A0 AE5A1 AE5A2 AE5A3 AE5A4 AE5A5 AE5A6 AE5A7 AE5A8 AE5A9 AE5AA AE5AB AE5AC AE5AD AE5AE AE5AF
B0  AE5B0 AE5B1 AE5B2 AE5B3 AE5B4 AE5B5 AE5B6 AE5B7 AE5B8 AE5B9 AE5BA AE5BB AE5BC AE5BD AE5BE AE5BF
C0
D0
E0
F0
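
The populated rows follow directly from the UTF-8 bit layout: a 4-byte sequence carries 3 payload bits in the lead byte and 6 in each trailing byte. The following is a minimal Python sketch (not ICU code) that recomputes the table under that assumption. The names `decode_utf8_4byte`, `PREFIX`, and `layout_row` are hypothetical helpers introduced here for illustration, and the decoder deliberately skips the overlong and range checks a full validator would need.

```python
from typing import List

PREFIX = bytes.fromhex("F2AE96")  # lead bytes shown in the layout above


def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic.

    Simplified on purpose: checks only the byte patterns 11110xxx 10xxxxxx
    10xxxxxx 10xxxxxx, not overlong forms or the U+10FFFF upper limit.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected a 4-byte sequence with lead byte F0..F7")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trailing bytes must be in the range 80..BF")
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


def layout_row(row_start: int) -> List[str]:
    """Return the 16 hex code points for one row (for example 0x80) of the layout."""
    return [
        f"{decode_utf8_4byte(PREFIX + bytes([row_start + col])):X}"
        for col in range(16)
    ]


if __name__ == "__main__":
    for row in (0x80, 0x90, 0xA0, 0xB0):
        print(f"{row:02X}", " ".join(layout_row(row)))
```

Rows 00 through 70 and C0 through F0 stay empty because a trailing byte outside 80..BF cannot complete the sequence.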

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
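
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a supplementary character (4 UTF-8 bytes, 2 UChars) averages 2 bytes per UChar and the maximum of 3 comes from the Basic Multilingual Plane. A small Python sketch, using only the standard codecs rather than ICU, can illustrate this. The helper name `utf8_bytes_per_uchar` is hypothetical.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per 16-bit code unit
    return utf8_len / uchars


# The substitution character listed above is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")

if __name__ == "__main__":
    for label, ch in (("U+0041", "A"), ("U+FFFF", "\uFFFF"), ("U+AE580", chr(0xAE580))):
        print(label, utf8_bytes_per_uchar(ch))
    print(SUBSTITUTION.hex().upper())
```

Python's own decoder inserts the same U+FFFD when asked to replace malformed input, for instance `b"\xff".decode("utf-8", errors="replace")`.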

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
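
The set above says that every code point except the surrogates U+D800 through U+DFFF is representable. As a rough illustration, the Python sketch below checks this with Python's strict UTF-8 codec, which applies the same exclusion; it is not a call into ICU, and `is_representable` is a hypothetical helper name.

```python
def is_representable(cp: int) -> bool:
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, which well-formed UTF-8 excludes.
        return False


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xAE580, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp))
```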