International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A896

       00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00-70  (empty)
80     󨖀 E8580  󨖁 E8581  󨖂 E8582  󨖃 E8583  󨖄 E8584  󨖅 E8585  󨖆 E8586  󨖇 E8587  󨖈 E8588  󨖉 E8589  󨖊 E858A  󨖋 E858B  󨖌 E858C  󨖍 E858D  󨖎 E858E  󨖏 E858F
90     󨖐 E8590  󨖑 E8591  󨖒 E8592  󨖓 E8593  󨖔 E8594  󨖕 E8595  󨖖 E8596  󨖗 E8597  󨖘 E8598  󨖙 E8599  󨖚 E859A  󨖛 E859B  󨖜 E859C  󨖝 E859D  󨖞 E859E  󨖟 E859F
A0     󨖠 E85A0  󨖡 E85A1  󨖢 E85A2  󨖣 E85A3  󨖤 E85A4  󨖥 E85A5  󨖦 E85A6  󨖧 E85A7  󨖨 E85A8  󨖩 E85A9  󨖪 E85AA  󨖫 E85AB  󨖬 E85AC  󨖭 E85AD  󨖮 E85AE  󨖯 E85AF
B0     󨖰 E85B0  󨖱 E85B1  󨖲 E85B2  󨖳 E85B3  󨖴 E85B4  󨖵 E85B5  󨖶 E85B6  󨖷 E85B7  󨖸 E85B8  󨖹 E85B9  󨖺 E85BA  󨖻 E85BB  󨖼 E85BC  󨖽 E85BD  󨖾 E85BE  󨖿 E85BF
C0-F0  (empty)
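
The code points in the layout follow directly from the UTF-8 bit layout: a four-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte. The sketch below is a simplified decoder written for illustration (the function name `decode_utf8_4byte` is hypothetical, and it does not check for overlong forms or values above U+10FFFF). It shows how the prefix F3 A8 96 plus a fourth byte in 80..BF yields E8580 through E85BF, and why rows outside 80..BF are empty: those values are not valid trail bytes.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point (simplified sketch)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    return (
        ((seq[0] & 0x07) << 18)   # 3 payload bits from the lead byte
        | ((seq[1] & 0x3F) << 12) # 6 payload bits from each trail byte
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F3A896")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X} .. {last:X}")  # E8580 .. E85BF
```

Python's built-in codec gives the same answer: `ord(bytes.fromhex("F3A89680").decode("utf-8"))` is `0xE8580`.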

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
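
The "bytes per UChar" figures are easier to read once you recall that a UChar is one UTF-16 code unit. A character in the Basic Multilingual Plane occupies one UChar and 1 to 3 UTF-8 bytes; a supplementary character occupies two UChars and 4 bytes, which is only 2 bytes per UChar. The sketch below checks this with Python's standard codecs rather than ICU itself (the helper name `bytes_per_uchar` is an illustrative assumption), and also confirms that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD.

```python
def bytes_per_uchar(ch: str) -> float:
    """Ratio of UTF-8 bytes to UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


samples = ["A", "\u00E9", "\u20AC", "\U000E8580"]
for ch in samples:
    print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")

# The substitution character listed above is U+FFFD encoded as UTF-8.
print("\uFFFD".encode("utf-8").hex().upper())  # EFBFBD
```

For these samples the ratios are 1, 2, 3, and 2 respectively, so the maximum of 3 is reached by BMP characters at or above U+0800, not by four-byte sequences.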

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
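
The set above reads as "every code point except the surrogates D800..DFFF". As a rough cross-check, the sketch below probes the boundaries with Python's strict UTF-8 codec, which also refuses lone surrogates. This uses Python's codec as a stand-in, not the ICU converter, and `is_representable` is a hypothetical helper name.

```python
def is_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800..U+DFFF).
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xE8580, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Only U+D800 and U+DFFF report False in this probe; the code points on either side of the surrogate range, and the supplementary code points shown in the layout, all encode normally.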