International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28396

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򃖀
83580
򃖁
83581
򃖂
83582
򃖃
83583
򃖄
83584
򃖅
83585
򃖆
83586
򃖇
83587
򃖈
83588
򃖉
83589
򃖊
8358A
򃖋
8358B
򃖌
8358C
򃖍
8358D
򃖎
8358E
򃖏
8358F
80
90
򃖐
83590
򃖑
83591
򃖒
83592
򃖓
83593
򃖔
83594
򃖕
83595
򃖖
83596
򃖗
83597
򃖘
83598
򃖙
83599
򃖚
8359A
򃖛
8359B
򃖜
8359C
򃖝
8359D
򃖞
8359E
򃖟
8359F
90
A0
򃖠
835A0
򃖡
835A1
򃖢
835A2
򃖣
835A3
򃖤
835A4
򃖥
835A5
򃖦
835A6
򃖧
835A7
򃖨
835A8
򃖩
835A9
򃖪
835AA
򃖫
835AB
򃖬
835AC
򃖭
835AD
򃖮
835AE
򃖯
835AF
A0
B0
򃖰
835B0
򃖱
835B1
򃖲
835B2
򃖳
835B3
򃖴
835B4
򃖵
835B5
򃖶
835B6
򃖷
835B7
򃖸
835B8
򃖹
835B9
򃖺
835BA
򃖻
835BB
򃖼
835BC
򃖽
835BD
򃖾
835BE
򃖿
835BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]