International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
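
The alias list above is effectively a many-to-one lookup table. The following is a minimal Python sketch of such a lookup, not ICU's own implementation. The names `UTF8_ALIASES`, `normalize`, and `resolve` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
import re

# Aliases copied from the list above; each one maps to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return re.sub(r"[-_ ]", "", name).lower()


# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TO_INTERNAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_INTERNAL.get(normalize(name))


print(resolve("IBM-1208"))  # prints "UTF-8"
print(resolve("latin1"))    # prints "None"
```

Normalizing both the stored aliases and the query keeps the table small while still accepting spellings such as `utf8` or `CP1208`.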


Codepage Layout

Currently showing the codepage starting with the bytes F0A692

Rows give the high nibble of the fourth byte and columns give the low nibble. Rows 00 through 70 and C0 through F0 are empty.

     00 01 02 03 04 05 06 07 08 09 0A 0B 0C 0D 0E 0F   Code points
80   𦒀 𦒁 𦒂 𦒃 𦒄 𦒅 𦒆 𦒇 𦒈 𦒉 𦒊 𦒋 𦒌 𦒍 𦒎 𦒏   26480 to 2648F
90   𦒐 𦒑 𦒒 𦒓 𦒔 𦒕 𦒖 𦒗 𦒘 𦒙 𦒚 𦒛 𦒜 𦒝 𦒞 𦒟   26490 to 2649F
A0   𦒠 𦒡 𦒢 𦒣 𦒤 𦒥 𦒦 𦒧 𦒨 𦒩 𦒪 𦒫 𦒬 𦒭 𦒮 𦒯   264A0 to 264AF
B0   𦒰 𦒱 𦒲 𦒳 𦒴 𦒵 𦒶 𦒷 𦒸 𦒹 𦒺 𦒻 𦒼 𦒽 𦒾 𦒿   264B0 to 264BF
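
The layout can be reproduced by appending each possible fourth byte to the lead bytes F0A692 and decoding the result. The sketch below uses Python's built-in UTF-8 codec rather than ICU. `LEAD`, `decode_cell`, and `code_point_by_hand` are illustrative names, not part of the demo.

```python
LEAD = bytes.fromhex("F0A692")  # the three lead bytes shown on this page


def decode_cell(trail: int) -> str:
    """Decode LEAD plus one trailing byte; return '' for an empty cell."""
    try:
        return (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        return ""  # not a valid 4-byte UTF-8 sequence


def code_point_by_hand(trail: int) -> int:
    """Assemble the code point from the payload bits of a 4-byte sequence."""
    b0, b1, b2 = LEAD
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (trail & 0x3F)


first = decode_cell(0x80)
print(f"U+{ord(first):04X}")               # prints "U+26480"
print(f"U+{code_point_by_hand(0xBF):04X}")  # prints "U+264BF"

# Only trailing bytes 80..BF produce a character, matching the four filled rows.
filled = [t for t in range(256) if decode_cell(t)]
print(len(filled))  # prints "64"
```

The bit-masking version shows why the cells are consecutive: the fourth byte contributes the low six bits of the code point, so 80 through BF cover exactly 64 adjacent values.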

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
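
The maximum of 3 bytes per UChar can look surprising given that UTF-8 sequences run up to 4 bytes. A UChar is a 16-bit UTF-16 code unit, and a 4-byte UTF-8 sequence corresponds to a surrogate pair, that is, two UChars. The Python sketch below illustrates this with the standard codecs. `bytes_per_uchar` is a hypothetical helper and does not call ICU.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchar_count


print(bytes_per_uchar("A"))           # prints "1.0" (1 byte, 1 UChar)
print(bytes_per_uchar("\u20ac"))      # prints "3.0" (3 bytes, 1 UChar)
print(bytes_per_uchar("\U00026480"))  # prints "2.0" (4 bytes, 2 UChars)

# The substitution character bytes listed above decode to U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")   # prints "U+FFFD"
```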

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
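
The set above covers every code point except the surrogate range D800 through DFFF. As a rough check, Python's strict UTF-8 encoder applies the same exclusion. `representable` is a hypothetical helper written for this sketch, and it tests Python's codec rather than the ICU converter.

```python
def representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False  # surrogate code points are rejected


print(representable(0xD7FF))    # prints "True"
print(representable(0xD800))    # prints "False"
print(representable(0xDFFF))    # prints "False"
print(representable(0xE000))    # prints "True"
print(representable(0x10FFFF))  # prints "True"
```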