International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name   All Aliases
UTF-8                     UTF-8
                          ibm-1208
                          ibm-1209
                          ibm-5304
                          ibm-5305
                          ibm-13496
                          ibm-13497
                          ibm-17592
                          ibm-17593
                          windows-65001
                          cp1208
                          x-UTF_8J
                          unicode-1-1-utf-8
                          unicode-2-0-utf-8
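
The alias list above can be used to map an incoming charset label to the internal converter name. The following is a minimal sketch, not ICU's own lookup: `canonical_name` is a hypothetical helper, and it only ignores case and surrounding whitespace, whereas ICU's real alias matching is assumed to be more lenient.

```python
# Aliases copied from the table above. The lookup logic is an illustrative
# assumption, not ICU's implementation.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def canonical_name(label):
    """Return "UTF-8" if label matches a listed alias (ignoring case), else None."""
    wanted = label.strip().lower()
    for alias in UTF8_ALIASES:
        if alias.lower() == wanted:
            return "UTF-8"
    return None


if __name__ == "__main__":
    print(canonical_name("IBM-1208"))
    print(canonical_name("latin-1"))
```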


Codepage Layout

Currently showing the codepage starting with the bytes F0 91 92

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  𑒀     𑒁     𑒂     𑒃     𑒄     𑒅     𑒆     𑒇     𑒈     𑒉     𑒊     𑒋     𑒌     𑒍     𑒎     𑒏
    11480 11481 11482 11483 11484 11485 11486 11487 11488 11489 1148A 1148B 1148C 1148D 1148E 1148F
90  𑒐     𑒑     𑒒     𑒓     𑒔     𑒕     𑒖     𑒗     𑒘     𑒙     𑒚     𑒛     𑒜     𑒝     𑒞     𑒟
    11490 11491 11492 11493 11494 11495 11496 11497 11498 11499 1149A 1149B 1149C 1149D 1149E 1149F
A0  𑒠     𑒡     𑒢     𑒣     𑒤     𑒥     𑒦     𑒧     𑒨     𑒩     𑒪     𑒫     𑒬     𑒭     𑒮     𑒯
    114A0 114A1 114A2 114A3 114A4 114A5 114A6 114A7 114A8 114A9 114AA 114AB 114AC 114AD 114AE 114AF
B0   𑒰     𑒱     𑒲     𑒳     𑒴     𑒵     𑒶     𑒷     𑒸     𑒹     𑒺     𑒻     𑒼     𑒽     𑒾     𑒿
    114B0 114B1 114B2 114B3 114B4 114B5 114B6 114B7 114B8 114B9 114BA 114BB 114BC 114BD 114BE 114BF
C0
D0
E0
F0
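
The layout above can be reproduced by appending each candidate fourth byte to the lead bytes F0 91 92 and decoding the result. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and `decode_cell` is a hypothetical helper name. Only fourth bytes in the range 80 to BF form a well-formed sequence, which is why rows 00 to 70 and C0 to F0 are empty.

```python
LEAD = bytes([0xF0, 0x91, 0x92])  # the three lead bytes named in the heading


def decode_cell(fourth_byte):
    """Decode F0 91 92 xx; return the code point, or None if ill-formed."""
    try:
        text = (LEAD + bytes([fourth_byte])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return ord(text)


if __name__ == "__main__":
    # Sample one byte just outside each edge of the valid range and the
    # two edges themselves.
    for b in (0x7F, 0x80, 0xBF, 0xC0):
        cp = decode_cell(b)
        print(f"{b:02X}", "(empty)" if cp is None else f"U+{cp:04X}")
```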

Information About This Converter

Type of converter:                        UCNV_UTF8
Minimum number of bytes per UChar:        1
Maximum number of bytes per UChar:        3
Substitution character:                   \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?          TRUE
Is ASCII [\u0020-\u007E] ambiguous?       FALSE
Contains ambiguous aliases?               FALSE
Always generates Unicode NFC?             UNKNOWN
Contains BiDi characters?                 TRUE
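
The byte counts above are per UChar, that is, per UTF-16 code unit, not per character. A rough way to check them with Python's built-in codecs (not ICU) is shown below. `bytes_per_uchar` is a hypothetical helper. A supplementary character takes 4 bytes in UTF-8 but occupies 2 UChars, so it averages 2 bytes per UChar and the maximum stays at 3. The last line confirms that the substitution bytes are the UTF-8 form of U+FFFD.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


if __name__ == "__main__":
    for ch in ("A", "\u00E9", "\u20AC", "\U00011480"):
        n8, n16 = bytes_per_uchar(ch)
        print(f"U+{ord(ch):04X}: {n8} byte(s) over {n16} UChar(s)")

    # U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
    print("\uFFFD".encode("utf-8"))
```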

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
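
The set above excludes only the surrogate range U+D800 to U+DFFF. As a small illustration using Python's strict UTF-8 codec (an assumption standing in for the ICU converter), the hypothetical helper `is_representable` below reports which code points can be encoded.

```python
def is_representable(code_point):
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder rejects lone surrogates.
        return False
    return True


if __name__ == "__main__":
    for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp))
```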