International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
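
As an illustration of how an alias list like this is typically used, the sketch below resolves any listed alias to the internal converter name. The matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption for this example, not a statement of ICU's exact algorithm, and `normalize`, `ALIAS_TABLE`, and `canonical_name` are hypothetical names.

```python
# Aliases copied from the list above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical helper: fold case and drop '-', '_' and spaces (assumed rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name, or None if the alias is not listed."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("Windows-65001"))  # UTF-8
print(canonical_name("latin-1"))        # None
```

A dictionary keyed on the normalized form keeps lookups constant-time and makes the matching rule easy to swap out if stricter comparison is wanted.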


Codepage Layout

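Currently showing the codepage starting with the bytes F4 81 A6. Row and column labels give the high and low hex digits of the final byte; each cell shows the code point that the complete four-byte sequence decodes to.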

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80 101980 101981 101982 101983 101984 101985 101986 101987 101988 101989 10198A 10198B 10198C 10198D 10198E 10198F
90 101990 101991 101992 101993 101994 101995 101996 101997 101998 101999 10199A 10199B 10199C 10199D 10199E 10199F
A0 1019A0 1019A1 1019A2 1019A3 1019A4 1019A5 1019A6 1019A7 1019A8 1019A9 1019AA 1019AB 1019AC 1019AD 1019AE 1019AF
B0 1019B0 1019B1 1019B2 1019B3 1019B4 1019B5 1019B6 1019B7 1019B8 1019B9 1019BA 1019BB 1019BC 1019BD 1019BE 1019BF

Rows 00-70 and C0-F0 contain no entries.
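
The code points in the layout follow from standard UTF-8 bit arithmetic on the lead bytes F4 81 A6 plus a final byte in the range 80-BF. The sketch below decodes one sequence by hand, cross-checks it against Python's built-in codec, and regenerates one row; `decode_by_hand` and `row` are hypothetical helper names introduced for this example.

```python
LEAD = bytes.fromhex("F481A6")  # lead bytes named in the layout above


def decode_by_hand(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence using bit arithmetic."""
    b0, b1, b2, b3 = seq
    # The first byte contributes 3 payload bits, each following byte 6 bits.
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def row(high_nibble: int) -> list:
    """Code points for one row of the grid (final bytes X0 through XF)."""
    return [
        decode_by_hand(LEAD + bytes([(high_nibble << 4) | low]))
        for low in range(16)
    ]


first = LEAD + b"\x80"
# The manual result agrees with the built-in UTF-8 decoder.
assert decode_by_hand(first) == ord(first.decode("utf-8"))

print(f"{decode_by_hand(first):06X}")            # 101980
print(" ".join(f"{cp:06X}" for cp in row(0xB)))  # 1019B0 through 1019BF
```

Only final bytes 80-BF are valid continuation bytes, which is why the other rows of the grid are empty.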

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
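
Two of these properties can be checked with a short sketch. In ICU4C a UChar is a 16-bit UTF-16 code unit, so the byte counts are per code unit rather than per character: a supplementary character takes 4 UTF-8 bytes but 2 UChars. The example uses Python's own codecs rather than ICU, and `bytes_per_utf16_unit` is a hypothetical helper name.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"

# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"

# Python's "replace" error handler substitutes the same character for bad input.
repaired = b"A\xFFB".decode("utf-8", errors="replace")
assert repaired == "A\uFFFDB"


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


for ch in ("A", "\u00E9", "\uFFFD", "\U00101980"):
    print(f"U+{ord(ch):06X}", bytes_per_utf16_unit(ch))
# U+000041 1.0
# U+0000E9 2.0
# U+00FFFD 3.0
# U+101980 2.0
```

The ratio never exceeds 3 because the only 4-byte sequences belong to characters that occupy two UTF-16 code units.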

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
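
The set above covers every code point except the surrogate range D800-DFFF. A minimal sketch of the same boundary, using Python's strict UTF-8 codec as a stand-in for the converter (`representable` is a hypothetical helper name):

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for surrogate code points, which UTF-8 does not encode.
        return False


# The code points just outside the surrogate range are representable.
assert representable(0xD7FF) and representable(0xE000)
# Both ends of the surrogate range are rejected.
assert not representable(0xD800) and not representable(0xDFFF)

# Size of the set: all code points minus the 2048 surrogates.
total = 0x110000 - (0xDFFF - 0xD800 + 1)
print(total)  # 1112064
```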