International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4889A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􈚀
108680
􈚁
108681
􈚂
108682
􈚃
108683
􈚄
108684
􈚅
108685
􈚆
108686
􈚇
108687
􈚈
108688
􈚉
108689
􈚊
10868A
􈚋
10868B
􈚌
10868C
􈚍
10868D
􈚎
10868E
􈚏
10868F
80
90
􈚐
108690
􈚑
108691
􈚒
108692
􈚓
108693
􈚔
108694
􈚕
108695
􈚖
108696
􈚗
108697
􈚘
108698
􈚙
108699
􈚚
10869A
􈚛
10869B
􈚜
10869C
􈚝
10869D
􈚞
10869E
􈚟
10869F
90
A0
􈚠
1086A0
􈚡
1086A1
􈚢
1086A2
􈚣
1086A3
􈚤
1086A4
􈚥
1086A5
􈚦
1086A6
􈚧
1086A7
􈚨
1086A8
􈚩
1086A9
􈚪
1086AA
􈚫
1086AB
􈚬
1086AC
􈚭
1086AD
􈚮
1086AE
􈚯
1086AF
A0
B0
􈚰
1086B0
􈚱
1086B1
􈚲
1086B2
􈚳
1086B3
􈚴
1086B4
􈚵
1086B5
􈚶
1086B6
􈚷
1086B7
􈚸
1086B8
􈚹
1086B9
􈚺
1086BA
􈚻
1086BB
􈚼
1086BC
􈚽
1086BD
􈚾
1086BE
􈚿
1086BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]