International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A598

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򥘀
A5600
򥘁
A5601
򥘂
A5602
򥘃
A5603
򥘄
A5604
򥘅
A5605
򥘆
A5606
򥘇
A5607
򥘈
A5608
򥘉
A5609
򥘊
A560A
򥘋
A560B
򥘌
A560C
򥘍
A560D
򥘎
A560E
򥘏
A560F
80
90
򥘐
A5610
򥘑
A5611
򥘒
A5612
򥘓
A5613
򥘔
A5614
򥘕
A5615
򥘖
A5616
򥘗
A5617
򥘘
A5618
򥘙
A5619
򥘚
A561A
򥘛
A561B
򥘜
A561C
򥘝
A561D
򥘞
A561E
򥘟
A561F
90
A0
򥘠
A5620
򥘡
A5621
򥘢
A5622
򥘣
A5623
򥘤
A5624
򥘥
A5625
򥘦
A5626
򥘧
A5627
򥘨
A5628
򥘩
A5629
򥘪
A562A
򥘫
A562B
򥘬
A562C
򥘭
A562D
򥘮
A562E
򥘯
A562F
A0
B0
򥘰
A5630
򥘱
A5631
򥘲
A5632
򥘳
A5633
򥘴
A5634
򥘵
A5635
򥘶
A5636
򥘷
A5637
򥘸
A5638
򥘹
A5639
򥘺
A563A
򥘻
A563B
򥘼
A563C
򥘽
A563D
򥘾
A563E
򥘿
A563F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]