International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1B199

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񱙀
71640
񱙁
71641
񱙂
71642
񱙃
71643
񱙄
71644
񱙅
71645
񱙆
71646
񱙇
71647
񱙈
71648
񱙉
71649
񱙊
7164A
񱙋
7164B
񱙌
7164C
񱙍
7164D
񱙎
7164E
񱙏
7164F
80
90
񱙐
71650
񱙑
71651
񱙒
71652
񱙓
71653
񱙔
71654
񱙕
71655
񱙖
71656
񱙗
71657
񱙘
71658
񱙙
71659
񱙚
7165A
񱙛
7165B
񱙜
7165C
񱙝
7165D
񱙞
7165E
񱙟
7165F
90
A0
񱙠
71660
񱙡
71661
񱙢
71662
񱙣
71663
񱙤
71664
񱙥
71665
񱙦
71666
񱙧
71667
񱙨
71668
񱙩
71669
񱙪
7166A
񱙫
7166B
񱙬
7166C
񱙭
7166D
񱙮
7166E
񱙯
7166F
A0
B0
񱙰
71670
񱙱
71671
񱙲
71672
񱙳
71673
񱙴
71674
񱙵
71675
񱙶
71676
񱙷
71677
񱙸
71678
񱙹
71679
񱙺
7167A
񱙻
7167B
񱙼
7167C
񱙽
7167D
񱙾
7167E
񱙿
7167F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]