International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A699

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򦙀
A6640
򦙁
A6641
򦙂
A6642
򦙃
A6643
򦙄
A6644
򦙅
A6645
򦙆
A6646
򦙇
A6647
򦙈
A6648
򦙉
A6649
򦙊
A664A
򦙋
A664B
򦙌
A664C
򦙍
A664D
򦙎
A664E
򦙏
A664F
80
90
򦙐
A6650
򦙑
A6651
򦙒
A6652
򦙓
A6653
򦙔
A6654
򦙕
A6655
򦙖
A6656
򦙗
A6657
򦙘
A6658
򦙙
A6659
򦙚
A665A
򦙛
A665B
򦙜
A665C
򦙝
A665D
򦙞
A665E
򦙟
A665F
90
A0
򦙠
A6660
򦙡
A6661
򦙢
A6662
򦙣
A6663
򦙤
A6664
򦙥
A6665
򦙦
A6666
򦙧
A6667
򦙨
A6668
򦙩
A6669
򦙪
A666A
򦙫
A666B
򦙬
A666C
򦙭
A666D
򦙮
A666E
򦙯
A666F
A0
B0
򦙰
A6670
򦙱
A6671
򦙲
A6672
򦙳
A6673
򦙴
A6674
򦙵
A6675
򦙶
A6676
򦙷
A6677
򦙸
A6678
򦙹
A6679
򦙺
A667A
򦙻
A667B
򦙼
A667C
򦙽
A667D
򦙾
A667E
򦙿
A667F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]