International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0AB99

Rows give the high nibble and columns the low nibble of the final byte. Each cell shows the character with its code point below it. Rows 00 through 70 and C0 through F0 contain no characters.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  𫙀     𫙁     𫙂     𫙃     𫙄     𫙅     𫙆     𫙇     𫙈     𫙉     𫙊     𫙋     𫙌     𫙍     𫙎     𫙏
    2B640  2B641  2B642  2B643  2B644  2B645  2B646  2B647  2B648  2B649  2B64A  2B64B  2B64C  2B64D  2B64E  2B64F
90  𫙐     𫙑     𫙒     𫙓     𫙔     𫙕     𫙖     𫙗     𫙘     𫙙     𫙚     𫙛     𫙜     𫙝     𫙞     𫙟
    2B650  2B651  2B652  2B653  2B654  2B655  2B656  2B657  2B658  2B659  2B65A  2B65B  2B65C  2B65D  2B65E  2B65F
A0  𫙠     𫙡     𫙢     𫙣     𫙤     𫙥     𫙦     𫙧     𫙨     𫙩     𫙪     𫙫     𫙬     𫙭     𫙮     𫙯
    2B660  2B661  2B662  2B663  2B664  2B665  2B666  2B667  2B668  2B669  2B66A  2B66B  2B66C  2B66D  2B66E  2B66F
B0  𫙰     𫙱     𫙲     𫙳     𫙴     𫙵     𫙶     𫙷     𫙸     𫙹     𫙺     𫙻     𫙼     𫙽     𫙾     𫙿
    2B670  2B671  2B672  2B673  2B674  2B675  2B676  2B677  2B678  2B679  2B67A  2B67B  2B67C  2B67D  2B67E  2B67F
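
The mapping from the displayed bytes to the code points in the table follows from the standard UTF-8 bit layout for four-byte sequences. The following is a minimal Python sketch, not part of the Converter Explorer itself; the names `decode_utf8_4byte`, `PREFIX`, and `cell` are hypothetical and chosen only for illustration. It appends a final byte to the lead bytes F0AB99 and extracts the payload bits by hand.

```python
def decode_utf8_4byte(data: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Only the byte shape is checked (11110xxx followed by three 10xxxxxx
    bytes); overlong forms and out-of-range values are not rejected.
    """
    if len(data) != 4 or data[0] & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte UTF-8 sequence")
    if any(b & 0xC0 != 0x80 for b in data[1:]):
        raise ValueError("invalid continuation byte")
    return (
        ((data[0] & 0x07) << 18)
        | ((data[1] & 0x3F) << 12)
        | ((data[2] & 0x3F) << 6)
        | (data[3] & 0x3F)
    )


# Lead bytes of the page shown in the layout (assumed from "F0AB99").
PREFIX = bytes.fromhex("F0AB99")


def cell(final_byte: int) -> int:
    """Code point of the table cell addressed by the final byte."""
    return decode_utf8_4byte(PREFIX + bytes([final_byte]))


if __name__ == "__main__":
    for final_byte in (0x80, 0x9F, 0xBF):
        print(f"{final_byte:02X} -> U+{cell(final_byte):05X}")
```

Only final bytes 80 through BF are valid continuation bytes, which is consistent with the other rows of the layout being empty.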

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
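
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence is spread over two UChars. The sketch below uses Python's built-in codecs rather than ICU, so it is an illustration of the arithmetic and not of ICU's API; `bytes_per_utf16_unit` and `SUBSTITUTION` are hypothetical names.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for ch in ("A", "\u20ac", "\U0002B640"):
        print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")
    print(SUBSTITUTION)
```

An ASCII character gives the minimum of 1, a three-byte BMP character gives the maximum of 3, and a supplementary character such as U+2B640 averages 2 because its four bytes correspond to a surrogate pair.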


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
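
The set pattern excludes only the surrogate range U+D800 to U+DFFF. As a rough cross-check outside ICU, Python's strict UTF-8 codec behaves the same way; the helper name `is_representable` is hypothetical.

```python
def is_representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates (U+D800 to U+DFFF).
        return False


if __name__ == "__main__":
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x2B640, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
```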