International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0AA99

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𪙀
2A640
𪙁
2A641
𪙂
2A642
𪙃
2A643
𪙄
2A644
𪙅
2A645
𪙆
2A646
𪙇
2A647
𪙈
2A648
𪙉
2A649
𪙊
2A64A
𪙋
2A64B
𪙌
2A64C
𪙍
2A64D
𪙎
2A64E
𪙏
2A64F
80
90
𪙐
2A650
𪙑
2A651
𪙒
2A652
𪙓
2A653
𪙔
2A654
𪙕
2A655
𪙖
2A656
𪙗
2A657
𪙘
2A658
𪙙
2A659
𪙚
2A65A
𪙛
2A65B
𪙜
2A65C
𪙝
2A65D
𪙞
2A65E
𪙟
2A65F
90
A0
𪙠
2A660
𪙡
2A661
𪙢
2A662
𪙣
2A663
𪙤
2A664
𪙥
2A665
𪙦
2A666
𪙧
2A667
𪙨
2A668
𪙩
2A669
𪙪
2A66A
𪙫
2A66B
𪙬
2A66C
𪙭
2A66D
𪙮
2A66E
𪙯
2A66F
A0
B0
𪙰
2A670
𪙱
2A671
𪙲
2A672
𪙳
2A673
𪙴
2A674
𪙵
2A675
𪙶
2A676
𪙷
2A677
𪙸
2A678
𪙹
2A679
𪙺
2A67A
𪙻
2A67B
𪙼
2A67C
𪙽
2A67D
𪙾
2A67E
𪙿
2A67F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]