International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38C99

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󌙀
CC640
󌙁
CC641
󌙂
CC642
󌙃
CC643
󌙄
CC644
󌙅
CC645
󌙆
CC646
󌙇
CC647
󌙈
CC648
󌙉
CC649
󌙊
CC64A
󌙋
CC64B
󌙌
CC64C
󌙍
CC64D
󌙎
CC64E
󌙏
CC64F
80
90
󌙐
CC650
󌙑
CC651
󌙒
CC652
󌙓
CC653
󌙔
CC654
󌙕
CC655
󌙖
CC656
󌙗
CC657
󌙘
CC658
󌙙
CC659
󌙚
CC65A
󌙛
CC65B
󌙜
CC65C
󌙝
CC65D
󌙞
CC65E
󌙟
CC65F
90
A0
󌙠
CC660
󌙡
CC661
󌙢
CC662
󌙣
CC663
󌙤
CC664
󌙥
CC665
󌙦
CC666
󌙧
CC667
󌙨
CC668
󌙩
CC669
󌙪
CC66A
󌙫
CC66B
󌙬
CC66C
󌙭
CC66D
󌙮
CC66E
󌙯
CC66F
A0
B0
󌙰
CC670
󌙱
CC671
󌙲
CC672
󌙳
CC673
󌙴
CC674
󌙵
CC675
󌙶
CC676
󌙷
CC677
󌙸
CC678
󌙹
CC679
󌙺
CC67A
󌙻
CC67B
󌙼
CC67C
󌙽
CC67D
󌙾
CC67E
󌙿
CC67F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]