International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1A899

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񨙀
68640
񨙁
68641
񨙂
68642
񨙃
68643
񨙄
68644
񨙅
68645
񨙆
68646
񨙇
68647
񨙈
68648
񨙉
68649
񨙊
6864A
񨙋
6864B
񨙌
6864C
񨙍
6864D
񨙎
6864E
񨙏
6864F
80
90
񨙐
68650
񨙑
68651
񨙒
68652
񨙓
68653
񨙔
68654
񨙕
68655
񨙖
68656
񨙗
68657
񨙘
68658
񨙙
68659
񨙚
6865A
񨙛
6865B
񨙜
6865C
񨙝
6865D
񨙞
6865E
񨙟
6865F
90
A0
񨙠
68660
񨙡
68661
񨙢
68662
񨙣
68663
񨙤
68664
񨙥
68665
񨙦
68666
񨙧
68667
񨙨
68668
񨙩
68669
񨙪
6866A
񨙫
6866B
񨙬
6866C
񨙭
6866D
񨙮
6866E
񨙯
6866F
A0
B0
񨙰
68670
񨙱
68671
񨙲
68672
񨙳
68673
񨙴
68674
񨙵
68675
񨙶
68676
񨙷
68677
񨙸
68678
񨙹
68679
񨙺
6867A
񨙻
6867B
񨙼
6867C
񨙽
6867D
񨙾
6867E
񨙿
6867F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]