International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A899

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򨙀
A8640
򨙁
A8641
򨙂
A8642
򨙃
A8643
򨙄
A8644
򨙅
A8645
򨙆
A8646
򨙇
A8647
򨙈
A8648
򨙉
A8649
򨙊
A864A
򨙋
A864B
򨙌
A864C
򨙍
A864D
򨙎
A864E
򨙏
A864F
80
90
򨙐
A8650
򨙑
A8651
򨙒
A8652
򨙓
A8653
򨙔
A8654
򨙕
A8655
򨙖
A8656
򨙗
A8657
򨙘
A8658
򨙙
A8659
򨙚
A865A
򨙛
A865B
򨙜
A865C
򨙝
A865D
򨙞
A865E
򨙟
A865F
90
A0
򨙠
A8660
򨙡
A8661
򨙢
A8662
򨙣
A8663
򨙤
A8664
򨙥
A8665
򨙦
A8666
򨙧
A8667
򨙨
A8668
򨙩
A8669
򨙪
A866A
򨙫
A866B
򨙬
A866C
򨙭
A866D
򨙮
A866E
򨙯
A866F
A0
B0
򨙰
A8670
򨙱
A8671
򨙲
A8672
򨙳
A8673
򨙴
A8674
򨙵
A8675
򨙶
A8676
򨙷
A8677
򨙸
A8678
򨙹
A8679
򨙺
A867A
򨙻
A867B
򨙼
A867C
򨙽
A867D
򨙾
A867E
򨙿
A867F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]