International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38D99

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󍙀
CD640
󍙁
CD641
󍙂
CD642
󍙃
CD643
󍙄
CD644
󍙅
CD645
󍙆
CD646
󍙇
CD647
󍙈
CD648
󍙉
CD649
󍙊
CD64A
󍙋
CD64B
󍙌
CD64C
󍙍
CD64D
󍙎
CD64E
󍙏
CD64F
80
90
󍙐
CD650
󍙑
CD651
󍙒
CD652
󍙓
CD653
󍙔
CD654
󍙕
CD655
󍙖
CD656
󍙗
CD657
󍙘
CD658
󍙙
CD659
󍙚
CD65A
󍙛
CD65B
󍙜
CD65C
󍙝
CD65D
󍙞
CD65E
󍙟
CD65F
90
A0
󍙠
CD660
󍙡
CD661
󍙢
CD662
󍙣
CD663
󍙤
CD664
󍙥
CD665
󍙦
CD666
󍙧
CD667
󍙨
CD668
󍙩
CD669
󍙪
CD66A
󍙫
CD66B
󍙬
CD66C
󍙭
CD66D
󍙮
CD66E
󍙯
CD66F
A0
B0
󍙰
CD670
󍙱
CD671
󍙲
CD672
󍙳
CD673
󍙴
CD674
󍙵
CD675
󍙶
CD676
󍙷
CD677
󍙸
CD678
󍙹
CD679
󍙺
CD67A
󍙻
CD67B
󍙼
CD67C
󍙽
CD67D
󍙾
CD67E
󍙿
CD67F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]