International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48B84

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􋄀
10B100
􋄁
10B101
􋄂
10B102
􋄃
10B103
􋄄
10B104
􋄅
10B105
􋄆
10B106
􋄇
10B107
􋄈
10B108
􋄉
10B109
􋄊
10B10A
􋄋
10B10B
􋄌
10B10C
􋄍
10B10D
􋄎
10B10E
􋄏
10B10F
80
90
􋄐
10B110
􋄑
10B111
􋄒
10B112
􋄓
10B113
􋄔
10B114
􋄕
10B115
􋄖
10B116
􋄗
10B117
􋄘
10B118
􋄙
10B119
􋄚
10B11A
􋄛
10B11B
􋄜
10B11C
􋄝
10B11D
􋄞
10B11E
􋄟
10B11F
90
A0
􋄠
10B120
􋄡
10B121
􋄢
10B122
􋄣
10B123
􋄤
10B124
􋄥
10B125
􋄦
10B126
􋄧
10B127
􋄨
10B128
􋄩
10B129
􋄪
10B12A
􋄫
10B12B
􋄬
10B12C
􋄭
10B12D
􋄮
10B12E
􋄯
10B12F
A0
B0
􋄰
10B130
􋄱
10B131
􋄲
10B132
􋄳
10B133
􋄴
10B134
􋄵
10B135
􋄶
10B136
􋄷
10B137
􋄸
10B138
􋄹
10B139
􋄺
10B13A
􋄻
10B13B
􋄼
10B13C
􋄽
10B13D
􋄾
10B13E
􋄿
10B13F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]