International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F19884

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񘄀
58100
񘄁
58101
񘄂
58102
񘄃
58103
񘄄
58104
񘄅
58105
񘄆
58106
񘄇
58107
񘄈
58108
񘄉
58109
񘄊
5810A
񘄋
5810B
񘄌
5810C
񘄍
5810D
񘄎
5810E
񘄏
5810F
80
90
񘄐
58110
񘄑
58111
񘄒
58112
񘄓
58113
񘄔
58114
񘄕
58115
񘄖
58116
񘄗
58117
񘄘
58118
񘄙
58119
񘄚
5811A
񘄛
5811B
񘄜
5811C
񘄝
5811D
񘄞
5811E
񘄟
5811F
90
A0
񘄠
58120
񘄡
58121
񘄢
58122
񘄣
58123
񘄤
58124
񘄥
58125
񘄦
58126
񘄧
58127
񘄨
58128
񘄩
58129
񘄪
5812A
񘄫
5812B
񘄬
5812C
񘄭
5812D
񘄮
5812E
񘄯
5812F
A0
B0
񘄰
58130
񘄱
58131
񘄲
58132
񘄳
58133
񘄴
58134
񘄵
58135
񘄶
58136
񘄷
58137
񘄸
58138
񘄹
58139
񘄺
5813A
񘄻
5813B
񘄼
5813C
񘄽
5813D
񘄾
5813E
񘄿
5813F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]