International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
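
The alias list above can be treated as a lookup table from any listed name to the internal converter name. The following is a minimal sketch of that idea, not ICU's API: `normalize` and `canonical_name` are hypothetical helpers, and the loose matching (ignoring case and punctuation) is an assumption made for illustration; ICU's own name comparison may differ in detail.

```python
# Aliases copied from the list above; all map to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Hypothetical helper: lowercase and keep only letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isalnum())


# Map every normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))
print(canonical_name("utf8"))
print(canonical_name("latin-1"))
```

With this sketch, `IBM_1208` and `utf8` both resolve to `UTF-8`, while a name that is not in the list returns `None`.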


Codepage Layout

Currently showing the codepage starting with the bytes F39084

Rows give the high nibble of the final byte, columns the low nibble; each cell is the Unicode code point in hex. Rows 00-70 and C0-F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  D0100 D0101 D0102 D0103 D0104 D0105 D0106 D0107 D0108 D0109 D010A D010B D010C D010D D010E D010F
90  D0110 D0111 D0112 D0113 D0114 D0115 D0116 D0117 D0118 D0119 D011A D011B D011C D011D D011E D011F
A0  D0120 D0121 D0122 D0123 D0124 D0125 D0126 D0127 D0128 D0129 D012A D012B D012C D012D D012E D012F
B0  D0130 D0131 D0132 D0133 D0134 D0135 D0136 D0137 D0138 D0139 D013A D013B D013C D013D D013E D013F
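
The layout above follows from the standard UTF-8 bit layout for four-byte sequences: the lead byte contributes 3 payload bits and each following byte contributes 6. The sketch below recomputes the page for the lead bytes F3 90 84 using Python's built-in codec rather than ICU; the helper names `decode4`, `page_row`, and `trail_is_valid` are made up for this illustration.

```python
LEAD = bytes.fromhex("F39084")  # the three lead bytes shown on this page


def decode4(b0, b1, b2, b3):
    """Combine the payload bits of a four-byte UTF-8 sequence by hand."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def page_row(row):
    """Code points for final bytes row+0 .. row+15, decoded by Python's codec."""
    return [ord((LEAD + bytes([row + col])).decode("utf-8")) for col in range(16)]


def trail_is_valid(final_byte):
    """True if LEAD followed by final_byte is a well-formed UTF-8 sequence."""
    try:
        (LEAD + bytes([final_byte])).decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Print the four populated rows of the table.
for row in (0x80, 0x90, 0xA0, 0xB0):
    print(f"{row:02X}  " + " ".join(f"{cp:05X}" for cp in page_row(row)))

# The hand computation agrees with the codec for the first cell.
print(f"{decode4(0xF3, 0x90, 0x84, 0x80):05X}")
```

Only final bytes in the range 80-BF form a well-formed sequence, which is why the other rows of the table are empty; `trail_is_valid` can be used to check this for any byte value.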

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
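
One way to read the byte counts above: a UChar in ICU is a 16-bit UTF-16 code unit, so a character outside the BMP takes four UTF-8 bytes but also two UChars, which keeps the per-UChar maximum at 3. The sketch below checks this with Python's built-in codecs, not ICU; `bytes_per_utf16_unit` is a hypothetical helper, and the sample characters are chosen only for illustration.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes from the table


def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by the number of UTF-16 code units for ch."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 Supplement, a three-byte BMP character, and a plane-13 code point.
for ch in ("A", "\u00E9", "\u20AC", "\U000D0100"):
    print(f"U+{ord(ch):04X}: {bytes_per_utf16_unit(ch)} bytes per UTF-16 unit")

# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
print(f"U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
```

The ratio ranges from 1 for ASCII to 3 for BMP characters such as U+20AC, and drops to 2 for supplementary characters.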

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
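
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below compares that rule with what Python's built-in UTF-8 codec accepts; it does not call ICU, and `is_representable` and `utf8_accepts` are hypothetical helper names.

```python
def is_representable(cp):
    """Membership test for the set above: any code point except surrogates."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def utf8_accepts(cp):
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Sample points around the surrogate boundaries plus the page shown earlier.
SAMPLES = [0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD0100, 0x10FFFF]

for cp in SAMPLES:
    print(f"U+{cp:04X}: set={is_representable(cp)} codec={utf8_accepts(cp)}")
```

For these samples the two tests agree: only U+D800 and U+DFFF are rejected.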