International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2B384

Rows give the high nibble of the final byte, columns the low nibble. Each cell shows the code point (hexadecimal) encoded by F2 B3 84 followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     B3100 B3101 B3102 B3103 B3104 B3105 B3106 B3107 B3108 B3109 B310A B310B B310C B310D B310E B310F
90     B3110 B3111 B3112 B3113 B3114 B3115 B3116 B3117 B3118 B3119 B311A B311B B311C B311D B311E B311F
A0     B3120 B3121 B3122 B3123 B3124 B3125 B3126 B3127 B3128 B3129 B312A B312B B312C B312D B312E B312F
B0     B3130 B3131 B3132 B3133 B3134 B3135 B3136 B3137 B3138 B3139 B313A B313B B313C B313D B313E B313F
C0-F0  (empty)
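
The layout above can be reproduced without ICU. The following is a minimal sketch that uses Python's built-in `utf-8` codec rather than the ICU converter. The helper name `code_points_for_prefix` is an illustrative assumption and not part of any ICU API. It tries every possible final byte after the prefix F2 B3 84 and keeps only the well-formed sequences.

```python
def code_points_for_prefix(prefix: bytes) -> dict:
    """Map each valid final byte to the code point decoded from prefix + byte."""
    page = {}
    for trail in range(0x100):
        try:
            text = (prefix + bytes([trail])).decode("utf-8")
        except UnicodeDecodeError:
            continue  # ill-formed sequence: the corresponding cell stays empty
        if len(text) == 1:
            page[trail] = ord(text)
    return page


page = code_points_for_prefix(bytes.fromhex("F2B384"))
for row in range(0x80, 0xC0, 0x10):
    cells = " ".join(f"{page[row + col]:X}" for col in range(16))
    print(f"{row:02X}  {cells}")
```

Only final bytes 80 through BF decode successfully, which is why rows 00 to 70 and C0 to F0 are empty in the layout.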

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
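
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a supplementary code point takes 4 bytes in UTF-8: it also occupies two UChars. The sketch below illustrates this with Python's standard codecs, not with ICU. The helper `utf8_bytes_per_utf16_unit` and the sample characters are assumptions chosen for illustration. In ICU4C, the corresponding values are reported by `ucnv_getMinCharSize`, `ucnv_getMaxCharSize`, and `ucnv_getSubstChars`.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 length of one character divided by its length in UTF-16 code units."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000B3100"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution.hex(" ").upper())
```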

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
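
The set above covers every code point except the surrogates U+D800 to U+DFFF, which have no well-formed UTF-8 encoding. As a rough check, assuming Python's strict `utf-8` codec as a stand-in for the ICU converter, the hypothetical helper `utf8_representable` below reports which code points can be encoded.

```python
def utf8_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded by the strict UTF-8 codec."""
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False  # Python rejects lone surrogates in strict mode
    return True


# Collect every code point in U+0000..U+10FFFF that cannot be encoded.
excluded = [cp for cp in range(0x110000) if not utf8_representable(cp)]
print(f"{excluded[0]:04X}-{excluded[-1]:04X}, {len(excluded)} code points")
```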