International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
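
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch in Python, not ICU's own implementation: the helper names `normalize` and `canonical_name` are hypothetical, and the loose matching (lowercasing and dropping `-`, `_` and spaces) is a simplifying assumption that only approximates how a converter library might compare names.

```python
# Aliases copied from the list above. The helpers below are hypothetical
# illustrations, not part of ICU or of Python's codecs module.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase the name and drop '-', '_' and spaces (assumed loose matching)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this sketch, `canonical_name("IBM_1208")` and `canonical_name("utf8")` both resolve to `"UTF-8"`, while a name that is not in the list resolves to `None`.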


Codepage Layout

Currently showing the codepage starting with the bytes F0B589

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     𵉀     𵉁     𵉂     𵉃     𵉄     𵉅     𵉆     𵉇     𵉈     𵉉     𵉊     𵉋     𵉌     𵉍     𵉎     𵉏
      35240 35241 35242 35243 35244 35245 35246 35247 35248 35249 3524A 3524B 3524C 3524D 3524E 3524F
90     𵉐     𵉑     𵉒     𵉓     𵉔     𵉕     𵉖     𵉗     𵉘     𵉙     𵉚     𵉛     𵉜     𵉝     𵉞     𵉟
      35250 35251 35252 35253 35254 35255 35256 35257 35258 35259 3525A 3525B 3525C 3525D 3525E 3525F
A0     𵉠     𵉡     𵉢     𵉣     𵉤     𵉥     𵉦     𵉧     𵉨     𵉩     𵉪     𵉫     𵉬     𵉭     𵉮     𵉯
      35260 35261 35262 35263 35264 35265 35266 35267 35268 35269 3526A 3526B 3526C 3526D 3526E 3526F
B0     𵉰     𵉱     𵉲     𵉳     𵉴     𵉵     𵉶     𵉷     𵉸     𵉹     𵉺     𵉻     𵉼     𵉽     𵉾     𵉿
      35270 35271 35272 35273 35274 35275 35276 35277 35278 35279 3527A 3527B 3527C 3527D 3527E 3527F
C0-F0  (empty)
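
The layout above can be reproduced by appending each possible fourth byte to the lead bytes F0 B5 89 and decoding the result. The sketch below uses Python's built-in UTF-8 codec rather than ICU, so it only illustrates the mapping; `decode_cell` is a hypothetical helper name.

```python
# The three lead bytes shown in "Currently showing the codepage starting with ...".
LEAD = bytes.fromhex("F0B589")


def decode_cell(trail):
    """Return the code point (uppercase hex) for LEAD + trail byte, or None if invalid."""
    try:
        char = (LEAD + bytes([trail])).decode("utf-8")
    except UnicodeDecodeError:
        # Trail bytes outside 0x80..0xBF are not valid continuation bytes.
        return None
    return f"{ord(char):X}"


# Print one line per row of the layout; empty cells are shown as dots.
for row in range(0x00, 0x100, 0x10):
    cells = [decode_cell(row + col) or "....." for col in range(16)]
    print(f"{row:02X}", " ".join(cells))
```

Only the rows 80 through B0 produce characters, which matches the table: a UTF-8 continuation byte always has the bit pattern 10xxxxxx.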

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
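
Two of the values above can be checked with a short script. This is a sketch using Python's standard codecs, not ICU; it assumes that a UChar is one 16-bit UTF-16 code unit, and `bytes_per_uchar` is a hypothetical helper. Under that reading, a supplementary character takes 4 UTF-8 bytes but spans 2 UChars, so the per-UChar maximum of 3 is reached inside the Basic Multilingual Plane.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution character is U+FFFD encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")

print(bytes_per_uchar("A"))           # ASCII: 1 byte, 1 code unit
print(bytes_per_uchar("\uFFFF"))      # top of the BMP: 3 bytes, 1 code unit
print(bytes_per_uchar("\U00035240"))  # supplementary: 4 bytes, 2 code units
print(SUBSTITUTION)
```

Python's decoder uses the same replacement character when asked to substitute for invalid input, for example `b"\xff".decode("utf-8", errors="replace")`.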

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
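
The set above reads as "every code point except the surrogates U+D800 through U+DFFF". A minimal sketch of that membership test, compared against Python's strict UTF-8 encoder (an assumption of this example, since the page itself describes ICU), might look like this; `in_set` and `encodable` are hypothetical helper names.

```python
def in_set(cp):
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodable(cp):
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Check the boundaries on both sides of the surrogate block.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", in_set(cp), encodable(cp))
```

The two functions agree at each boundary: the code points just outside the surrogate block are accepted and the surrogates themselves are rejected.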