International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18289

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񂉀
42240
񂉁
42241
񂉂
42242
񂉃
42243
񂉄
42244
񂉅
42245
񂉆
42246
񂉇
42247
񂉈
42248
񂉉
42249
񂉊
4224A
񂉋
4224B
񂉌
4224C
񂉍
4224D
񂉎
4224E
񂉏
4224F
80
90
񂉐
42250
񂉑
42251
񂉒
42252
񂉓
42253
񂉔
42254
񂉕
42255
񂉖
42256
񂉗
42257
񂉘
42258
񂉙
42259
񂉚
4225A
񂉛
4225B
񂉜
4225C
񂉝
4225D
񂉞
4225E
񂉟
4225F
90
A0
񂉠
42260
񂉡
42261
񂉢
42262
񂉣
42263
񂉤
42264
񂉥
42265
񂉦
42266
񂉧
42267
񂉨
42268
񂉩
42269
񂉪
4226A
񂉫
4226B
񂉬
4226C
񂉭
4226D
񂉮
4226E
񂉯
4226F
A0
B0
񂉰
42270
񂉱
42271
񂉲
42272
񂉳
42273
񂉴
42274
񂉵
42275
񂉶
42276
񂉷
42277
񂉸
42278
񂉹
42279
񂉺
4227A
񂉻
4227B
񂉼
4227C
񂉽
4227D
񂉾
4227E
񂉿
4227F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]