International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
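
A minimal sketch of how an alias list like the one above could be used to resolve a name to its internal converter name. The table is copied from this page; the `normalize` rule (lowercase, keep only letters and digits) is a simplifying assumption for illustration and is not claimed to match ICU's own name comparison exactly. `ALIASES`, `normalize`, and `canonical_name` are hypothetical names.

```python
import re

# Alias table copied from the list above: internal name -> all aliases.
ALIASES = {
    "UTF-8": [
        "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
        "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
        "windows-65001", "cp1208", "x-UTF_8J",
        "unicode-1-1-utf-8", "unicode-2-0-utf-8",
    ],
}


def normalize(name: str) -> str:
    """Assumed loose-matching rule: lowercase, drop non-alphanumerics."""
    return re.sub(r"[^0-9a-z]", "", name.lower())


# Reverse index: normalized alias -> internal converter name.
LOOKUP = {
    normalize(alias): internal
    for internal, aliases in ALIASES.items()
    for alias in aliases
}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return LOOKUP.get(normalize(name))


print(canonical_name("IBM_1208"))       # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin1"))         # None
```

A reverse index keeps each lookup to a single dictionary access; a real converter registry would hold many internal names and would need a policy for aliases that map to more than one of them.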


Codepage Layout

Currently showing the codepage starting with the bytes F2848D

Rows give the high nibble and columns the low nibble of the fourth byte. Each cell is the Unicode code point (hex) for that byte sequence.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  84340 84341 84342 84343 84344 84345 84346 84347 84348 84349 8434A 8434B 8434C 8434D 8434E 8434F
90  84350 84351 84352 84353 84354 84355 84356 84357 84358 84359 8435A 8435B 8435C 8435D 8435E 8435F
A0  84360 84361 84362 84363 84364 84365 84366 84367 84368 84369 8436A 8436B 8436C 8436D 8436E 8436F
B0  84370 84371 84372 84373 84374 84375 84376 84377 84378 84379 8437A 8437B 8437C 8437D 8437E 8437F
C0-F0  (empty)
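
The code points in the layout follow from the standard UTF-8 bit layout for 4-byte sequences (3 payload bits in the lead byte, 6 in each trailing byte). The sketch below reproduces that arithmetic for the prefix F2848D; `decode_utf8_4byte` is a hypothetical helper written for this illustration, not an ICU function.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic."""
    if len(seq) != 4:
        raise ValueError("expected exactly 4 bytes")
    lead, t1, t2, t3 = seq
    if lead & 0xF8 != 0xF0:
        raise ValueError("lead byte is not of the form 11110xxx")
    for trail in (t1, t2, t3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trailing byte outside 80..BF")
    cp = ((lead & 0x07) << 18) | ((t1 & 0x3F) << 12) | ((t2 & 0x3F) << 6) | (t3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


prefix = bytes.fromhex("F2848D")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X}..{last:X}")  # 84340..8437F

# Cross-check against Python's built-in UTF-8 codec.
assert (prefix + b"\x80").decode("utf-8") == chr(first)
```

Only fourth bytes 80 through BF are valid trailing bytes, which is why rows 00 to 70 and C0 to F0 of the layout are empty.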

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
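
The byte counts above are per UChar, that is, per UTF-16 code unit. A short sketch using Python's built-in codecs (not ICU) shows why the maximum is 3 rather than 4: a supplementary character takes 4 UTF-8 bytes but spans 2 UTF-16 code units. It also checks that the substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD. `utf8_bytes_per_uchar` is a hypothetical helper, and Python's `errors="replace"` handler is only an analogue of ICU's substitution behavior.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 supplement, BMP symbol, supplementary-plane code point.
for ch in ("A", "\u00e9", "\u20ac", "\U00084340"):
    print(f"U+{ord(ch):04X}: {utf8_bytes_per_uchar(ch)} bytes per UChar")

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'

# Python's own decoder inserts U+FFFD for an invalid byte.
print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")  # True
```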

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
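
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough illustration, the membership test can be written directly, and Python's built-in UTF-8 codec (used here in place of ICU) rejects a lone surrogate in the same way. `is_utf8_representable` is a hypothetical name.

```python
def is_utf8_representable(cp: int) -> bool:
    """True for any Unicode code point except the surrogates D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


print(is_utf8_representable(0x84340))  # True
print(is_utf8_representable(0xD800))   # False

# Python's strict UTF-8 encoder refuses a lone surrogate as well.
try:
    "\ud800".encode("utf-8")
    surrogate_rejected = False
except UnicodeEncodeError:
    surrogate_rejected = True
print(surrogate_rejected)  # True
```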