International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A092

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󠒀
E0480
󠒁
E0481
󠒂
E0482
󠒃
E0483
󠒄
E0484
󠒅
E0485
󠒆
E0486
󠒇
E0487
󠒈
E0488
󠒉
E0489
󠒊
E048A
󠒋
E048B
󠒌
E048C
󠒍
E048D
󠒎
E048E
󠒏
E048F
80
90
󠒐
E0490
󠒑
E0491
󠒒
E0492
󠒓
E0493
󠒔
E0494
󠒕
E0495
󠒖
E0496
󠒗
E0497
󠒘
E0498
󠒙
E0499
󠒚
E049A
󠒛
E049B
󠒜
E049C
󠒝
E049D
󠒞
E049E
󠒟
E049F
90
A0
󠒠
E04A0
󠒡
E04A1
󠒢
E04A2
󠒣
E04A3
󠒤
E04A4
󠒥
E04A5
󠒦
E04A6
󠒧
E04A7
󠒨
E04A8
󠒩
E04A9
󠒪
E04AA
󠒫
E04AB
󠒬
E04AC
󠒭
E04AD
󠒮
E04AE
󠒯
E04AF
A0
B0
󠒰
E04B0
󠒱
E04B1
󠒲
E04B2
󠒳
E04B3
󠒴
E04B4
󠒵
E04B5
󠒶
E04B6
󠒷
E04B7
󠒸
E04B8
󠒹
E04B9
󠒺
E04BA
󠒻
E04BB
󠒼
E04BC
󠒽
E04BD
󠒾
E04BE
󠒿
E04BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]