International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
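
As a rough illustration of how the alias list can be used, the following Python sketch maps any listed alias to the internal converter name. The alias set is copied from the list above; the `canonical_name` helper and its case-insensitive matching are assumptions made for illustration and are not taken from ICU's own lookup code.

```python
# Aliases copied from the list above; all of them name the same converter.
INTERNAL_NAME = "UTF-8"
ALIASES = {
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
}

# Pre-compute a lowercase view so lookups ignore case (an assumption here).
_KNOWN = {alias.lower() for alias in ALIASES}


def canonical_name(alias):
    """Hypothetical helper: return the internal name for a listed alias, else None."""
    return INTERNAL_NAME if alias.strip().lower() in _KNOWN else None


print(canonical_name("IBM-1208"))
print(canonical_name("latin1"))
```

Names that are not in the list, such as `latin1`, yield `None` in this sketch rather than raising an error.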


Codepage Layout

Currently showing the codepage starting with the bytes F2 82 92

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hex) of the resulting character. Rows 00-70 and C0-F0 contain no characters.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  82480 82481 82482 82483 82484 82485 82486 82487 82488 82489 8248A 8248B 8248C 8248D 8248E 8248F
90  82490 82491 82492 82493 82494 82495 82496 82497 82498 82499 8249A 8249B 8249C 8249D 8249E 8249F
A0  824A0 824A1 824A2 824A3 824A4 824A5 824A6 824A7 824A8 824A9 824AA 824AB 824AC 824AD 824AE 824AF
B0  824B0 824B1 824B2 824B3 824B4 824B5 824B6 824B7 824B8 824B9 824BA 824BB 824BC 824BD 824BE 824BF
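
The mapping in the layout above follows from the standard UTF-8 bit layout for four-byte sequences. The Python sketch below decodes the lead bytes F2 82 92 plus a trail byte by hand and cross-checks the result against Python's built-in decoder. The function name `decode_four_byte` is illustrative only, and the sketch checks just the byte shape, not every well-formedness rule.

```python
def decode_four_byte(seq):
    """Decode one 4-byte UTF-8 sequence (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx)."""
    b0, b1, b2, b3 = seq
    if b0 & 0xF8 != 0xF0 or any(b & 0xC0 != 0x80 for b in (b1, b2, b3)):
        raise ValueError("not a 4-byte UTF-8 sequence")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


LEAD = bytes.fromhex("F28292")

first = decode_four_byte(LEAD + b"\x80")
last = decode_four_byte(LEAD + b"\xBF")
print(f"{first:X} {last:X}")  # prints "82480 824BF"

# Cross-check against the built-in UTF-8 decoder.
assert ord((LEAD + b"\x80").decode("utf-8")) == first

# A final byte outside 80-BF is not a continuation byte, so decoding fails.
try:
    (LEAD + b"\x41").decode("utf-8")
except UnicodeDecodeError:
    print("F2 82 92 41 is not valid UTF-8")
```

Only final bytes 80 through BF carry the `10xxxxxx` continuation pattern, which is why the other rows of the layout are empty.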

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
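
The byte counts above are stated per UChar, which in ICU is a 16-bit UTF-16 code unit. The Python sketch below counts UTF-8 bytes and UTF-16 code units for a few sample characters and shows the encoded form of U+FFFD, which matches the substitution character listed. The helper `bytes_per_uchar` and the sample characters are illustrative choices. Reading the maximum of 3 as "a four-byte sequence spans two UChars" is one plausible interpretation, not something this page states.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_len = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len, uchar_len


# U+FFFD REPLACEMENT CHARACTER encodes to the listed substitution bytes.
SUBSTITUTION = "\uFFFD".encode("utf-8")  # b'\xef\xbf\xbd'

for ch in ("A", "\u00E9", "\uFFFD", "\U00082480"):
    n8, n16 = bytes_per_uchar(ch)
    print(f"U+{ord(ch):04X}: {n8} UTF-8 bytes for {n16} UChar(s)")
```

In this sketch a supplementary character such as U+82480 takes four UTF-8 bytes but two UChars, so the ratio stays at or below three bytes per UChar.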


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
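
The set above excludes only the surrogate range U+D800 to U+DFFF. As a small sketch, the Python code below tests membership in that set and compares it with what Python's strict UTF-8 encoder accepts. Both helper names are hypothetical and chosen for illustration.

```python
def is_representable(cp):
    """True if cp is a valid code point outside the surrogate range U+D800..U+DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# The two checks agree on the boundaries of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x82480, 0x10FFFF):
    assert is_representable(cp) == encodes_in_utf8(cp)
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Lone surrogates are rejected because they are reserved for UTF-16 pairs and are not encodable in well-formed UTF-8.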