International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A492

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򤒀
A4480
򤒁
A4481
򤒂
A4482
򤒃
A4483
򤒄
A4484
򤒅
A4485
򤒆
A4486
򤒇
A4487
򤒈
A4488
򤒉
A4489
򤒊
A448A
򤒋
A448B
򤒌
A448C
򤒍
A448D
򤒎
A448E
򤒏
A448F
80
90
򤒐
A4490
򤒑
A4491
򤒒
A4492
򤒓
A4493
򤒔
A4494
򤒕
A4495
򤒖
A4496
򤒗
A4497
򤒘
A4498
򤒙
A4499
򤒚
A449A
򤒛
A449B
򤒜
A449C
򤒝
A449D
򤒞
A449E
򤒟
A449F
90
A0
򤒠
A44A0
򤒡
A44A1
򤒢
A44A2
򤒣
A44A3
򤒤
A44A4
򤒥
A44A5
򤒦
A44A6
򤒧
A44A7
򤒨
A44A8
򤒩
A44A9
򤒪
A44AA
򤒫
A44AB
򤒬
A44AC
򤒭
A44AD
򤒮
A44AE
򤒯
A44AF
A0
B0
򤒰
A44B0
򤒱
A44B1
򤒲
A44B2
򤒳
A44B3
򤒴
A44B4
򤒵
A44B5
򤒶
A44B6
򤒷
A44B7
򤒸
A44B8
򤒹
A44B9
򤒺
A44BA
򤒻
A44BB
򤒼
A44BC
򤒽
A44BD
򤒾
A44BE
򤒿
A44BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]