International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1 B1 95

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the Unicode code point (hexadecimal) that the full byte sequence maps to.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     71540 71541 71542 71543 71544 71545 71546 71547 71548 71549 7154A 7154B 7154C 7154D 7154E 7154F
90     71550 71551 71552 71553 71554 71555 71556 71557 71558 71559 7155A 7155B 7155C 7155D 7155E 7155F
A0     71560 71561 71562 71563 71564 71565 71566 71567 71568 71569 7156A 7156B 7156C 7156D 7156E 7156F
B0     71570 71571 71572 71573 71574 71575 71576 71577 71578 71579 7157A 7157B 7157C 7157D 7157E 7157F
C0-F0  (empty)
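
The layout above follows directly from UTF-8 bit packing: a four-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte. The sketch below reproduces the first and last cells of the table for the prefix F1 B1 95. `decode_four_byte` is a hypothetical helper written for this illustration; it checks only the byte shape, not overlong forms or the U+10FFFF limit.

```python
def decode_four_byte(seq):
    """Decode one 4-byte UTF-8 sequence to its code point (shape checks only)."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte in F0..F7")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in 80..BF")
    return (
        ((seq[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((seq[1] & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


PREFIX = bytes([0xF1, 0xB1, 0x95])
first = decode_four_byte(PREFIX + b"\x80")
last = decode_four_byte(PREFIX + b"\xBF")
print(f"U+{first:X} .. U+{last:X}")  # U+71540 .. U+7157F

# Cross-check against Python's built-in UTF-8 decoder.
assert (PREFIX + b"\x80").decode("utf-8") == chr(first)
```

Only final bytes 80 through BF are valid trail bytes, which is why the rows 00 to 70 and C0 to F0 are empty.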

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
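
Several of these properties can be checked with plain arithmetic. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and assumes a UChar is one UTF-16 code unit; `bytes_per_utf16_unit` is a hypothetical helper. It shows that the substitution bytes are the UTF-8 form of U+FFFD, that printable ASCII maps to identical byte values, and that the ratio of UTF-8 bytes to UTF-16 code units ranges from 1 to 3 for the sample characters.

```python
# The substitution character bytes are the UTF-8 encoding of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert "\uFFFD".encode("utf-8") == SUBSTITUTION

# Printable ASCII [\x20-\x7E] encodes to the same single byte value.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)


def bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One-, two-, three-, and four-byte UTF-8 characters.
samples = ["A", "\u00E9", "\u20AC", "\U00071540"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ascii_compatible, ratios)  # True [1.0, 2.0, 3.0, 2.0]
```

A four-byte sequence spans two UTF-16 code units, so it contributes 2 bytes per unit; the maximum of 3 comes from three-byte sequences in the Basic Multilingual Plane.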

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
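
The set above excludes only the surrogate range. A minimal sketch of that membership test follows, compared against Python's strict UTF-8 encoder (not ICU); `is_representable` and `encodes_strictly` are hypothetical helpers introduced for this illustration.

```python
def is_representable(cp):
    """True if the code point is in [^\\uD800-\\uDFFF] within U+0000..U+10FFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_strictly(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the surrogate block, plus one from the table above.
checks = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x71540, 0x10FFFF]
for cp in checks:
    assert is_representable(cp) == encodes_strictly(cp)

print([f"{cp:04X}" for cp in checks if not is_representable(cp)])  # ['D800', 'DFFF']
```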