International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B995

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󹕀
F9540
󹕁
F9541
󹕂
F9542
󹕃
F9543
󹕄
F9544
󹕅
F9545
󹕆
F9546
󹕇
F9547
󹕈
F9548
󹕉
F9549
󹕊
F954A
󹕋
F954B
󹕌
F954C
󹕍
F954D
󹕎
F954E
󹕏
F954F
80
90
󹕐
F9550
󹕑
F9551
󹕒
F9552
󹕓
F9553
󹕔
F9554
󹕕
F9555
󹕖
F9556
󹕗
F9557
󹕘
F9558
󹕙
F9559
󹕚
F955A
󹕛
F955B
󹕜
F955C
󹕝
F955D
󹕞
F955E
󹕟
F955F
90
A0
󹕠
F9560
󹕡
F9561
󹕢
F9562
󹕣
F9563
󹕤
F9564
󹕥
F9565
󹕦
F9566
󹕧
F9567
󹕨
F9568
󹕩
F9569
󹕪
F956A
󹕫
F956B
󹕬
F956C
󹕭
F956D
󹕮
F956E
󹕯
F956F
A0
B0
󹕰
F9570
󹕱
F9571
󹕲
F9572
󹕳
F9573
󹕴
F9574
󹕵
F9575
󹕶
F9576
󹕷
F9577
󹕸
F9578
󹕹
F9579
󹕺
F957A
󹕻
F957B
󹕼
F957C
󹕽
F957D
󹕾
F957E
󹕿
F957F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]