International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38995

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󉕀
C9540
󉕁
C9541
󉕂
C9542
󉕃
C9543
󉕄
C9544
󉕅
C9545
󉕆
C9546
󉕇
C9547
󉕈
C9548
󉕉
C9549
󉕊
C954A
󉕋
C954B
󉕌
C954C
󉕍
C954D
󉕎
C954E
󉕏
C954F
80
90
󉕐
C9550
󉕑
C9551
󉕒
C9552
󉕓
C9553
󉕔
C9554
󉕕
C9555
󉕖
C9556
󉕗
C9557
󉕘
C9558
󉕙
C9559
󉕚
C955A
󉕛
C955B
󉕜
C955C
󉕝
C955D
󉕞
C955E
󉕟
C955F
90
A0
󉕠
C9560
󉕡
C9561
󉕢
C9562
󉕣
C9563
󉕤
C9564
󉕥
C9565
󉕦
C9566
󉕧
C9567
󉕨
C9568
󉕩
C9569
󉕪
C956A
󉕫
C956B
󉕬
C956C
󉕭
C956D
󉕮
C956E
󉕯
C956F
A0
B0
󉕰
C9570
󉕱
C9571
󉕲
C9572
󉕳
C9573
󉕴
C9574
󉕵
C9575
󉕶
C9576
󉕷
C9577
󉕸
C9578
󉕹
C9579
󉕺
C957A
󉕻
C957B
󉕼
C957C
󉕽
C957D
󉕾
C957E
󉕿
C957F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]