International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F38395

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃕀
C3540
󃕁
C3541
󃕂
C3542
󃕃
C3543
󃕄
C3544
󃕅
C3545
󃕆
C3546
󃕇
C3547
󃕈
C3548
󃕉
C3549
󃕊
C354A
󃕋
C354B
󃕌
C354C
󃕍
C354D
󃕎
C354E
󃕏
C354F
80
90
󃕐
C3550
󃕑
C3551
󃕒
C3552
󃕓
C3553
󃕔
C3554
󃕕
C3555
󃕖
C3556
󃕗
C3557
󃕘
C3558
󃕙
C3559
󃕚
C355A
󃕛
C355B
󃕜
C355C
󃕝
C355D
󃕞
C355E
󃕟
C355F
90
A0
󃕠
C3560
󃕡
C3561
󃕢
C3562
󃕣
C3563
󃕤
C3564
󃕥
C3565
󃕦
C3566
󃕧
C3567
󃕨
C3568
󃕩
C3569
󃕪
C356A
󃕫
C356B
󃕬
C356C
󃕭
C356D
󃕮
C356E
󃕯
C356F
A0
B0
󃕰
C3570
󃕱
C3571
󃕲
C3572
󃕳
C3573
󃕴
C3574
󃕵
C3575
󃕶
C3576
󃕷
C3577
󃕸
C3578
󃕹
C3579
󃕺
C357A
󃕻
C357B
󃕼
C357C
󃕽
C357D
󃕾
C357E
󃕿
C357F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]