International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38099

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀙀
C0640
󀙁
C0641
󀙂
C0642
󀙃
C0643
󀙄
C0644
󀙅
C0645
󀙆
C0646
󀙇
C0647
󀙈
C0648
󀙉
C0649
󀙊
C064A
󀙋
C064B
󀙌
C064C
󀙍
C064D
󀙎
C064E
󀙏
C064F
80
90
󀙐
C0650
󀙑
C0651
󀙒
C0652
󀙓
C0653
󀙔
C0654
󀙕
C0655
󀙖
C0656
󀙗
C0657
󀙘
C0658
󀙙
C0659
󀙚
C065A
󀙛
C065B
󀙜
C065C
󀙝
C065D
󀙞
C065E
󀙟
C065F
90
A0
󀙠
C0660
󀙡
C0661
󀙢
C0662
󀙣
C0663
󀙤
C0664
󀙥
C0665
󀙦
C0666
󀙧
C0667
󀙨
C0668
󀙩
C0669
󀙪
C066A
󀙫
C066B
󀙬
C066C
󀙭
C066D
󀙮
C066E
󀙯
C066F
A0
B0
󀙰
C0670
󀙱
C0671
󀙲
C0672
󀙳
C0673
󀙴
C0674
󀙵
C0675
󀙶
C0676
󀙷
C0677
󀙸
C0678
󀙹
C0679
󀙺
C067A
󀙻
C067B
󀙼
C067C
󀙽
C067D
󀙾
C067E
󀙿
C067F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]