International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
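
As a minimal sketch of how an alias list like this can be used, the snippet below checks whether a charset label is one of the UTF-8 aliases listed above. The alias strings are copied from this page; the function name `is_utf8_alias` and the case-folding comparison are illustrative assumptions, not ICU's own matching algorithm, which may be more lenient about punctuation.

```python
# Aliases copied from the list on this page.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical helper table: case-folded aliases for case-insensitive lookup.
_FOLDED = {alias.casefold() for alias in UTF8_ALIASES}


def is_utf8_alias(name: str) -> bool:
    """Return True if `name` matches a listed alias, ignoring case.

    Assumption: only case and surrounding whitespace are ignored here;
    hyphens and underscores must match exactly.
    """
    return name.strip().casefold() in _FOLDED


if __name__ == "__main__":
    for candidate in ("utf-8", "IBM-1208", "cp1208", "latin-1"):
        print(candidate, is_utf8_alias(candidate))
```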


Codepage Layout

Currently showing the codepage starting with the bytes F39099

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     D0640 D0641 D0642 D0643 D0644 D0645 D0646 D0647 D0648 D0649 D064A D064B D064C D064D D064E D064F
90     D0650 D0651 D0652 D0653 D0654 D0655 D0656 D0657 D0658 D0659 D065A D065B D065C D065D D065E D065F
A0     D0660 D0661 D0662 D0663 D0664 D0665 D0666 D0667 D0668 D0669 D066A D066B D066C D066D D066E D066F
B0     D0670 D0671 D0672 D0673 D0674 D0675 D0676 D0677 D0678 D0679 D067A D067B D067C D067D D067E D067F
C0-F0  (empty)
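
The layout above can be reproduced with a short sketch: appending one trailing byte to the lead bytes F3 90 99 and decoding the result as UTF-8 yields the code point shown in each cell. The helper names `decode_cell`, `decode_by_hand`, and `is_valid_trail` are hypothetical and use Python's built-in codec rather than ICU.

```python
LEAD = bytes([0xF3, 0x90, 0x99])  # lead bytes shown on this page


def decode_cell(trail: int) -> int:
    """Decode LEAD plus one trailing byte with Python's UTF-8 codec."""
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def decode_by_hand(seq: bytes) -> int:
    """Bit-level decoding of a 4-byte UTF-8 sequence (no validation).

    Layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    """
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


def is_valid_trail(trail: int) -> bool:
    """True if LEAD + trail is a well-formed UTF-8 sequence."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    # Print the four populated rows (80, 90, A0, B0) of the layout.
    for row in range(0x80, 0xC0, 0x10):
        cells = " ".join(f"{decode_cell(row + col):05X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

Only trailing bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.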

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
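
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. The sketch below, using only Python's standard codecs, illustrates why the maximum is 3 rather than 4: a supplementary character takes 4 UTF-8 bytes but also 2 UTF-16 code units. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution character bytes listed above are the UTF-8 form of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")

if __name__ == "__main__":
    for ch in ("A", "\u00e9", "\ufffd", "\U000D0640"):
        print(f"U+{ord(ch):04X}: {bytes_per_uchar(ch)} bytes per UChar")
    print(SUBSTITUTION)
```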

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
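
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below compares that rule with what Python's own UTF-8 codec accepts; the function names `in_representable_set` and `python_can_encode` are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def in_representable_set(cp: int) -> bool:
    """Membership test for the set: any code point except a surrogate."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 codec can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Probe the boundaries of the surrogate range.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD0640):
        print(f"U+{cp:04X}", in_representable_set(cp), python_can_encode(cp))
```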