International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F394B3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󔳀
D4CC0
󔳁
D4CC1
󔳂
D4CC2
󔳃
D4CC3
󔳄
D4CC4
󔳅
D4CC5
󔳆
D4CC6
󔳇
D4CC7
󔳈
D4CC8
󔳉
D4CC9
󔳊
D4CCA
󔳋
D4CCB
󔳌
D4CCC
󔳍
D4CCD
󔳎
D4CCE
󔳏
D4CCF
80
90
󔳐
D4CD0
󔳑
D4CD1
󔳒
D4CD2
󔳓
D4CD3
󔳔
D4CD4
󔳕
D4CD5
󔳖
D4CD6
󔳗
D4CD7
󔳘
D4CD8
󔳙
D4CD9
󔳚
D4CDA
󔳛
D4CDB
󔳜
D4CDC
󔳝
D4CDD
󔳞
D4CDE
󔳟
D4CDF
90
A0
󔳠
D4CE0
󔳡
D4CE1
󔳢
D4CE2
󔳣
D4CE3
󔳤
D4CE4
󔳥
D4CE5
󔳦
D4CE6
󔳧
D4CE7
󔳨
D4CE8
󔳩
D4CE9
󔳪
D4CEA
󔳫
D4CEB
󔳬
D4CEC
󔳭
D4CED
󔳮
D4CEE
󔳯
D4CEF
A0
B0
󔳰
D4CF0
󔳱
D4CF1
󔳲
D4CF2
󔳳
D4CF3
󔳴
D4CF4
󔳵
D4CF5
󔳶
D4CF6
󔳷
D4CF7
󔳸
D4CF8
󔳹
D4CF9
󔳺
D4CFA
󔳻
D4CFB
󔳼
D4CFC
󔳽
D4CFD
󔳾
D4CFE
󔳿
D4CFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]