International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A9B3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󩳀
E9CC0
󩳁
E9CC1
󩳂
E9CC2
󩳃
E9CC3
󩳄
E9CC4
󩳅
E9CC5
󩳆
E9CC6
󩳇
E9CC7
󩳈
E9CC8
󩳉
E9CC9
󩳊
E9CCA
󩳋
E9CCB
󩳌
E9CCC
󩳍
E9CCD
󩳎
E9CCE
󩳏
E9CCF
80
90
󩳐
E9CD0
󩳑
E9CD1
󩳒
E9CD2
󩳓
E9CD3
󩳔
E9CD4
󩳕
E9CD5
󩳖
E9CD6
󩳗
E9CD7
󩳘
E9CD8
󩳙
E9CD9
󩳚
E9CDA
󩳛
E9CDB
󩳜
E9CDC
󩳝
E9CDD
󩳞
E9CDE
󩳟
E9CDF
90
A0
󩳠
E9CE0
󩳡
E9CE1
󩳢
E9CE2
󩳣
E9CE3
󩳤
E9CE4
󩳥
E9CE5
󩳦
E9CE6
󩳧
E9CE7
󩳨
E9CE8
󩳩
E9CE9
󩳪
E9CEA
󩳫
E9CEB
󩳬
E9CEC
󩳭
E9CED
󩳮
E9CEE
󩳯
E9CEF
A0
B0
󩳰
E9CF0
󩳱
E9CF1
󩳲
E9CF2
󩳳
E9CF3
󩳴
E9CF4
󩳵
E9CF5
󩳶
E9CF6
󩳷
E9CF7
󩳸
E9CF8
󩳹
E9CF9
󩳺
E9CFA
󩳻
E9CFB
󩳼
E9CFC
󩳽
E9CFD
󩳾
E9CFE
󩳿
E9CFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]