International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F384B3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄳀
C4CC0
󄳁
C4CC1
󄳂
C4CC2
󄳃
C4CC3
󄳄
C4CC4
󄳅
C4CC5
󄳆
C4CC6
󄳇
C4CC7
󄳈
C4CC8
󄳉
C4CC9
󄳊
C4CCA
󄳋
C4CCB
󄳌
C4CCC
󄳍
C4CCD
󄳎
C4CCE
󄳏
C4CCF
80
90
󄳐
C4CD0
󄳑
C4CD1
󄳒
C4CD2
󄳓
C4CD3
󄳔
C4CD4
󄳕
C4CD5
󄳖
C4CD6
󄳗
C4CD7
󄳘
C4CD8
󄳙
C4CD9
󄳚
C4CDA
󄳛
C4CDB
󄳜
C4CDC
󄳝
C4CDD
󄳞
C4CDE
󄳟
C4CDF
90
A0
󄳠
C4CE0
󄳡
C4CE1
󄳢
C4CE2
󄳣
C4CE3
󄳤
C4CE4
󄳥
C4CE5
󄳦
C4CE6
󄳧
C4CE7
󄳨
C4CE8
󄳩
C4CE9
󄳪
C4CEA
󄳫
C4CEB
󄳬
C4CEC
󄳭
C4CED
󄳮
C4CEE
󄳯
C4CEF
A0
B0
󄳰
C4CF0
󄳱
C4CF1
󄳲
C4CF2
󄳳
C4CF3
󄳴
C4CF4
󄳵
C4CF5
󄳶
C4CF6
󄳷
C4CF7
󄳸
C4CF8
󄳹
C4CF9
󄳺
C4CFA
󄳻
C4CFB
󄳼
C4CFC
󄳽
C4CFD
󄳾
C4CFE
󄳿
C4CFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]