International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0A0B3

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (empty)
80     𠳀     𠳁     𠳂     𠳃     𠳄     𠳅     𠳆     𠳇     𠳈     𠳉     𠳊     𠳋     𠳌     𠳍     𠳎     𠳏
       20CC0  20CC1  20CC2  20CC3  20CC4  20CC5  20CC6  20CC7  20CC8  20CC9  20CCA  20CCB  20CCC  20CCD  20CCE  20CCF
90     𠳐     𠳑     𠳒     𠳓     𠳔     𠳕     𠳖     𠳗     𠳘     𠳙     𠳚     𠳛     𠳜     𠳝     𠳞     𠳟
       20CD0  20CD1  20CD2  20CD3  20CD4  20CD5  20CD6  20CD7  20CD8  20CD9  20CDA  20CDB  20CDC  20CDD  20CDE  20CDF
A0     𠳠     𠳡     𠳢     𠳣     𠳤     𠳥     𠳦     𠳧     𠳨     𠳩     𠳪     𠳫     𠳬     𠳭     𠳮     𠳯
       20CE0  20CE1  20CE2  20CE3  20CE4  20CE5  20CE6  20CE7  20CE8  20CE9  20CEA  20CEB  20CEC  20CED  20CEE  20CEF
B0     𠳰     𠳱     𠳲     𠳳     𠳴     𠳵     𠳶     𠳷     𠳸     𠳹     𠳺     𠳻     𠳼     𠳽     𠳾     𠳿
       20CF0  20CF1  20CF2  20CF3  20CF4  20CF5  20CF6  20CF7  20CF8  20CF9  20CFA  20CFB  20CFC  20CFD  20CFE  20CFF
C0-F0  (empty)
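
The layout can be cross-checked by decoding the byte sequences directly. The following is a minimal sketch in Python, not ICU code: it relies on Python's built-in UTF-8 codec, and the names `LEAD`, `code_point_by_hand`, and `decode_page` are hypothetical helpers introduced only for illustration. It appends each trailing byte from 80 to BF to the lead bytes F0 A0 B3 and compares the codec's result with the explicit bit arithmetic for a 4-byte sequence.

```python
# Lead bytes of the page shown in the layout (F0 A0 B3).
LEAD = bytes.fromhex("F0A0B3")


def code_point_by_hand(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence with explicit bit arithmetic.

    Bit pattern: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    """
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def decode_page(lead: bytes = LEAD) -> dict:
    """Map each trailing byte 0x80..0xBF to the code point it completes."""
    page = {}
    for trail in range(0x80, 0xC0):
        seq = lead + bytes([trail])
        cp = ord(seq.decode("utf-8"))
        # The codec and the manual arithmetic should agree.
        assert cp == code_point_by_hand(seq)
        page[trail] = cp
    return page


page = decode_page()
print(f"{page[0x80]:X} .. {page[0xBF]:X}")
```

Only trailing bytes 80 through BF are tried because those are the continuation bytes UTF-8 allows in this position, which matches the four populated rows of the layout.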

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
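
The byte counts are stated per UChar, which in ICU4C is a 16-bit UTF-16 code unit, so a 4-byte UTF-8 sequence that becomes a surrogate pair still averages 2 bytes per UChar. The sketch below illustrates this with Python's own codecs rather than ICU; the helper name `utf8_bytes_per_utf16_unit` is hypothetical. It also shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. Python's `errors="replace"` handler is used only as an analogy and is not claimed to match ICU's substitution behavior in every case.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 supplement, a 3-byte BMP character, and a supplementary one.
samples = ["A", "\u00e9", "\u20ac", "\U00020CC0"]
ratios = [utf8_bytes_per_utf16_unit(c) for c in samples]
print(ratios)

# The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
sub_char = SUBSTITUTION.decode("utf-8")
print(f"U+{ord(sub_char):04X}")

# An invalid byte (0xFF never occurs in UTF-8) is replaced by U+FFFD.
replaced = b"abc\xffdef".decode("utf-8", errors="replace")
print(replaced.encode("utf-8"))
```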

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
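
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough illustration, and assuming Python's strict UTF-8 encoder as a stand-in for the converter, the sketch below checks that a plain range test agrees with what the encoder accepts. Both function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """Range test: any Unicode code point outside U+D800..U+DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def strict_utf8_accepts(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates.
        return False


# Boundary values around the surrogate range plus a few ordinary code points.
probes = (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x20CC0, 0x10FFFF)
results = {cp: (is_representable(cp), strict_utf8_accepts(cp)) for cp in probes}
for cp, (by_range, by_codec) in results.items():
    print(f"U+{cp:04X} range={by_range} codec={by_codec}")
```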