International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F291B3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򑳀
91CC0
򑳁
91CC1
򑳂
91CC2
򑳃
91CC3
򑳄
91CC4
򑳅
91CC5
򑳆
91CC6
򑳇
91CC7
򑳈
91CC8
򑳉
91CC9
򑳊
91CCA
򑳋
91CCB
򑳌
91CCC
򑳍
91CCD
򑳎
91CCE
򑳏
91CCF
80
90
򑳐
91CD0
򑳑
91CD1
򑳒
91CD2
򑳓
91CD3
򑳔
91CD4
򑳕
91CD5
򑳖
91CD6
򑳗
91CD7
򑳘
91CD8
򑳙
91CD9
򑳚
91CDA
򑳛
91CDB
򑳜
91CDC
򑳝
91CDD
򑳞
91CDE
򑳟
91CDF
90
A0
򑳠
91CE0
򑳡
91CE1
򑳢
91CE2
򑳣
91CE3
򑳤
91CE4
򑳥
91CE5
򑳦
91CE6
򑳧
91CE7
򑳨
91CE8
򑳩
91CE9
򑳪
91CEA
򑳫
91CEB
򑳬
91CEC
򑳭
91CED
򑳮
91CEE
򑳯
91CEF
A0
B0
򑳰
91CF0
򑳱
91CF1
򑳲
91CF2
򑳳
91CF3
򑳴
91CF4
򑳵
91CF5
򑳶
91CF6
򑳷
91CF7
򑳸
91CF8
򑳹
91CF9
򑳺
91CFA
򑳻
91CFB
򑳼
91CFC
򑳽
91CFD
򑳾
91CFE
򑳿
91CFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]