International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F288B3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򈳀
88CC0
򈳁
88CC1
򈳂
88CC2
򈳃
88CC3
򈳄
88CC4
򈳅
88CC5
򈳆
88CC6
򈳇
88CC7
򈳈
88CC8
򈳉
88CC9
򈳊
88CCA
򈳋
88CCB
򈳌
88CCC
򈳍
88CCD
򈳎
88CCE
򈳏
88CCF
80
90
򈳐
88CD0
򈳑
88CD1
򈳒
88CD2
򈳓
88CD3
򈳔
88CD4
򈳕
88CD5
򈳖
88CD6
򈳗
88CD7
򈳘
88CD8
򈳙
88CD9
򈳚
88CDA
򈳛
88CDB
򈳜
88CDC
򈳝
88CDD
򈳞
88CDE
򈳟
88CDF
90
A0
򈳠
88CE0
򈳡
88CE1
򈳢
88CE2
򈳣
88CE3
򈳤
88CE4
򈳥
88CE5
򈳦
88CE6
򈳧
88CE7
򈳨
88CE8
򈳩
88CE9
򈳪
88CEA
򈳫
88CEB
򈳬
88CEC
򈳭
88CED
򈳮
88CEE
򈳯
88CEF
A0
B0
򈳰
88CF0
򈳱
88CF1
򈳲
88CF2
򈳳
88CF3
򈳴
88CF4
򈳵
88CF5
򈳶
88CF6
򈳷
88CF7
򈳸
88CF8
򈳹
88CF9
򈳺
88CFA
򈳻
88CFB
򈳼
88CFC
򈳽
88CFD
򈳾
88CFE
򈳿
88CFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]