International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28CB2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򌲀
8CC80
򌲁
8CC81
򌲂
8CC82
򌲃
8CC83
򌲄
8CC84
򌲅
8CC85
򌲆
8CC86
򌲇
8CC87
򌲈
8CC88
򌲉
8CC89
򌲊
8CC8A
򌲋
8CC8B
򌲌
8CC8C
򌲍
8CC8D
򌲎
8CC8E
򌲏
8CC8F
80
90
򌲐
8CC90
򌲑
8CC91
򌲒
8CC92
򌲓
8CC93
򌲔
8CC94
򌲕
8CC95
򌲖
8CC96
򌲗
8CC97
򌲘
8CC98
򌲙
8CC99
򌲚
8CC9A
򌲛
8CC9B
򌲜
8CC9C
򌲝
8CC9D
򌲞
8CC9E
򌲟
8CC9F
90
A0
򌲠
8CCA0
򌲡
8CCA1
򌲢
8CCA2
򌲣
8CCA3
򌲤
8CCA4
򌲥
8CCA5
򌲦
8CCA6
򌲧
8CCA7
򌲨
8CCA8
򌲩
8CCA9
򌲪
8CCAA
򌲫
8CCAB
򌲬
8CCAC
򌲭
8CCAD
򌲮
8CCAE
򌲯
8CCAF
A0
B0
򌲰
8CCB0
򌲱
8CCB1
򌲲
8CCB2
򌲳
8CCB3
򌲴
8CCB4
򌲵
8CCB5
򌲶
8CCB6
򌲷
8CCB7
򌲸
8CCB8
򌲹
8CCB9
򌲺
8CCBA
򌲻
8CCBB
򌲼
8CCBC
򌲽
8CCBD
򌲾
8CCBE
򌲿
8CCBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]