International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F288B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򈲀
88C80
򈲁
88C81
򈲂
88C82
򈲃
88C83
򈲄
88C84
򈲅
88C85
򈲆
88C86
򈲇
88C87
򈲈
88C88
򈲉
88C89
򈲊
88C8A
򈲋
88C8B
򈲌
88C8C
򈲍
88C8D
򈲎
88C8E
򈲏
88C8F
80
90
򈲐
88C90
򈲑
88C91
򈲒
88C92
򈲓
88C93
򈲔
88C94
򈲕
88C95
򈲖
88C96
򈲗
88C97
򈲘
88C98
򈲙
88C99
򈲚
88C9A
򈲛
88C9B
򈲜
88C9C
򈲝
88C9D
򈲞
88C9E
򈲟
88C9F
90
A0
򈲠
88CA0
򈲡
88CA1
򈲢
88CA2
򈲣
88CA3
򈲤
88CA4
򈲥
88CA5
򈲦
88CA6
򈲧
88CA7
򈲨
88CA8
򈲩
88CA9
򈲪
88CAA
򈲫
88CAB
򈲬
88CAC
򈲭
88CAD
򈲮
88CAE
򈲯
88CAF
A0
B0
򈲰
88CB0
򈲱
88CB1
򈲲
88CB2
򈲳
88CB3
򈲴
88CB4
򈲵
88CB5
򈲶
88CB6
򈲷
88CB7
򈲸
88CB8
򈲹
88CB9
򈲺
88CBA
򈲻
88CBB
򈲼
88CBC
򈲽
88CBD
򈲾
88CBE
򈲿
88CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]