International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F298B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򘲀
98C80
򘲁
98C81
򘲂
98C82
򘲃
98C83
򘲄
98C84
򘲅
98C85
򘲆
98C86
򘲇
98C87
򘲈
98C88
򘲉
98C89
򘲊
98C8A
򘲋
98C8B
򘲌
98C8C
򘲍
98C8D
򘲎
98C8E
򘲏
98C8F
80
90
򘲐
98C90
򘲑
98C91
򘲒
98C92
򘲓
98C93
򘲔
98C94
򘲕
98C95
򘲖
98C96
򘲗
98C97
򘲘
98C98
򘲙
98C99
򘲚
98C9A
򘲛
98C9B
򘲜
98C9C
򘲝
98C9D
򘲞
98C9E
򘲟
98C9F
90
A0
򘲠
98CA0
򘲡
98CA1
򘲢
98CA2
򘲣
98CA3
򘲤
98CA4
򘲥
98CA5
򘲦
98CA6
򘲧
98CA7
򘲨
98CA8
򘲩
98CA9
򘲪
98CAA
򘲫
98CAB
򘲬
98CAC
򘲭
98CAD
򘲮
98CAE
򘲯
98CAF
A0
B0
򘲰
98CB0
򘲱
98CB1
򘲲
98CB2
򘲳
98CB3
򘲴
98CB4
򘲵
98CB5
򘲶
98CB6
򘲷
98CB7
򘲸
98CB8
򘲹
98CB9
򘲺
98CBA
򘲻
98CBB
򘲼
98CBC
򘲽
98CBD
򘲾
98CBE
򘲿
98CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]