International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
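
As an illustration of how an alias list like this can be used, the following is a minimal Python sketch of a lookup table that maps each listed alias to the internal converter name. The `normalize` and `resolve` helpers are hypothetical names introduced here, not ICU API. The matching rule (ignore case, hyphens, underscores, and spaces) is an assumption loosely modeled on lenient converter-name comparison; ICU's own rules may differ in detail.

```python
# Aliases copied from the table above; CANONICAL is the internal converter name.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Hypothetical lenient key: lowercase, with '-', '_' and spaces removed."""
    return "".join(c for c in name.lower() if c not in "-_ ")


# Map every normalized alias to the canonical name.
LOOKUP = {normalize(alias): CANONICAL for alias in ALIASES}


def resolve(name):
    """Return the canonical converter name, or None if the alias is unknown."""
    return LOOKUP.get(normalize(name))


print(resolve("utf8"))           # matches the "UTF-8" alias after normalization
print(resolve("Windows_65001"))  # matches "windows-65001"
print(resolve("latin1"))         # not in this table
```

Normalizing both the stored aliases and the query keeps the lookup a single dictionary access, at the cost of treating names such as `cp-1208` and `cp1208` as identical.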


Codepage Layout

Currently showing the code points whose UTF-8 encoding starts with the bytes F4 87 B2. The row label plus the column label gives the value of the final byte.

Rows 00 through 70 are empty.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  107C80 107C81 107C82 107C83 107C84 107C85 107C86 107C87 107C88 107C89 107C8A 107C8B 107C8C 107C8D 107C8E 107C8F
90  107C90 107C91 107C92 107C93 107C94 107C95 107C96 107C97 107C98 107C99 107C9A 107C9B 107C9C 107C9D 107C9E 107C9F
A0  107CA0 107CA1 107CA2 107CA3 107CA4 107CA5 107CA6 107CA7 107CA8 107CA9 107CAA 107CAB 107CAC 107CAD 107CAE 107CAF
B0  107CB0 107CB1 107CB2 107CB3 107CB4 107CB5 107CB6 107CB7 107CB8 107CB9 107CBA 107CBB 107CBC 107CBD 107CBE 107CBF

Rows C0 through F0 are empty.
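
The grid above can be reproduced with a short Python sketch. It appends each trail byte from 80 to BF to the prefix F4 87 B2, decodes the result with Python's built-in UTF-8 codec, and cross-checks the code point against the standard 4-byte UTF-8 bit layout. The helper name `decode_utf8_4byte` is introduced here for illustration only, and it performs no validation of its input.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence (no validation)."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte 11110xxx
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trail byte 10xxxxxx
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


PREFIX = bytes.fromhex("F487B2")

# Map each final byte (80..BF) to the code point it completes.
grid = {}
for trail in range(0x80, 0xC0):
    seq = PREFIX + bytes([trail])
    cp = ord(seq.decode("utf-8"))           # decode with the built-in codec
    assert cp == decode_utf8_4byte(*seq)    # cross-check the manual arithmetic
    grid[trail] = cp

print(f"{grid[0x80]:06X} .. {grid[0xBF]:06X}")  # prints "107C80 .. 107CBF"
```

Only final bytes 80 through BF appear because every byte after the lead byte of a UTF-8 sequence has the form 10xxxxxx, which is why the other rows of the grid are empty.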

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
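
Several of these properties can be checked with a small Python sketch, assuming a UChar is one UTF-16 code unit. The sample characters and the helper name `bytes_per_uchar` are choices made here for illustration. The maximum comes out as 3 rather than 4 because a 4-byte UTF-8 sequence corresponds to a surrogate pair, which is two UTF-16 code units.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution bytes listed in the table above


def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U00107C80"]
ratios = [bytes_per_uchar(c) for c in samples]
print(min(ratios), max(ratios))  # 4 bytes over 2 code units gives 2.0, not 4

# The substitution bytes decode to U+FFFD REPLACEMENT CHARACTER.
print(SUBSTITUTION.decode("utf-8") == "\uFFFD")

# ASCII compatibility: each byte in 20..7E encodes the same-valued code point.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)
print(ascii_compatible)
```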

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
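
The set above excludes only the surrogate range. As a rough check, the following Python sketch compares that set against the behavior of Python's own strict UTF-8 encoder, which also refuses lone surrogates. The function names are hypothetical, and the comparison says nothing about how ICU itself implements the check.

```python
def in_representable_set(cp):
    """Membership test for [^\\uD800-\\uDFFF] over code points 0..10FFFF."""
    return not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """True if Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate block, plus both ends of the code space.
boundaries = [0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x107C80, 0x10FFFF]
for cp in boundaries:
    assert in_representable_set(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}", in_representable_set(cp))
```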