International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F293B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򓲀
93C80
򓲁
93C81
򓲂
93C82
򓲃
93C83
򓲄
93C84
򓲅
93C85
򓲆
93C86
򓲇
93C87
򓲈
93C88
򓲉
93C89
򓲊
93C8A
򓲋
93C8B
򓲌
93C8C
򓲍
93C8D
򓲎
93C8E
򓲏
93C8F
80
90
򓲐
93C90
򓲑
93C91
򓲒
93C92
򓲓
93C93
򓲔
93C94
򓲕
93C95
򓲖
93C96
򓲗
93C97
򓲘
93C98
򓲙
93C99
򓲚
93C9A
򓲛
93C9B
򓲜
93C9C
򓲝
93C9D
򓲞
93C9E
򓲟
93C9F
90
A0
򓲠
93CA0
򓲡
93CA1
򓲢
93CA2
򓲣
93CA3
򓲤
93CA4
򓲥
93CA5
򓲦
93CA6
򓲧
93CA7
򓲨
93CA8
򓲩
93CA9
򓲪
93CAA
򓲫
93CAB
򓲬
93CAC
򓲭
93CAD
򓲮
93CAE
򓲯
93CAF
A0
B0
򓲰
93CB0
򓲱
93CB1
򓲲
93CB2
򓲳
93CB3
򓲴
93CB4
򓲵
93CB5
򓲶
93CB6
򓲷
93CB7
򓲸
93CB8
򓲹
93CB9
򓲺
93CBA
򓲻
93CBB
򓲼
93CBC
򓲽
93CBD
򓲾
93CBE
򓲿
93CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]