International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F289B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򉲀
89C80
򉲁
89C81
򉲂
89C82
򉲃
89C83
򉲄
89C84
򉲅
89C85
򉲆
89C86
򉲇
89C87
򉲈
89C88
򉲉
89C89
򉲊
89C8A
򉲋
89C8B
򉲌
89C8C
򉲍
89C8D
򉲎
89C8E
򉲏
89C8F
80
90
򉲐
89C90
򉲑
89C91
򉲒
89C92
򉲓
89C93
򉲔
89C94
򉲕
89C95
򉲖
89C96
򉲗
89C97
򉲘
89C98
򉲙
89C99
򉲚
89C9A
򉲛
89C9B
򉲜
89C9C
򉲝
89C9D
򉲞
89C9E
򉲟
89C9F
90
A0
򉲠
89CA0
򉲡
89CA1
򉲢
89CA2
򉲣
89CA3
򉲤
89CA4
򉲥
89CA5
򉲦
89CA6
򉲧
89CA7
򉲨
89CA8
򉲩
89CA9
򉲪
89CAA
򉲫
89CAB
򉲬
89CAC
򉲭
89CAD
򉲮
89CAE
򉲯
89CAF
A0
B0
򉲰
89CB0
򉲱
89CB1
򉲲
89CB2
򉲳
89CB3
򉲴
89CB4
򉲵
89CB5
򉲶
89CB6
򉲷
89CB7
򉲸
89CB8
򉲹
89CB9
򉲺
89CBA
򉲻
89CBB
򉲼
89CBC
򉲽
89CBD
򉲾
89CBE
򉲿
89CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]