International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
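
As an illustration of how an alias list like this can be used, here is a minimal sketch that maps any listed alias back to the internal converter name. The helper `canonical_name` is hypothetical, and the plain case-insensitive match is an assumption made for this example; it is not ICU's own alias-matching rule.

```python
# Aliases copied from the list above; the internal converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively and nothing more.
_LOOKUP = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for a known alias, else None."""
    return _LOOKUP.get(name.strip().lower())


print(canonical_name("CP1208"))
print(canonical_name("latin-1"))
```

Names outside the list return `None`, so a caller can tell "unknown alias" apart from a successful lookup.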


Codepage Layout

Currently showing the codepage starting with the bytes F3A0B2

Each cell gives the Unicode code point (hex) for the final byte formed by the row and column labels.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80   E0C80 E0C81 E0C82 E0C83 E0C84 E0C85 E0C86 E0C87 E0C88 E0C89 E0C8A E0C8B E0C8C E0C8D E0C8E E0C8F
90   E0C90 E0C91 E0C92 E0C93 E0C94 E0C95 E0C96 E0C97 E0C98 E0C99 E0C9A E0C9B E0C9C E0C9D E0C9E E0C9F
A0   E0CA0 E0CA1 E0CA2 E0CA3 E0CA4 E0CA5 E0CA6 E0CA7 E0CA8 E0CA9 E0CAA E0CAB E0CAC E0CAD E0CAE E0CAF
B0   E0CB0 E0CB1 E0CB2 E0CB3 E0CB4 E0CB5 E0CB6 E0CB7 E0CB8 E0CB9 E0CBA E0CBB E0CBC E0CBD E0CBE E0CBF

Rows 00-70 and C0-F0 are empty.
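
The layout above can be reproduced with a short sketch. It uses Python's built-in UTF-8 codec rather than ICU, and the helper names `code_point_for` and `manual_code_point` are made up for this example. The second helper shows the bit arithmetic for a four-byte sequence so the mapping can be checked by hand.

```python
LEAD = bytes.fromhex("F3A0B2")  # the three leading bytes shown in the layout


def code_point_for(trail: int) -> int:
    """Decode LEAD plus one trailing byte and return the code point."""
    if not 0x80 <= trail <= 0xBF:
        raise ValueError("not a valid UTF-8 trailing byte")
    return ord((LEAD + bytes([trail])).decode("utf-8"))


def manual_code_point(seq: bytes) -> int:
    """Same result from bit arithmetic on a 4-byte UTF-8 sequence."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trailing byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


first = code_point_for(0x80)
last = code_point_for(0xBF)
print(f"{first:X} {last:X}")  # prints "E0C80 E0CBF"
```

Only final bytes 80 through BF are valid trailing bytes, which is why `code_point_for` rejects everything else and why the other rows of the layout have no entries.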

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
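
Several of these properties can be checked with a rough sketch using Python's built-in codecs, not ICU itself. It assumes a UChar is one UTF-16 code unit, and the helper `utf8_bytes_per_utf16_unit` is hypothetical.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution bytes listed above


def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
print(SUBSTITUTION.decode("utf-8") == "\uFFFD")

# Bytes 0x20-0x7E decode to the same characters as in ASCII.
ascii_bytes = bytes(range(0x20, 0x7F))
print(ascii_bytes.decode("utf-8") == ascii_bytes.decode("ascii"))

# 1 byte for ASCII, 3 bytes for U+20AC, 4 bytes over 2 code units for U+E0C80.
for ch in ("A", "\u20AC", "\U000E0C80"):
    print(f"U+{ord(ch):04X}: {utf8_bytes_per_utf16_unit(ch)}")
```

A supplementary character such as U+E0C80 takes four UTF-8 bytes but two UTF-16 code units, so the per-UChar ratio stays within the listed range of 1 to 3.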

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
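
The set pattern excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of the same check follows, using Python's strict UTF-8 encoder as a stand-in for the converter; `is_representable` is a name made up for this example.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict encoder rejects lone surrogates.
        return False


surrogates = range(0xD800, 0xE000)
print(any(is_representable(cp) for cp in surrogates))
print(is_representable(0xD7FF), is_representable(0xE000), is_representable(0x10FFFF))
```

The code points on either side of the surrogate block, and the last code point U+10FFFF, all encode without error.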