International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F394B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󔲀
D4C80
󔲁
D4C81
󔲂
D4C82
󔲃
D4C83
󔲄
D4C84
󔲅
D4C85
󔲆
D4C86
󔲇
D4C87
󔲈
D4C88
󔲉
D4C89
󔲊
D4C8A
󔲋
D4C8B
󔲌
D4C8C
󔲍
D4C8D
󔲎
D4C8E
󔲏
D4C8F
80
90
󔲐
D4C90
󔲑
D4C91
󔲒
D4C92
󔲓
D4C93
󔲔
D4C94
󔲕
D4C95
󔲖
D4C96
󔲗
D4C97
󔲘
D4C98
󔲙
D4C99
󔲚
D4C9A
󔲛
D4C9B
󔲜
D4C9C
󔲝
D4C9D
󔲞
D4C9E
󔲟
D4C9F
90
A0
󔲠
D4CA0
󔲡
D4CA1
󔲢
D4CA2
󔲣
D4CA3
󔲤
D4CA4
󔲥
D4CA5
󔲦
D4CA6
󔲧
D4CA7
󔲨
D4CA8
󔲩
D4CA9
󔲪
D4CAA
󔲫
D4CAB
󔲬
D4CAC
󔲭
D4CAD
󔲮
D4CAE
󔲯
D4CAF
A0
B0
󔲰
D4CB0
󔲱
D4CB1
󔲲
D4CB2
󔲳
D4CB3
󔲴
D4CB4
󔲵
D4CB5
󔲶
D4CB6
󔲷
D4CB7
󔲸
D4CB8
󔲹
D4CB9
󔲺
D4CBA
󔲻
D4CBB
󔲼
D4CBC
󔲽
D4CBD
󔲾
D4CBE
󔲿
D4CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]