International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F393B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󓲀
D3C80
󓲁
D3C81
󓲂
D3C82
󓲃
D3C83
󓲄
D3C84
󓲅
D3C85
󓲆
D3C86
󓲇
D3C87
󓲈
D3C88
󓲉
D3C89
󓲊
D3C8A
󓲋
D3C8B
󓲌
D3C8C
󓲍
D3C8D
󓲎
D3C8E
󓲏
D3C8F
80
90
󓲐
D3C90
󓲑
D3C91
󓲒
D3C92
󓲓
D3C93
󓲔
D3C94
󓲕
D3C95
󓲖
D3C96
󓲗
D3C97
󓲘
D3C98
󓲙
D3C99
󓲚
D3C9A
󓲛
D3C9B
󓲜
D3C9C
󓲝
D3C9D
󓲞
D3C9E
󓲟
D3C9F
90
A0
󓲠
D3CA0
󓲡
D3CA1
󓲢
D3CA2
󓲣
D3CA3
󓲤
D3CA4
󓲥
D3CA5
󓲦
D3CA6
󓲧
D3CA7
󓲨
D3CA8
󓲩
D3CA9
󓲪
D3CAA
󓲫
D3CAB
󓲬
D3CAC
󓲭
D3CAD
󓲮
D3CAE
󓲯
D3CAF
A0
B0
󓲰
D3CB0
󓲱
D3CB1
󓲲
D3CB2
󓲳
D3CB3
󓲴
D3CB4
󓲵
D3CB5
󓲶
D3CB6
󓲷
D3CB7
󓲸
D3CB8
󓲹
D3CB9
󓲺
D3CBA
󓲻
D3CBB
󓲼
D3CBC
󓲽
D3CBD
󓲾
D3CBE
󓲿
D3CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]