International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F192B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񒲀
52C80
񒲁
52C81
񒲂
52C82
񒲃
52C83
񒲄
52C84
񒲅
52C85
񒲆
52C86
񒲇
52C87
񒲈
52C88
񒲉
52C89
񒲊
52C8A
񒲋
52C8B
񒲌
52C8C
񒲍
52C8D
񒲎
52C8E
񒲏
52C8F
80
90
񒲐
52C90
񒲑
52C91
񒲒
52C92
񒲓
52C93
񒲔
52C94
񒲕
52C95
񒲖
52C96
񒲗
52C97
񒲘
52C98
񒲙
52C99
񒲚
52C9A
񒲛
52C9B
񒲜
52C9C
񒲝
52C9D
񒲞
52C9E
񒲟
52C9F
90
A0
񒲠
52CA0
񒲡
52CA1
񒲢
52CA2
񒲣
52CA3
񒲤
52CA4
񒲥
52CA5
񒲦
52CA6
񒲧
52CA7
񒲨
52CA8
񒲩
52CA9
񒲪
52CAA
񒲫
52CAB
񒲬
52CAC
񒲭
52CAD
񒲮
52CAE
񒲯
52CAF
A0
B0
񒲰
52CB0
񒲱
52CB1
񒲲
52CB2
񒲳
52CB3
񒲴
52CB4
񒲵
52CB5
񒲶
52CB6
񒲷
52CB7
񒲸
52CB8
񒲹
52CB9
񒲺
52CBA
񒲻
52CBB
񒲼
52CBC
񒲽
52CBD
񒲾
52CBE
񒲿
52CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]