International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B3B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳲀
F3C80
󳲁
F3C81
󳲂
F3C82
󳲃
F3C83
󳲄
F3C84
󳲅
F3C85
󳲆
F3C86
󳲇
F3C87
󳲈
F3C88
󳲉
F3C89
󳲊
F3C8A
󳲋
F3C8B
󳲌
F3C8C
󳲍
F3C8D
󳲎
F3C8E
󳲏
F3C8F
80
90
󳲐
F3C90
󳲑
F3C91
󳲒
F3C92
󳲓
F3C93
󳲔
F3C94
󳲕
F3C95
󳲖
F3C96
󳲗
F3C97
󳲘
F3C98
󳲙
F3C99
󳲚
F3C9A
󳲛
F3C9B
󳲜
F3C9C
󳲝
F3C9D
󳲞
F3C9E
󳲟
F3C9F
90
A0
󳲠
F3CA0
󳲡
F3CA1
󳲢
F3CA2
󳲣
F3CA3
󳲤
F3CA4
󳲥
F3CA5
󳲦
F3CA6
󳲧
F3CA7
󳲨
F3CA8
󳲩
F3CA9
󳲪
F3CAA
󳲫
F3CAB
󳲬
F3CAC
󳲭
F3CAD
󳲮
F3CAE
󳲯
F3CAF
A0
B0
󳲰
F3CB0
󳲱
F3CB1
󳲲
F3CB2
󳲳
F3CB3
󳲴
F3CB4
󳲵
F3CB5
󳲶
F3CB6
󳲷
F3CB7
󳲸
F3CB8
󳲹
F3CB9
󳲺
F3CBA
󳲻
F3CBB
󳲼
F3CBC
󳲽
F3CBD
󳲾
F3CBE
󳲿
F3CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]