International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B5B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𵲀
35C80
𵲁
35C81
𵲂
35C82
𵲃
35C83
𵲄
35C84
𵲅
35C85
𵲆
35C86
𵲇
35C87
𵲈
35C88
𵲉
35C89
𵲊
35C8A
𵲋
35C8B
𵲌
35C8C
𵲍
35C8D
𵲎
35C8E
𵲏
35C8F
80
90
𵲐
35C90
𵲑
35C91
𵲒
35C92
𵲓
35C93
𵲔
35C94
𵲕
35C95
𵲖
35C96
𵲗
35C97
𵲘
35C98
𵲙
35C99
𵲚
35C9A
𵲛
35C9B
𵲜
35C9C
𵲝
35C9D
𵲞
35C9E
𵲟
35C9F
90
A0
𵲠
35CA0
𵲡
35CA1
𵲢
35CA2
𵲣
35CA3
𵲤
35CA4
𵲥
35CA5
𵲦
35CA6
𵲧
35CA7
𵲨
35CA8
𵲩
35CA9
𵲪
35CAA
𵲫
35CAB
𵲬
35CAC
𵲭
35CAD
𵲮
35CAE
𵲯
35CAF
A0
B0
𵲰
35CB0
𵲱
35CB1
𵲲
35CB2
𵲳
35CB3
𵲴
35CB4
𵲵
35CB5
𵲶
35CB6
𵲷
35CB7
𵲸
35CB8
𵲹
35CB9
𵲺
35CBA
𵲻
35CBB
𵲼
35CBC
𵲽
35CBD
𵲾
35CBE
𵲿
35CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]