International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F197B2

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񗲀
57C80
񗲁
57C81
񗲂
57C82
񗲃
57C83
񗲄
57C84
񗲅
57C85
񗲆
57C86
񗲇
57C87
񗲈
57C88
񗲉
57C89
񗲊
57C8A
񗲋
57C8B
񗲌
57C8C
񗲍
57C8D
񗲎
57C8E
񗲏
57C8F
80
90
񗲐
57C90
񗲑
57C91
񗲒
57C92
񗲓
57C93
񗲔
57C94
񗲕
57C95
񗲖
57C96
񗲗
57C97
񗲘
57C98
񗲙
57C99
񗲚
57C9A
񗲛
57C9B
񗲜
57C9C
񗲝
57C9D
񗲞
57C9E
񗲟
57C9F
90
A0
񗲠
57CA0
񗲡
57CA1
񗲢
57CA2
񗲣
57CA3
񗲤
57CA4
񗲥
57CA5
񗲦
57CA6
񗲧
57CA7
񗲨
57CA8
񗲩
57CA9
񗲪
57CAA
񗲫
57CAB
񗲬
57CAC
񗲭
57CAD
񗲮
57CAE
񗲯
57CAF
A0
B0
񗲰
57CB0
񗲱
57CB1
񗲲
57CB2
񗲳
57CB3
񗲴
57CB4
񗲵
57CB5
񗲶
57CB6
񗲷
57CB7
񗲸
57CB8
񗲹
57CB9
񗲺
57CBA
񗲻
57CBB
񗲼
57CBC
񗲽
57CBD
񗲾
57CBE
񗲿
57CBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]