International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A0B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򠶀
A0D80
򠶁
A0D81
򠶂
A0D82
򠶃
A0D83
򠶄
A0D84
򠶅
A0D85
򠶆
A0D86
򠶇
A0D87
򠶈
A0D88
򠶉
A0D89
򠶊
A0D8A
򠶋
A0D8B
򠶌
A0D8C
򠶍
A0D8D
򠶎
A0D8E
򠶏
A0D8F
80
90
򠶐
A0D90
򠶑
A0D91
򠶒
A0D92
򠶓
A0D93
򠶔
A0D94
򠶕
A0D95
򠶖
A0D96
򠶗
A0D97
򠶘
A0D98
򠶙
A0D99
򠶚
A0D9A
򠶛
A0D9B
򠶜
A0D9C
򠶝
A0D9D
򠶞
A0D9E
򠶟
A0D9F
90
A0
򠶠
A0DA0
򠶡
A0DA1
򠶢
A0DA2
򠶣
A0DA3
򠶤
A0DA4
򠶥
A0DA5
򠶦
A0DA6
򠶧
A0DA7
򠶨
A0DA8
򠶩
A0DA9
򠶪
A0DAA
򠶫
A0DAB
򠶬
A0DAC
򠶭
A0DAD
򠶮
A0DAE
򠶯
A0DAF
A0
B0
򠶰
A0DB0
򠶱
A0DB1
򠶲
A0DB2
򠶳
A0DB3
򠶴
A0DB4
򠶵
A0DB5
򠶶
A0DB6
򠶷
A0DB7
򠶸
A0DB8
򠶹
A0DB9
򠶺
A0DBA
򠶻
A0DBB
򠶼
A0DBC
򠶽
A0DBD
򠶾
A0DBE
򠶿
A0DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]