International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F299B6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򙶀
99D80
򙶁
99D81
򙶂
99D82
򙶃
99D83
򙶄
99D84
򙶅
99D85
򙶆
99D86
򙶇
99D87
򙶈
99D88
򙶉
99D89
򙶊
99D8A
򙶋
99D8B
򙶌
99D8C
򙶍
99D8D
򙶎
99D8E
򙶏
99D8F
80
90
򙶐
99D90
򙶑
99D91
򙶒
99D92
򙶓
99D93
򙶔
99D94
򙶕
99D95
򙶖
99D96
򙶗
99D97
򙶘
99D98
򙶙
99D99
򙶚
99D9A
򙶛
99D9B
򙶜
99D9C
򙶝
99D9D
򙶞
99D9E
򙶟
99D9F
90
A0
򙶠
99DA0
򙶡
99DA1
򙶢
99DA2
򙶣
99DA3
򙶤
99DA4
򙶥
99DA5
򙶦
99DA6
򙶧
99DA7
򙶨
99DA8
򙶩
99DA9
򙶪
99DAA
򙶫
99DAB
򙶬
99DAC
򙶭
99DAD
򙶮
99DAE
򙶯
99DAF
A0
B0
򙶰
99DB0
򙶱
99DB1
򙶲
99DB2
򙶳
99DB3
򙶴
99DB4
򙶵
99DB5
򙶶
99DB6
򙶷
99DB7
򙶸
99DB8
򙶹
99DB9
򙶺
99DBA
򙶻
99DBB
򙶼
99DBC
򙶽
99DBD
򙶾
99DBE
򙶿
99DBF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]