International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A780

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򧀀
A7000
򧀁
A7001
򧀂
A7002
򧀃
A7003
򧀄
A7004
򧀅
A7005
򧀆
A7006
򧀇
A7007
򧀈
A7008
򧀉
A7009
򧀊
A700A
򧀋
A700B
򧀌
A700C
򧀍
A700D
򧀎
A700E
򧀏
A700F
80
90
򧀐
A7010
򧀑
A7011
򧀒
A7012
򧀓
A7013
򧀔
A7014
򧀕
A7015
򧀖
A7016
򧀗
A7017
򧀘
A7018
򧀙
A7019
򧀚
A701A
򧀛
A701B
򧀜
A701C
򧀝
A701D
򧀞
A701E
򧀟
A701F
90
A0
򧀠
A7020
򧀡
A7021
򧀢
A7022
򧀣
A7023
򧀤
A7024
򧀥
A7025
򧀦
A7026
򧀧
A7027
򧀨
A7028
򧀩
A7029
򧀪
A702A
򧀫
A702B
򧀬
A702C
򧀭
A702D
򧀮
A702E
򧀯
A702F
A0
B0
򧀰
A7030
򧀱
A7031
򧀲
A7032
򧀳
A7033
򧀴
A7034
򧀵
A7035
򧀶
A7036
򧀷
A7037
򧀸
A7038
򧀹
A7039
򧀺
A703A
򧀻
A703B
򧀼
A703C
򧀽
A703D
򧀾
A703E
򧀿
A703F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]