International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F297A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򗥀
97940
򗥁
97941
򗥂
97942
򗥃
97943
򗥄
97944
򗥅
97945
򗥆
97946
򗥇
97947
򗥈
97948
򗥉
97949
򗥊
9794A
򗥋
9794B
򗥌
9794C
򗥍
9794D
򗥎
9794E
򗥏
9794F
80
90
򗥐
97950
򗥑
97951
򗥒
97952
򗥓
97953
򗥔
97954
򗥕
97955
򗥖
97956
򗥗
97957
򗥘
97958
򗥙
97959
򗥚
9795A
򗥛
9795B
򗥜
9795C
򗥝
9795D
򗥞
9795E
򗥟
9795F
90
A0
򗥠
97960
򗥡
97961
򗥢
97962
򗥣
97963
򗥤
97964
򗥥
97965
򗥦
97966
򗥧
97967
򗥨
97968
򗥩
97969
򗥪
9796A
򗥫
9796B
򗥬
9796C
򗥭
9796D
򗥮
9796E
򗥯
9796F
A0
B0
򗥰
97970
򗥱
97971
򗥲
97972
򗥳
97973
򗥴
97974
򗥵
97975
򗥶
97976
򗥷
97977
򗥸
97978
򗥹
97979
򗥺
9797A
򗥻
9797B
򗥼
9797C
򗥽
9797D
򗥾
9797E
򗥿
9797F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]