International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
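
As an illustration of how an alias list like this can be used, the following is a minimal sketch of a lookup that maps any listed alias to the internal converter name. The names `normalize` and `resolve` are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for the sketch, not a description of ICU's actual alias matching.

```python
# Aliases copied from the list above.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

INTERNAL_NAME = "UTF-8"


def normalize(name: str) -> str:
    """Fold a converter name for loose comparison (assumed rule)."""
    return name.lower().replace("-", "").replace("_", "").replace(" ", "")


# Map every folded alias to the internal converter name.
_LOOKUP = {normalize(alias): INTERNAL_NAME for alias in ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return _LOOKUP.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "Windows-65001", "latin-1"):
        print(candidate, "->", resolve(candidate))
```

Folding names before comparison keeps the table small, at the cost of treating spellings such as `utf8` and `UTF-8` as identical.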


Codepage Layout

Currently showing the codepage starting with the bytes F38780

Rows give the high nibble of the final byte, columns the low nibble. Each cell is the Unicode code point (hexadecimal) for that byte sequence.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00   (empty)
10   (empty)
20   (empty)
30   (empty)
40   (empty)
50   (empty)
60   (empty)
70   (empty)
80   C7000 C7001 C7002 C7003 C7004 C7005 C7006 C7007 C7008 C7009 C700A C700B C700C C700D C700E C700F
90   C7010 C7011 C7012 C7013 C7014 C7015 C7016 C7017 C7018 C7019 C701A C701B C701C C701D C701E C701F
A0   C7020 C7021 C7022 C7023 C7024 C7025 C7026 C7027 C7028 C7029 C702A C702B C702C C702D C702E C702F
B0   C7030 C7031 C7032 C7033 C7034 C7035 C7036 C7037 C7038 C7039 C703A C703B C703C C703D C703E C703F
C0   (empty)
D0   (empty)
E0   (empty)
F0   (empty)
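
The code points in this layout follow from the standard UTF-8 bit layout for four-byte sequences. The sketch below recomputes them from the lead bytes F3 87 80 and cross-checks one value against Python's built-in decoder. The function names `decode_4byte` and `page` are hypothetical and exist only for this illustration.

```python
def decode_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence.

    Only the bit patterns are checked here; this is not a full validator.
    """
    if b0 & 0xF8 != 0xF0:
        raise ValueError("first byte is not a four-byte lead byte")
    for trail in (b1, b2, b3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trail bytes must be in the range 80..BF")
    # 3 bits from the lead byte, then 6 bits from each trail byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


LEAD = (0xF3, 0x87, 0x80)  # the bytes this page starts with


def page(lead=LEAD):
    """Code points for every valid final byte (80..BF) after the lead bytes."""
    return [decode_4byte(*lead, last) for last in range(0x80, 0xC0)]


if __name__ == "__main__":
    cps = page()
    print(f"first: U+{cps[0]:04X}, last: U+{cps[-1]:04X}, count: {len(cps)}")
    # Cross-check one cell against the built-in decoder.
    assert bytes([0xF3, 0x87, 0x80, 0x9A]).decode("utf-8") == chr(0xC701A)
```

Final bytes outside 80..BF are not valid trail bytes, which is why only rows 80 through B0 of the layout are populated.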

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
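
The byte counts above are per UChar, which in ICU is a UTF-16 code unit rather than a whole code point. The following sketch, using only Python's standard codecs, shows why the maximum is 3 and not 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. The helper names and the sample characters are choices made for this illustration.

```python
def utf8_bytes(ch: str) -> int:
    """Number of bytes the character occupies in UTF-8."""
    return len(ch.encode("utf-8"))


def utf16_units(ch: str) -> int:
    """Number of UTF-16 code units (UChars) the character occupies."""
    return len(ch.encode("utf-16-le")) // 2


def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for one character."""
    return utf8_bytes(ch) / utf16_units(ch)


# One sample per UTF-8 sequence length: 1, 2, 3, and 4 bytes.
SAMPLES = ["A", "\u00e9", "\u20ac", "\U000C7000"]

if __name__ == "__main__":
    for ch in SAMPLES:
        print(f"U+{ord(ch):04X}: {utf8_bytes(ch)} bytes, "
              f"{utf16_units(ch)} UChar(s), ratio {bytes_per_uchar(ch)}")
    # The substitution character bytes decode to U+FFFD.
    assert b"\xEF\xBF\xBD".decode("utf-8") == "\ufffd"
```

The largest ratio among the samples comes from the three-byte case, since the four-byte case spreads its bytes over a surrogate pair.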

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
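
The set above excludes only the surrogate range U+D800 to U+DFFF. As a minimal sketch, the membership test can be written directly and compared with what Python's own UTF-8 encoder accepts. The function names `is_representable` and `encodes` are hypothetical, and the comparison reflects Python's strict encoder, not ICU's converter.

```python
def is_representable(cp: int) -> bool:
    """True if the code point is in [^\\uD800-\\uDFFF] within U+0000..U+10FFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values on either side of the surrogate range, plus one
    # code point from the layout shown earlier on this page.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC7000):
        print(f"U+{cp:04X}: set={is_representable(cp)} encoder={encodes(cp)}")
```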