International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39080

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󐀀
D0000
󐀁
D0001
󐀂
D0002
󐀃
D0003
󐀄
D0004
󐀅
D0005
󐀆
D0006
󐀇
D0007
󐀈
D0008
󐀉
D0009
󐀊
D000A
󐀋
D000B
󐀌
D000C
󐀍
D000D
󐀎
D000E
󐀏
D000F
80
90
󐀐
D0010
󐀑
D0011
󐀒
D0012
󐀓
D0013
󐀔
D0014
󐀕
D0015
󐀖
D0016
󐀗
D0017
󐀘
D0018
󐀙
D0019
󐀚
D001A
󐀛
D001B
󐀜
D001C
󐀝
D001D
󐀞
D001E
󐀟
D001F
90
A0
󐀠
D0020
󐀡
D0021
󐀢
D0022
󐀣
D0023
󐀤
D0024
󐀥
D0025
󐀦
D0026
󐀧
D0027
󐀨
D0028
󐀩
D0029
󐀪
D002A
󐀫
D002B
󐀬
D002C
󐀭
D002D
󐀮
D002E
󐀯
D002F
A0
B0
󐀰
D0030
󐀱
D0031
󐀲
D0032
󐀳
D0033
󐀴
D0034
󐀵
D0035
󐀶
D0036
󐀷
D0037
󐀸
D0038
󐀹
D0039
󐀺
D003A
󐀻
D003B
󐀼
D003C
󐀽
D003D
󐀾
D003E
󐀿
D003F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]