International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F39680

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󖀀
D6000
󖀁
D6001
󖀂
D6002
󖀃
D6003
󖀄
D6004
󖀅
D6005
󖀆
D6006
󖀇
D6007
󖀈
D6008
󖀉
D6009
󖀊
D600A
󖀋
D600B
󖀌
D600C
󖀍
D600D
󖀎
D600E
󖀏
D600F
80
90
󖀐
D6010
󖀑
D6011
󖀒
D6012
󖀓
D6013
󖀔
D6014
󖀕
D6015
󖀖
D6016
󖀗
D6017
󖀘
D6018
󖀙
D6019
󖀚
D601A
󖀛
D601B
󖀜
D601C
󖀝
D601D
󖀞
D601E
󖀟
D601F
90
A0
󖀠
D6020
󖀡
D6021
󖀢
D6022
󖀣
D6023
󖀤
D6024
󖀥
D6025
󖀦
D6026
󖀧
D6027
󖀨
D6028
󖀩
D6029
󖀪
D602A
󖀫
D602B
󖀬
D602C
󖀭
D602D
󖀮
D602E
󖀯
D602F
A0
B0
󖀰
D6030
󖀱
D6031
󖀲
D6032
󖀳
D6033
󖀴
D6034
󖀵
D6035
󖀶
D6036
󖀷
D6037
󖀸
D6038
󖀹
D6039
󖀺
D603A
󖀻
D603B
󖀼
D603C
󖀽
D603D
󖀾
D603E
󖀿
D603F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]