International Components for Unicode

UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
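
As a rough illustration of how an alias list like this can be used, the sketch below maps every listed alias to the internal converter name. The alias strings are copied from the list above. The `normalize` rule (lowercase, ignore `-`, `_` and spaces) is a simplifying assumption made for this example and is not claimed to be the exact ICU matching rule. The names `normalize` and `resolve` are hypothetical.

```python
# Aliases copied from the list above; everything else is illustrative.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase the name and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Lookup table from normalized alias to the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): CANONICAL for alias in ALIASES}


def resolve(name):
    """Return the internal converter name, or None if the alias is not listed."""
    return ALIAS_TO_CANONICAL.get(normalize(name))
```

For example, `resolve("IBM-1208")` and `resolve("cp1208")` both return `"UTF-8"`, while a name that is not in the list returns `None`.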


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with the bytes F3 83 80

Rows give the high nibble of the final byte, columns the low nibble. Each cell is the Unicode code point (hex) for that byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     C3000 C3001 C3002 C3003 C3004 C3005 C3006 C3007 C3008 C3009 C300A C300B C300C C300D C300E C300F
90     C3010 C3011 C3012 C3013 C3014 C3015 C3016 C3017 C3018 C3019 C301A C301B C301C C301D C301E C301F
A0     C3020 C3021 C3022 C3023 C3024 C3025 C3026 C3027 C3028 C3029 C302A C302B C302C C302D C302E C302F
B0     C3030 C3031 C3032 C3033 C3034 C3035 C3036 C3037 C3038 C3039 C303A C303B C303C C303D C303E C303F
C0-F0  (empty)
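
The mapping in the layout above can be reproduced with plain bit arithmetic. The sketch below decodes each 4-byte sequence F3 83 80 xx (xx from 80 to BF) by hand and cross-checks the result against Python's built-in UTF-8 decoder. The helper names `decode_by_hand` and `code_points_for_lead` are made up for this example.

```python
LEAD = bytes([0xF3, 0x83, 0x80])  # the three leading bytes shown on this page


def decode_by_hand(seq):
    """Decode one 4-byte UTF-8 sequence to a code point.

    The lead byte contributes its low 3 bits, each continuation
    byte contributes its low 6 bits.
    """
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def code_points_for_lead(lead):
    """Code points for every valid final byte (80..BF) after the given lead."""
    return [decode_by_hand(lead + bytes([t])) for t in range(0x80, 0xC0)]


points = code_points_for_lead(LEAD)

# Cross-check the hand decoding against the built-in decoder.
for trail, cp in zip(range(0x80, 0xC0), points):
    assert ord((LEAD + bytes([trail])).decode("utf-8")) == cp

print(f"{points[0]:X}..{points[-1]:X}")  # prints C3000..C303F
```

Final bytes outside 80 to BF are not continuation bytes, which is why only the rows 80, 90, A0 and B0 of the layout are populated.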

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
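
Several of these properties can be checked with a few lines of code. The sketch below assumes that a UChar is one UTF-16 code unit, and uses Python's own codecs rather than ICU, so it illustrates the numbers without being a test of the ICU converter itself. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U000C3000"]
ratios = [bytes_per_uchar(ch) for ch in samples]

# A 4-byte sequence needs two UTF-16 code units, so its ratio is 2,
# and the largest ratio comes from 3-byte sequences.
assert min(ratios) == 1.0
assert max(ratios) == 3.0

# The substitution character bytes are U+FFFD encoded in UTF-8.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"

# ASCII compatibility: bytes 20..7E decode to the same code points.
assert all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
```

This is why the maximum is 3 and not 4: the only 4-byte sequences encode supplementary characters, which occupy two UChars.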

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
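
The set pattern above excludes only the surrogate range U+D800 to U+DFFF. A minimal sketch of the same test, compared against Python's strict UTF-8 encoder (again not ICU itself), might look like this. The function names `is_representable` and `encodes` are made up for the example.

```python
def is_representable(cp):
    """True if the code point is in [^\\uD800-\\uDFFF], i.e. not a surrogate."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary values around the surrogate range and at both ends of Unicode.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC3000, 0x10FFFF):
    assert is_representable(cp) == encodes(cp)
```

Every other code point, including unassigned ones such as U+C3000 from the layout shown earlier, is representable.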