International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B08C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𰌀
30300
𰌁
30301
𰌂
30302
𰌃
30303
𰌄
30304
𰌅
30305
𰌆
30306
𰌇
30307
𰌈
30308
𰌉
30309
𰌊
3030A
𰌋
3030B
𰌌
3030C
𰌍
3030D
𰌎
3030E
𰌏
3030F
80
90
𰌐
30310
𰌑
30311
𰌒
30312
𰌓
30313
𰌔
30314
𰌕
30315
𰌖
30316
𰌗
30317
𰌘
30318
𰌙
30319
𰌚
3031A
𰌛
3031B
𰌜
3031C
𰌝
3031D
𰌞
3031E
𰌟
3031F
90
A0
𰌠
30320
𰌡
30321
𰌢
30322
𰌣
30323
𰌤
30324
𰌥
30325
𰌦
30326
𰌧
30327
𰌨
30328
𰌩
30329
𰌪
3032A
𰌫
3032B
𰌬
3032C
𰌭
3032D
𰌮
3032E
𰌯
3032F
A0
B0
𰌰
30330
𰌱
30331
𰌲
30332
𰌳
30333
𰌴
30334
𰌵
30335
𰌶
30336
𰌷
30337
𰌸
30338
𰌹
30339
𰌺
3033A
𰌻
3033B
𰌼
3033C
𰌽
3033D
𰌾
3033E
𰌿
3033F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]