International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
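
As a rough illustration of how an alias list like this one can be used, the sketch below maps any listed alias back to the internal converter name. The alias strings come from the list above; the `normalize` and `canonical_name` helpers are hypothetical, and the loose matching (ignore case and punctuation) is an assumption made for this example. ICU's own name matching has its own rules, which this sketch does not attempt to reproduce.

```python
# Aliases copied from the list above. The lookup logic is a simplified
# illustration, not ICU's actual matching algorithm.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and keep only ASCII letters and digits."""
    return "".join(ch for ch in name.lower() if ch.isascii() and ch.isalnum())


# Every normalized alias points at the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return "UTF-8" for a known alias, or None if the name is not listed."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))  # a listed alias, written differently
print(canonical_name("latin-1"))   # not in the list
```

Normalizing both the stored aliases and the query keeps the table small while still accepting spellings such as `x-utf-8j` for `x-UTF_8J`.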


Codepage Layout

Currently showing the codepage starting with the bytes F2B38C

Each row is labeled with the final byte's high nibble and spans columns 00 through 0F; cells give the code point (hex).

Row    Code points for columns 00 through 0F
00-70  (empty)
80     B3300 B3301 B3302 B3303 B3304 B3305 B3306 B3307 B3308 B3309 B330A B330B B330C B330D B330E B330F
90     B3310 B3311 B3312 B3313 B3314 B3315 B3316 B3317 B3318 B3319 B331A B331B B331C B331D B331E B331F
A0     B3320 B3321 B3322 B3323 B3324 B3325 B3326 B3327 B3328 B3329 B332A B332B B332C B332D B332E B332F
B0     B3330 B3331 B3332 B3333 B3334 B3335 B3336 B3337 B3338 B3339 B333A B333B B333C B333D B333E B333F
C0-F0  (empty)
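
The mapping from the lead bytes F2B38C plus a final byte to the code points shown in the layout follows from the UTF-8 bit layout for four-byte sequences. The sketch below works through that arithmetic; `decode_utf8_4byte` is a hypothetical helper written for this example, not an ICU function, and it covers only the four-byte form.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point (illustrative only)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((t & 0xC0) != 0x80 for t in b[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    cp = (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


first = decode_utf8_4byte(bytes.fromhex("F2B38C80"))
last = decode_utf8_4byte(bytes.fromhex("F2B38CBF"))
print(f"{first:X} {last:X}")  # B3300 B333F

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes.fromhex("F2B38C80").decode("utf-8") == chr(first)
```

Because a trail byte can only take the values 80 through BF, only rows 80, 90, A0, and B0 of the layout can be populated for a given three-byte prefix.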

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
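
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a single code point can take 4 bytes in UTF-8. The sketch below checks this on a few sample characters using Python's standard codecs rather than ICU; the helper name and the sample set are choices made for this example, so it illustrates the figures rather than proving them for every character.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# ASCII, Latin-1 Supplement, a BMP symbol, and a supplementary code point.
samples = ["A", "\u00e9", "\u20ac", "\U000B3300"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# The listed substitution bytes are the UTF-8 encoding of U+FFFD.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```

A supplementary code point uses 4 UTF-8 bytes but 2 UTF-16 code units (a surrogate pair), so it averages 2 bytes per UChar and does not raise the maximum above 3.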


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
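
The set `[^\uD800-\uDFFF]` says that every code point except the surrogates is representable. A minimal sketch of that rule follows, checked against Python's built-in UTF-8 encoder rather than ICU; both function names are hypothetical and exist only for this example.

```python
def utf8_representable(cp: int) -> bool:
    """True for any Unicode code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Ask Python's strict UTF-8 encoder whether it accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases around the surrogate block, plus the ends of the code space.
for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB3300, 0x10FFFF):
    assert utf8_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X}: {utf8_representable(cp)}")
```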