International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3878C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󇌀
C7300
󇌁
C7301
󇌂
C7302
󇌃
C7303
󇌄
C7304
󇌅
C7305
󇌆
C7306
󇌇
C7307
󇌈
C7308
󇌉
C7309
󇌊
C730A
󇌋
C730B
󇌌
C730C
󇌍
C730D
󇌎
C730E
󇌏
C730F
80
90
󇌐
C7310
󇌑
C7311
󇌒
C7312
󇌓
C7313
󇌔
C7314
󇌕
C7315
󇌖
C7316
󇌗
C7317
󇌘
C7318
󇌙
C7319
󇌚
C731A
󇌛
C731B
󇌜
C731C
󇌝
C731D
󇌞
C731E
󇌟
C731F
90
A0
󇌠
C7320
󇌡
C7321
󇌢
C7322
󇌣
C7323
󇌤
C7324
󇌥
C7325
󇌦
C7326
󇌧
C7327
󇌨
C7328
󇌩
C7329
󇌪
C732A
󇌫
C732B
󇌬
C732C
󇌭
C732D
󇌮
C732E
󇌯
C732F
A0
B0
󇌰
C7330
󇌱
C7331
󇌲
C7332
󇌳
C7333
󇌴
C7334
󇌵
C7335
󇌶
C7336
󇌷
C7337
󇌸
C7338
󇌹
C7339
󇌺
C733A
󇌻
C733B
󇌼
C733C
󇌽
C733D
󇌾
C733E
󇌿
C733F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]