International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3958A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󕊀
D5280
󕊁
D5281
󕊂
D5282
󕊃
D5283
󕊄
D5284
󕊅
D5285
󕊆
D5286
󕊇
D5287
󕊈
D5288
󕊉
D5289
󕊊
D528A
󕊋
D528B
󕊌
D528C
󕊍
D528D
󕊎
D528E
󕊏
D528F
80
90
󕊐
D5290
󕊑
D5291
󕊒
D5292
󕊓
D5293
󕊔
D5294
󕊕
D5295
󕊖
D5296
󕊗
D5297
󕊘
D5298
󕊙
D5299
󕊚
D529A
󕊛
D529B
󕊜
D529C
󕊝
D529D
󕊞
D529E
󕊟
D529F
90
A0
󕊠
D52A0
󕊡
D52A1
󕊢
D52A2
󕊣
D52A3
󕊤
D52A4
󕊥
D52A5
󕊦
D52A6
󕊧
D52A7
󕊨
D52A8
󕊩
D52A9
󕊪
D52AA
󕊫
D52AB
󕊬
D52AC
󕊭
D52AD
󕊮
D52AE
󕊯
D52AF
A0
B0
󕊰
D52B0
󕊱
D52B1
󕊲
D52B2
󕊳
D52B3
󕊴
D52B4
󕊵
D52B5
󕊶
D52B6
󕊷
D52B7
󕊸
D52B8
󕊹
D52B9
󕊺
D52BA
󕊻
D52BB
󕊼
D52BC
󕊽
D52BD
󕊾
D52BE
󕊿
D52BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]