International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AD8A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󭊀
ED280
󭊁
ED281
󭊂
ED282
󭊃
ED283
󭊄
ED284
󭊅
ED285
󭊆
ED286
󭊇
ED287
󭊈
ED288
󭊉
ED289
󭊊
ED28A
󭊋
ED28B
󭊌
ED28C
󭊍
ED28D
󭊎
ED28E
󭊏
ED28F
80
90
󭊐
ED290
󭊑
ED291
󭊒
ED292
󭊓
ED293
󭊔
ED294
󭊕
ED295
󭊖
ED296
󭊗
ED297
󭊘
ED298
󭊙
ED299
󭊚
ED29A
󭊛
ED29B
󭊜
ED29C
󭊝
ED29D
󭊞
ED29E
󭊟
ED29F
90
A0
󭊠
ED2A0
󭊡
ED2A1
󭊢
ED2A2
󭊣
ED2A3
󭊤
ED2A4
󭊥
ED2A5
󭊦
ED2A6
󭊧
ED2A7
󭊨
ED2A8
󭊩
ED2A9
󭊪
ED2AA
󭊫
ED2AB
󭊬
ED2AC
󭊭
ED2AD
󭊮
ED2AE
󭊯
ED2AF
A0
B0
󭊰
ED2B0
󭊱
ED2B1
󭊲
ED2B2
󭊳
ED2B3
󭊴
ED2B4
󭊵
ED2B5
󭊶
ED2B6
󭊷
ED2B7
󭊸
ED2B8
󭊹
ED2B9
󭊺
ED2BA
󭊻
ED2BB
󭊼
ED2BC
󭊽
ED2BD
󭊾
ED2BE
󭊿
ED2BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]