International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F09A8A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𚊀
1A280
𚊁
1A281
𚊂
1A282
𚊃
1A283
𚊄
1A284
𚊅
1A285
𚊆
1A286
𚊇
1A287
𚊈
1A288
𚊉
1A289
𚊊
1A28A
𚊋
1A28B
𚊌
1A28C
𚊍
1A28D
𚊎
1A28E
𚊏
1A28F
80
90
𚊐
1A290
𚊑
1A291
𚊒
1A292
𚊓
1A293
𚊔
1A294
𚊕
1A295
𚊖
1A296
𚊗
1A297
𚊘
1A298
𚊙
1A299
𚊚
1A29A
𚊛
1A29B
𚊜
1A29C
𚊝
1A29D
𚊞
1A29E
𚊟
1A29F
90
A0
𚊠
1A2A0
𚊡
1A2A1
𚊢
1A2A2
𚊣
1A2A3
𚊤
1A2A4
𚊥
1A2A5
𚊦
1A2A6
𚊧
1A2A7
𚊨
1A2A8
𚊩
1A2A9
𚊪
1A2AA
𚊫
1A2AB
𚊬
1A2AC
𚊭
1A2AD
𚊮
1A2AE
𚊯
1A2AF
A0
B0
𚊰
1A2B0
𚊱
1A2B1
𚊲
1A2B2
𚊳
1A2B3
𚊴
1A2B4
𚊵
1A2B5
𚊶
1A2B6
𚊷
1A2B7
𚊸
1A2B8
𚊹
1A2B9
𚊺
1A2BA
𚊻
1A2BB
𚊼
1A2BC
𚊽
1A2BD
𚊾
1A2BE
𚊿
1A2BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]