International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2AE8A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򮊀
AE280
򮊁
AE281
򮊂
AE282
򮊃
AE283
򮊄
AE284
򮊅
AE285
򮊆
AE286
򮊇
AE287
򮊈
AE288
򮊉
AE289
򮊊
AE28A
򮊋
AE28B
򮊌
AE28C
򮊍
AE28D
򮊎
AE28E
򮊏
AE28F
80
90
򮊐
AE290
򮊑
AE291
򮊒
AE292
򮊓
AE293
򮊔
AE294
򮊕
AE295
򮊖
AE296
򮊗
AE297
򮊘
AE298
򮊙
AE299
򮊚
AE29A
򮊛
AE29B
򮊜
AE29C
򮊝
AE29D
򮊞
AE29E
򮊟
AE29F
90
A0
򮊠
AE2A0
򮊡
AE2A1
򮊢
AE2A2
򮊣
AE2A3
򮊤
AE2A4
򮊥
AE2A5
򮊦
AE2A6
򮊧
AE2A7
򮊨
AE2A8
򮊩
AE2A9
򮊪
AE2AA
򮊫
AE2AB
򮊬
AE2AC
򮊭
AE2AD
򮊮
AE2AE
򮊯
AE2AF
A0
B0
򮊰
AE2B0
򮊱
AE2B1
򮊲
AE2B2
򮊳
AE2B3
򮊴
AE2B4
򮊵
AE2B5
򮊶
AE2B6
򮊷
AE2B7
򮊸
AE2B8
򮊹
AE2B9
򮊺
AE2BA
򮊻
AE2BB
򮊼
AE2BC
򮊽
AE2BD
򮊾
AE2BE
򮊿
AE2BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]