International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F29C8A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򜊀
9C280
򜊁
9C281
򜊂
9C282
򜊃
9C283
򜊄
9C284
򜊅
9C285
򜊆
9C286
򜊇
9C287
򜊈
9C288
򜊉
9C289
򜊊
9C28A
򜊋
9C28B
򜊌
9C28C
򜊍
9C28D
򜊎
9C28E
򜊏
9C28F
80
90
򜊐
9C290
򜊑
9C291
򜊒
9C292
򜊓
9C293
򜊔
9C294
򜊕
9C295
򜊖
9C296
򜊗
9C297
򜊘
9C298
򜊙
9C299
򜊚
9C29A
򜊛
9C29B
򜊜
9C29C
򜊝
9C29D
򜊞
9C29E
򜊟
9C29F
90
A0
򜊠
9C2A0
򜊡
9C2A1
򜊢
9C2A2
򜊣
9C2A3
򜊤
9C2A4
򜊥
9C2A5
򜊦
9C2A6
򜊧
9C2A7
򜊨
9C2A8
򜊩
9C2A9
򜊪
9C2AA
򜊫
9C2AB
򜊬
9C2AC
򜊭
9C2AD
򜊮
9C2AE
򜊯
9C2AF
A0
B0
򜊰
9C2B0
򜊱
9C2B1
򜊲
9C2B2
򜊳
9C2B3
򜊴
9C2B4
򜊵
9C2B5
򜊶
9C2B6
򜊷
9C2B7
򜊸
9C2B8
򜊹
9C2B9
򜊺
9C2BA
򜊻
9C2BB
򜊼
9C2BC
򜊽
9C2BD
򜊾
9C2BE
򜊿
9C2BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]