International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1828A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񂊀
42280
񂊁
42281
񂊂
42282
񂊃
42283
񂊄
42284
񂊅
42285
񂊆
42286
񂊇
42287
񂊈
42288
񂊉
42289
񂊊
4228A
񂊋
4228B
񂊌
4228C
񂊍
4228D
񂊎
4228E
񂊏
4228F
80
90
񂊐
42290
񂊑
42291
񂊒
42292
񂊓
42293
񂊔
42294
񂊕
42295
񂊖
42296
񂊗
42297
񂊘
42298
񂊙
42299
񂊚
4229A
񂊛
4229B
񂊜
4229C
񂊝
4229D
񂊞
4229E
񂊟
4229F
90
A0
񂊠
422A0
񂊡
422A1
񂊢
422A2
񂊣
422A3
񂊤
422A4
񂊥
422A5
񂊦
422A6
񂊧
422A7
񂊨
422A8
񂊩
422A9
񂊪
422AA
񂊫
422AB
񂊬
422AC
񂊭
422AD
񂊮
422AE
񂊯
422AF
A0
B0
񂊰
422B0
񂊱
422B1
񂊲
422B2
񂊳
422B3
񂊴
422B4
񂊵
422B5
񂊶
422B6
񂊷
422B7
񂊸
422B8
񂊹
422B9
񂊺
422BA
񂊻
422BB
񂊼
422BC
񂊽
422BD
񂊾
422BE
񂊿
422BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]