International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2988A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򘊀
98280
򘊁
98281
򘊂
98282
򘊃
98283
򘊄
98284
򘊅
98285
򘊆
98286
򘊇
98287
򘊈
98288
򘊉
98289
򘊊
9828A
򘊋
9828B
򘊌
9828C
򘊍
9828D
򘊎
9828E
򘊏
9828F
80
90
򘊐
98290
򘊑
98291
򘊒
98292
򘊓
98293
򘊔
98294
򘊕
98295
򘊖
98296
򘊗
98297
򘊘
98298
򘊙
98299
򘊚
9829A
򘊛
9829B
򘊜
9829C
򘊝
9829D
򘊞
9829E
򘊟
9829F
90
A0
򘊠
982A0
򘊡
982A1
򘊢
982A2
򘊣
982A3
򘊤
982A4
򘊥
982A5
򘊦
982A6
򘊧
982A7
򘊨
982A8
򘊩
982A9
򘊪
982AA
򘊫
982AB
򘊬
982AC
򘊭
982AD
򘊮
982AE
򘊯
982AF
A0
B0
򘊰
982B0
򘊱
982B1
򘊲
982B2
򘊳
982B3
򘊴
982B4
򘊵
982B5
򘊶
982B6
򘊷
982B7
򘊸
982B8
򘊹
982B9
򘊺
982BA
򘊻
982BB
򘊼
982BC
򘊽
982BD
򘊾
982BE
򘊿
982BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]