International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage for byte sequences starting with the bytes F2 85 8A

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the Unicode code point (hex) that the byte sequence maps to.

        00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70   (empty)
80     85280 85281 85282 85283 85284 85285 85286 85287 85288 85289 8528A 8528B 8528C 8528D 8528E 8528F
90     85290 85291 85292 85293 85294 85295 85296 85297 85298 85299 8529A 8529B 8529C 8529D 8529E 8529F
A0     852A0 852A1 852A2 852A3 852A4 852A5 852A6 852A7 852A8 852A9 852AA 852AB 852AC 852AD 852AE 852AF
B0     852B0 852B1 852B2 852B3 852B4 852B5 852B6 852B7 852B8 852B9 852BA 852BB 852BC 852BD 852BE 852BF
C0-F0   (empty)
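
The mapping from the bytes F2 85 8A plus a final byte to the code points in the layout can be checked by hand. The sketch below is a minimal illustration in Python, not ICU's own decoder: the helper name `decode_utf8_4byte` is hypothetical, and it only handles well-formed 4-byte sequences without the full range checks a real converter performs. It also suggests why only rows 80 through B0 are populated: a UTF-8 trail byte must lie in 80..BF.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence into a code point.

    Hypothetical helper for illustration; it skips overlong and
    range validation that a production decoder needs.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F2858A")
first = decode_utf8_4byte(prefix + b"\x80")  # top-left cell of row 80
last = decode_utf8_4byte(prefix + b"\xBF")   # bottom-right cell of row B0

# Cross-check against Python's built-in UTF-8 codec.
assert chr(first) == (prefix + b"\x80").decode("utf-8")
assert chr(last) == (prefix + b"\xBF").decode("utf-8")

print(f"{first:05X} {last:05X}")  # prints "85280 852BF"
```

A final byte outside 80..BF, such as 00 or C0, makes the helper raise `ValueError`, which corresponds to the empty rows in the layout.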

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
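
The byte counts above are per UChar, that is, per UTF-16 code unit rather than per code point. The following Python sketch illustrates one way to see why the maximum is 3 even though UTF-8 sequences can be 4 bytes long: a 4-byte sequence always corresponds to a surrogate pair, or two UChars. The helper `utf8_bytes_per_utf16_unit` and the sample characters are assumptions chosen for illustration, and the code uses Python's codecs rather than ICU.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character.

    Hypothetical helper; it uses Python's built-in codecs, not ICU.
    """
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length (1, 2, 3 and 4 bytes).
samples = ["A", "\u00e9", "\u20ac", "\U00085280"]
ratios = [utf8_bytes_per_utf16_unit(s) for s in samples]

# The 4-byte case spans two UTF-16 code units, so its ratio is 2, not 4.
print(min(ratios), max(ratios))  # prints "1.0 3.0"

# The substitution character bytes are the UTF-8 form of U+FFFD.
substitution = "\ufffd".encode("utf-8")
assert substitution == b"\xef\xbf\xbd"
```

The substitution bytes \xEF\xBF\xBD listed above are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.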

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
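
The set above says that every code point except the surrogate range U+D800..U+DFFF can be represented. As a rough cross-check outside ICU, Python's strict UTF-8 codec behaves the same way; the helper `is_representable` below is a hypothetical name used only for this sketch.

```python
def is_representable(cp: int) -> bool:
    """Return True if Python's strict UTF-8 codec can encode code point cp.

    Hypothetical helper; it mirrors the set [^\\uD800-\\uDFFF] using
    Python's codec rather than an ICU converter.
    """
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict UTF-8 codec.
        return False


# Boundaries of the surrogate range, plus one code point from the layout.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x85280, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```

Only U+D800 and U+DFFF report `False` in this run; the code points on either side of the surrogate range encode normally.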