International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
IBM IANA
UTF-8 ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
UTF-8


Codepage Layout

Currently showing the codepage starting with the bytes F383AF

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃯀
C3BC0
󃯁
C3BC1
󃯂
C3BC2
󃯃
C3BC3
󃯄
C3BC4
󃯅
C3BC5
󃯆
C3BC6
󃯇
C3BC7
󃯈
C3BC8
󃯉
C3BC9
󃯊
C3BCA
󃯋
C3BCB
󃯌
C3BCC
󃯍
C3BCD
󃯎
C3BCE
󃯏
C3BCF
80
90
󃯐
C3BD0
󃯑
C3BD1
󃯒
C3BD2
󃯓
C3BD3
󃯔
C3BD4
󃯕
C3BD5
󃯖
C3BD6
󃯗
C3BD7
󃯘
C3BD8
󃯙
C3BD9
󃯚
C3BDA
󃯛
C3BDB
󃯜
C3BDC
󃯝
C3BDD
󃯞
C3BDE
󃯟
C3BDF
90
A0
󃯠
C3BE0
󃯡
C3BE1
󃯢
C3BE2
󃯣
C3BE3
󃯤
C3BE4
󃯥
C3BE5
󃯦
C3BE6
󃯧
C3BE7
󃯨
C3BE8
󃯩
C3BE9
󃯪
C3BEA
󃯫
C3BEB
󃯬
C3BEC
󃯭
C3BED
󃯮
C3BEE
󃯯
C3BEF
A0
B0
󃯰
C3BF0
󃯱
C3BF1
󃯲
C3BF2
󃯳
C3BF3
󃯴
C3BF4
󃯵
C3BF5
󃯶
C3BF6
󃯷
C3BF7
󃯸
C3BF8
󃯹
C3BF9
󃯺
C3BFA
󃯻
C3BFB
󃯼
C3BFC
󃯽
C3BFD
󃯾
C3BFE
󃯿
C3BFF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]