International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
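
As a rough illustration of how an alias table like this can be used, the sketch below maps each listed alias to the internal converter name. The alias strings are copied from the list above; the `normalize` rule (ignore case, hyphens, underscores, and spaces) and the function names are assumptions made for this example and are not taken from ICU's source.

```python
# Alias strings copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "windows-65001", "utf8", "latin-1"):
        print(candidate, "->", canonical_name(candidate))
```

Under this loose matching rule, spellings such as `utf8` or `IBM_1208` resolve to the same converter, while a name that is not in the list returns `None`.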


Codepage Layout

Currently showing the codepage starting with the bytes F2 B5 A0

Rows give the high nibble of the final byte and columns give the low nibble; each cell is the code point (hex) for that four-byte sequence. Rows 00 to 70 and C0 to F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  B5800 B5801 B5802 B5803 B5804 B5805 B5806 B5807 B5808 B5809 B580A B580B B580C B580D B580E B580F
90  B5810 B5811 B5812 B5813 B5814 B5815 B5816 B5817 B5818 B5819 B581A B581B B581C B581D B581E B581F
A0  B5820 B5821 B5822 B5823 B5824 B5825 B5826 B5827 B5828 B5829 B582A B582B B582C B582D B582E B582F
B0  B5830 B5831 B5832 B5833 B5834 B5835 B5836 B5837 B5838 B5839 B583A B583B B583C B583D B583E B583F
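
The layout above can be reproduced with a little bit arithmetic. The following is a minimal sketch, assuming standard four-byte UTF-8 (lead byte `11110xxx` followed by three `10xxxxxx` continuation bytes); the helper name `decode_utf8_4byte` is made up for this example and performs no validation.

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point.

    The lead byte contributes 3 bits and each continuation byte 6 bits.
    No validation is done; the caller is assumed to pass well-formed bytes.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The page shows lead bytes F2 B5 A0; the final byte ranges over 80..BF.
first = decode_utf8_4byte(0xF2, 0xB5, 0xA0, 0x80)
last = decode_utf8_4byte(0xF2, 0xB5, 0xA0, 0xBF)
print(f"{first:X}..{last:X}")  # B5800..B583F

# Cross-check against Python's built-in UTF-8 codec.
assert ord(bytes([0xF2, 0xB5, 0xA0, 0x80]).decode("utf-8")) == first
```

Only final bytes 80 through BF are valid continuation bytes, which is why the other rows of the layout are empty.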

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
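
One way to read the byte counts above is per UTF-16 code unit (a UChar) rather than per character. The sketch below is an illustration under that assumption, using only Python's standard codecs; the helper name `utf8_bytes_and_utf16_units` is hypothetical. It also checks that the substitution bytes are the UTF-8 encoding of U+FFFD.

```python
def utf8_bytes_and_utf16_units(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-be")) // 2
    return utf8_len, utf16_units


# ASCII: 1 byte for 1 code unit (the minimum).
print(utf8_bytes_and_utf16_units("A"))            # (1, 1)
# A BMP character above U+07FF: 3 bytes for 1 code unit (the maximum).
print(utf8_bytes_and_utf16_units("\uFFFD"))       # (3, 1)
# A supplementary character: 4 bytes, but spread over 2 code units.
print(utf8_bytes_and_utf16_units(chr(0xB5800)))   # (4, 2)

# The substitution character bytes are U+FFFD encoded as UTF-8.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"

# ASCII compatibility: bytes 0x20..0x7E decode to the same code points.
assert all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
```

Under this reading, a four-byte sequence never exceeds three bytes per UChar, because the character it encodes occupies two UTF-16 code units.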


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
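
The set above excludes only the surrogate range. A minimal sketch of that membership test follows, with a cross-check against Python's own UTF-8 codec, which also refuses to encode lone surrogates; both function names are made up for this example.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range D800..DFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Cross-check: does Python's strict UTF-8 codec accept this code point?"""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xB5800, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```

For every code point in the loop the two functions agree: everything is accepted except U+D800 and U+DFFF.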