International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28AA0

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򊠀
8A800
򊠁
8A801
򊠂
8A802
򊠃
8A803
򊠄
8A804
򊠅
8A805
򊠆
8A806
򊠇
8A807
򊠈
8A808
򊠉
8A809
򊠊
8A80A
򊠋
8A80B
򊠌
8A80C
򊠍
8A80D
򊠎
8A80E
򊠏
8A80F
80
90
򊠐
8A810
򊠑
8A811
򊠒
8A812
򊠓
8A813
򊠔
8A814
򊠕
8A815
򊠖
8A816
򊠗
8A817
򊠘
8A818
򊠙
8A819
򊠚
8A81A
򊠛
8A81B
򊠜
8A81C
򊠝
8A81D
򊠞
8A81E
򊠟
8A81F
90
A0
򊠠
8A820
򊠡
8A821
򊠢
8A822
򊠣
8A823
򊠤
8A824
򊠥
8A825
򊠦
8A826
򊠧
8A827
򊠨
8A828
򊠩
8A829
򊠪
8A82A
򊠫
8A82B
򊠬
8A82C
򊠭
8A82D
򊠮
8A82E
򊠯
8A82F
A0
B0
򊠰
8A830
򊠱
8A831
򊠲
8A832
򊠳
8A833
򊠴
8A834
򊠵
8A835
򊠶
8A836
򊠷
8A837
򊠸
8A838
򊠹
8A839
򊠺
8A83A
򊠻
8A83B
򊠼
8A83C
򊠽
8A83D
򊠾
8A83E
򊠿
8A83F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]