International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2859D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򅝀
85740
򅝁
85741
򅝂
85742
򅝃
85743
򅝄
85744
򅝅
85745
򅝆
85746
򅝇
85747
򅝈
85748
򅝉
85749
򅝊
8574A
򅝋
8574B
򅝌
8574C
򅝍
8574D
򅝎
8574E
򅝏
8574F
80
90
򅝐
85750
򅝑
85751
򅝒
85752
򅝓
85753
򅝔
85754
򅝕
85755
򅝖
85756
򅝗
85757
򅝘
85758
򅝙
85759
򅝚
8575A
򅝛
8575B
򅝜
8575C
򅝝
8575D
򅝞
8575E
򅝟
8575F
90
A0
򅝠
85760
򅝡
85761
򅝢
85762
򅝣
85763
򅝤
85764
򅝥
85765
򅝦
85766
򅝧
85767
򅝨
85768
򅝩
85769
򅝪
8576A
򅝫
8576B
򅝬
8576C
򅝭
8576D
򅝮
8576E
򅝯
8576F
A0
B0
򅝰
85770
򅝱
85771
򅝲
85772
򅝳
85773
򅝴
85774
򅝵
85775
򅝶
85776
򅝷
85777
򅝸
85778
򅝹
85779
򅝺
8577A
򅝻
8577B
򅝼
8577C
򅝽
8577D
򅝾
8577E
򅝿
8577F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]