International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B89D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󸝀
F8740
󸝁
F8741
󸝂
F8742
󸝃
F8743
󸝄
F8744
󸝅
F8745
󸝆
F8746
󸝇
F8747
󸝈
F8748
󸝉
F8749
󸝊
F874A
󸝋
F874B
󸝌
F874C
󸝍
F874D
󸝎
F874E
󸝏
F874F
80
90
󸝐
F8750
󸝑
F8751
󸝒
F8752
󸝓
F8753
󸝔
F8754
󸝕
F8755
󸝖
F8756
󸝗
F8757
󸝘
F8758
󸝙
F8759
󸝚
F875A
󸝛
F875B
󸝜
F875C
󸝝
F875D
󸝞
F875E
󸝟
F875F
90
A0
󸝠
F8760
󸝡
F8761
󸝢
F8762
󸝣
F8763
󸝤
F8764
󸝥
F8765
󸝦
F8766
󸝧
F8767
󸝨
F8768
󸝩
F8769
󸝪
F876A
󸝫
F876B
󸝬
F876C
󸝭
F876D
󸝮
F876E
󸝯
F876F
A0
B0
󸝰
F8770
󸝱
F8771
󸝲
F8772
󸝳
F8773
󸝴
F8774
󸝵
F8775
󸝶
F8776
󸝷
F8777
󸝸
F8778
󸝹
F8779
󸝺
F877A
󸝻
F877B
󸝼
F877C
󸝽
F877D
󸝾
F877E
󸝿
F877F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]