International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2849D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򄝀
84740
򄝁
84741
򄝂
84742
򄝃
84743
򄝄
84744
򄝅
84745
򄝆
84746
򄝇
84747
򄝈
84748
򄝉
84749
򄝊
8474A
򄝋
8474B
򄝌
8474C
򄝍
8474D
򄝎
8474E
򄝏
8474F
80
90
򄝐
84750
򄝑
84751
򄝒
84752
򄝓
84753
򄝔
84754
򄝕
84755
򄝖
84756
򄝗
84757
򄝘
84758
򄝙
84759
򄝚
8475A
򄝛
8475B
򄝜
8475C
򄝝
8475D
򄝞
8475E
򄝟
8475F
90
A0
򄝠
84760
򄝡
84761
򄝢
84762
򄝣
84763
򄝤
84764
򄝥
84765
򄝦
84766
򄝧
84767
򄝨
84768
򄝩
84769
򄝪
8476A
򄝫
8476B
򄝬
8476C
򄝭
8476D
򄝮
8476E
򄝯
8476F
A0
B0
򄝰
84770
򄝱
84771
򄝲
84772
򄝳
84773
򄝴
84774
򄝵
84775
򄝶
84776
򄝷
84777
򄝸
84778
򄝹
84779
򄝺
8477A
򄝻
8477B
򄝼
8477C
򄝽
8477D
򄝾
8477E
򄝿
8477F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]