International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4849D

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􄝀
104740
􄝁
104741
􄝂
104742
􄝃
104743
􄝄
104744
􄝅
104745
􄝆
104746
􄝇
104747
􄝈
104748
􄝉
104749
􄝊
10474A
􄝋
10474B
􄝌
10474C
􄝍
10474D
􄝎
10474E
􄝏
10474F
80
90
􄝐
104750
􄝑
104751
􄝒
104752
􄝓
104753
􄝔
104754
􄝕
104755
􄝖
104756
􄝗
104757
􄝘
104758
􄝙
104759
􄝚
10475A
􄝛
10475B
􄝜
10475C
􄝝
10475D
􄝞
10475E
􄝟
10475F
90
A0
􄝠
104760
􄝡
104761
􄝢
104762
􄝣
104763
􄝤
104764
􄝥
104765
􄝦
104766
􄝧
104767
􄝨
104768
􄝩
104769
􄝪
10476A
􄝫
10476B
􄝬
10476C
􄝭
10476D
􄝮
10476E
􄝯
10476F
A0
B0
􄝰
104770
􄝱
104771
􄝲
104772
􄝳
104773
􄝴
104774
􄝵
104775
􄝶
104776
􄝷
104777
􄝸
104778
􄝹
104779
􄝺
10477A
􄝻
10477B
􄝼
10477C
􄝽
10477D
􄝾
10477E
􄝿
10477F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]