International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B3A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳡀
F3840
󳡁
F3841
󳡂
F3842
󳡃
F3843
󳡄
F3844
󳡅
F3845
󳡆
F3846
󳡇
F3847
󳡈
F3848
󳡉
F3849
󳡊
F384A
󳡋
F384B
󳡌
F384C
󳡍
F384D
󳡎
F384E
󳡏
F384F
80
90
󳡐
F3850
󳡑
F3851
󳡒
F3852
󳡓
F3853
󳡔
F3854
󳡕
F3855
󳡖
F3856
󳡗
F3857
󳡘
F3858
󳡙
F3859
󳡚
F385A
󳡛
F385B
󳡜
F385C
󳡝
F385D
󳡞
F385E
󳡟
F385F
90
A0
󳡠
F3860
󳡡
F3861
󳡢
F3862
󳡣
F3863
󳡤
F3864
󳡥
F3865
󳡦
F3866
󳡧
F3867
󳡨
F3868
󳡩
F3869
󳡪
F386A
󳡫
F386B
󳡬
F386C
󳡭
F386D
󳡮
F386E
󳡯
F386F
A0
B0
󳡰
F3870
󳡱
F3871
󳡲
F3872
󳡳
F3873
󳡴
F3874
󳡵
F3875
󳡶
F3876
󳡷
F3877
󳡸
F3878
󳡹
F3879
󳡺
F387A
󳡻
F387B
󳡼
F387C
󳡽
F387D
󳡾
F387E
󳡿
F387F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]