International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F383A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󃡀
C3840
󃡁
C3841
󃡂
C3842
󃡃
C3843
󃡄
C3844
󃡅
C3845
󃡆
C3846
󃡇
C3847
󃡈
C3848
󃡉
C3849
󃡊
C384A
󃡋
C384B
󃡌
C384C
󃡍
C384D
󃡎
C384E
󃡏
C384F
80
90
󃡐
C3850
󃡑
C3851
󃡒
C3852
󃡓
C3853
󃡔
C3854
󃡕
C3855
󃡖
C3856
󃡗
C3857
󃡘
C3858
󃡙
C3859
󃡚
C385A
󃡛
C385B
󃡜
C385C
󃡝
C385D
󃡞
C385E
󃡟
C385F
90
A0
󃡠
C3860
󃡡
C3861
󃡢
C3862
󃡣
C3863
󃡤
C3864
󃡥
C3865
󃡦
C3866
󃡧
C3867
󃡨
C3868
󃡩
C3869
󃡪
C386A
󃡫
C386B
󃡬
C386C
󃡭
C386D
󃡮
C386E
󃡯
C386F
A0
B0
󃡰
C3870
󃡱
C3871
󃡲
C3872
󃡳
C3873
󃡴
C3874
󃡵
C3875
󃡶
C3876
󃡷
C3877
󃡸
C3878
󃡹
C3879
󃡺
C387A
󃡻
C387B
󃡼
C387C
󃡽
C387D
󃡾
C387E
󃡿
C387F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]