International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B2A1

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𲡀
32840
𲡁
32841
𲡂
32842
𲡃
32843
𲡄
32844
𲡅
32845
𲡆
32846
𲡇
32847
𲡈
32848
𲡉
32849
𲡊
3284A
𲡋
3284B
𲡌
3284C
𲡍
3284D
𲡎
3284E
𲡏
3284F
80
90
𲡐
32850
𲡑
32851
𲡒
32852
𲡓
32853
𲡔
32854
𲡕
32855
𲡖
32856
𲡗
32857
𲡘
32858
𲡙
32859
𲡚
3285A
𲡛
3285B
𲡜
3285C
𲡝
3285D
𲡞
3285E
𲡟
3285F
90
A0
𲡠
32860
𲡡
32861
𲡢
32862
𲡣
32863
𲡤
32864
𲡥
32865
𲡦
32866
𲡧
32867
𲡨
32868
𲡩
32869
𲡪
3286A
𲡫
3286B
𲡬
3286C
𲡭
3286D
𲡮
3286E
𲡯
3286F
A0
B0
𲡰
32870
𲡱
32871
𲡲
32872
𲡳
32873
𲡴
32874
𲡵
32875
𲡶
32876
𲡷
32877
𲡸
32878
𲡹
32879
𲡺
3287A
𲡻
3287B
𲡼
3287C
𲡽
3287D
𲡾
3287E
𲡿
3287F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]