International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name    All Aliases
UTF-8                      UTF-8
                           ibm-1208
                           ibm-1209
                           ibm-5304
                           ibm-5305
                           ibm-13496
                           ibm-13497
                           ibm-17592
                           ibm-17593
                           windows-65001
                           cp1208
                           x-UTF_8J
                           unicode-1-1-utf-8
                           unicode-2-0-utf-8
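
As a rough illustration of how an alias list like this can be used, the sketch below maps any listed alias back to the canonical name UTF-8. The helper names (`normalize`, `canonical_name`) and the matching rule (ignore case, hyphens, underscores and spaces) are assumptions made for this example; they are not ICU's API and may not match ICU's actual alias comparison in every detail.

```python
import re

# Aliases copied from the list above; the canonical converter name is "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical helper: fold case and drop '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name).lower()


# Lookup table keyed by the normalized alias (assumed matching rule).
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the canonical converter name, or None if the alias is unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("utf8")` both resolve to "UTF-8", while a name that is not in the list, such as "latin-1", yields None.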


Codepage Layout

Currently showing the codepage starting with the bytes F4 81 9C

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00
10
20
30
40
50
60
70
80  101700 101701 101702 101703 101704 101705 101706 101707 101708 101709 10170A 10170B 10170C 10170D 10170E 10170F
90  101710 101711 101712 101713 101714 101715 101716 101717 101718 101719 10171A 10171B 10171C 10171D 10171E 10171F
A0  101720 101721 101722 101723 101724 101725 101726 101727 101728 101729 10172A 10172B 10172C 10172D 10172E 10172F
B0  101730 101731 101732 101733 101734 101735 101736 101737 101738 101739 10173A 10173B 10173C 10173D 10173E 10173F
C0
D0
E0
F0
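
The filled cells in the layout above follow from the standard UTF-8 bit layout for four-byte sequences (11110xxx 10xxxxxx 10xxxxxx 10xxxxxx). The sketch below recomputes a cell from the fixed prefix F4 81 9C plus a trail byte, and cross-checks the result against Python's built-in decoder. The function names are made up for this example and are not part of ICU.

```python
def decode_utf8_4byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The three bytes this codepage view starts with.
PREFIX = (0xF4, 0x81, 0x9C)


def cell(trail: int) -> int:
    """Code point for the table cell whose row + column equals `trail`."""
    if not 0x80 <= trail <= 0xBF:
        # Rows 00-70 and C0-F0 are empty: those values are not trail bytes.
        raise ValueError("not a valid UTF-8 trail byte")
    return decode_utf8_4byte(*PREFIX, trail)


first = cell(0x80)  # 0x101700, row 80 column 00
last = cell(0xBF)   # 0x10173F, row B0 column 0F

# Cross-check the hand-rolled decoder against Python's own UTF-8 codec.
assert chr(first) == bytes([*PREFIX, 0x80]).decode("utf-8")
assert chr(last) == bytes([*PREFIX, 0xBF]).decode("utf-8")
```

Only trail bytes 80 through BF carry the bit pattern 10xxxxxx, which is why exactly four rows of the grid are populated.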

Information About This Converter
Type of converter                      UCNV_UTF8
Minimum number of bytes per UChar      1
Maximum number of bytes per UChar      3
Substitution character                 \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?       TRUE
Is ASCII [\u0020-\u007E] ambiguous?    FALSE
Contains ambiguous aliases?            FALSE
Always generates Unicode NFC?          UNKNOWN
Contains BiDi characters?              TRUE
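
The minimum and maximum figures are counted per UChar, that is, per UTF-16 code unit, rather than per character. The sketch below, which uses Python's standard codecs as a stand-in for ICU, measures that ratio for a few sample characters and shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD. The helper `bytes_per_uchar` is hypothetical, and Python's "replace" error handler is only an analogy for a converter's substitution behavior.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes needed per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


# ASCII, Latin-1 range, a 3-byte BMP character, and a supplementary character.
samples = ["A", "\u00e9", "\u20ac", "\U00101700"]
ratios = [bytes_per_uchar(c) for c in samples]

# A supplementary character needs 4 UTF-8 bytes but also 2 UChars,
# so the per-UChar maximum stays at 3 (reached by BMP characters).
assert min(ratios) == 1.0
assert max(ratios) == 3.0

# The substitution character listed above is U+FFFD encoded as UTF-8.
substitution = "\ufffd".encode("utf-8")
assert substitution == b"\xef\xbf\xbd"

# Python's decoder emits the same character for a malformed byte.
replaced = b"\xff".decode("utf-8", errors="replace")
assert replaced == "\ufffd"
```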

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
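
The set above says that every Unicode code point except the surrogate range D800-DFFF can be represented. A minimal sketch of that rule follows, checked against Python's own UTF-8 encoder, which also refuses lone surrogates. Both function names are invented for this example.

```python
def is_representable(cp: int) -> bool:
    """True if `cp` is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Ask Python's UTF-8 codec whether it will encode this code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases around the excluded range, plus one cell from the layout.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x101700, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
```

Surrogates are excluded because they exist only as halves of UTF-16 pairs; a supplementary character such as U+101700 is encoded directly as four UTF-8 bytes instead.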