International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4899C

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􉜀
109700
􉜁
109701
􉜂
109702
􉜃
109703
􉜄
109704
􉜅
109705
􉜆
109706
􉜇
109707
􉜈
109708
􉜉
109709
􉜊
10970A
􉜋
10970B
􉜌
10970C
􉜍
10970D
􉜎
10970E
􉜏
10970F
80
90
􉜐
109710
􉜑
109711
􉜒
109712
􉜓
109713
􉜔
109714
􉜕
109715
􉜖
109716
􉜗
109717
􉜘
109718
􉜙
109719
􉜚
10971A
􉜛
10971B
􉜜
10971C
􉜝
10971D
􉜞
10971E
􉜟
10971F
90
A0
􉜠
109720
􉜡
109721
􉜢
109722
􉜣
109723
􉜤
109724
􉜥
109725
􉜦
109726
􉜧
109727
􉜨
109728
􉜩
109729
􉜪
10972A
􉜫
10972B
􉜬
10972C
􉜭
10972D
􉜮
10972E
􉜯
10972F
A0
B0
􉜰
109730
􉜱
109731
􉜲
109732
􉜳
109733
􉜴
109734
􉜵
109735
􉜶
109736
􉜷
109737
􉜸
109738
􉜹
109739
􉜺
10973A
􉜻
10973B
􉜼
10973C
􉜽
10973D
􉜾
10973E
􉜿
10973F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]