International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48682

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􆂀
106080
􆂁
106081
􆂂
106082
􆂃
106083
􆂄
106084
􆂅
106085
􆂆
106086
􆂇
106087
􆂈
106088
􆂉
106089
􆂊
10608A
􆂋
10608B
􆂌
10608C
􆂍
10608D
􆂎
10608E
􆂏
10608F
80
90
􆂐
106090
􆂑
106091
􆂒
106092
􆂓
106093
􆂔
106094
􆂕
106095
􆂖
106096
􆂗
106097
􆂘
106098
􆂙
106099
􆂚
10609A
􆂛
10609B
􆂜
10609C
􆂝
10609D
􆂞
10609E
􆂟
10609F
90
A0
􆂠
1060A0
􆂡
1060A1
􆂢
1060A2
􆂣
1060A3
􆂤
1060A4
􆂥
1060A5
􆂦
1060A6
􆂧
1060A7
􆂨
1060A8
􆂩
1060A9
􆂪
1060AA
􆂫
1060AB
􆂬
1060AC
􆂭
1060AD
􆂮
1060AE
􆂯
1060AF
A0
B0
􆂰
1060B0
􆂱
1060B1
􆂲
1060B2
􆂳
1060B3
􆂴
1060B4
􆂵
1060B5
􆂶
1060B6
􆂷
1060B7
􆂸
1060B8
􆂹
1060B9
􆂺
1060BA
􆂻
1060BB
􆂼
1060BC
􆂽
1060BD
􆂾
1060BE
􆂿
1060BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]