International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F4828E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􂎀
102380
􂎁
102381
􂎂
102382
􂎃
102383
􂎄
102384
􂎅
102385
􂎆
102386
􂎇
102387
􂎈
102388
􂎉
102389
􂎊
10238A
􂎋
10238B
􂎌
10238C
􂎍
10238D
􂎎
10238E
􂎏
10238F
80
90
􂎐
102390
􂎑
102391
􂎒
102392
􂎓
102393
􂎔
102394
􂎕
102395
􂎖
102396
􂎗
102397
􂎘
102398
􂎙
102399
􂎚
10239A
􂎛
10239B
􂎜
10239C
􂎝
10239D
􂎞
10239E
􂎟
10239F
90
A0
􂎠
1023A0
􂎡
1023A1
􂎢
1023A2
􂎣
1023A3
􂎤
1023A4
􂎥
1023A5
􂎦
1023A6
􂎧
1023A7
􂎨
1023A8
􂎩
1023A9
􂎪
1023AA
􂎫
1023AB
􂎬
1023AC
􂎭
1023AD
􂎮
1023AE
􂎯
1023AF
A0
B0
􂎰
1023B0
􂎱
1023B1
􂎲
1023B2
􂎳
1023B3
􂎴
1023B4
􂎵
1023B5
􂎶
1023B6
􂎷
1023B7
􂎸
1023B8
􂎹
1023B9
􂎺
1023BA
􂎻
1023BB
􂎼
1023BC
􂎽
1023BD
􂎾
1023BE
􂎿
1023BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]