International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
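
The alias list above can be treated as a lookup table from any listed name to the internal converter name. The following is a minimal sketch of that idea, not ICU's implementation: the `normalize` and `canonical_name` helpers are hypothetical, and the lenient matching rule (ignore case, hyphens, underscores, and spaces) is an assumption that only approximates how ICU compares converter names.

```python
# Aliases copied from the list above; all of them name the same converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name):
    """Hypothetical lenient key: lower-case, drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Map every normalized alias to the internal converter name.
LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}

def canonical_name(alias):
    """Return the internal converter name, or None if the alias is unknown."""
    return LOOKUP.get(normalize(alias))

print(canonical_name("CP1208"))         # listed alias, different case
print(canonical_name("windows_65001"))  # underscore instead of hyphen
print(canonical_name("latin-1"))        # not in this converter's alias list
```

With ICU itself, the equivalent query would go through the converter alias API rather than a hand-built table.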


Codepage Layout

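Currently showing the codepage starting with the bytes F2 82 82. Row and column labels give the high and low hex digits of the fourth byte; each cell shows the resulting code point.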

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  82080 82081 82082 82083 82084 82085 82086 82087 82088 82089 8208A 8208B 8208C 8208D 8208E 8208F
90  82090 82091 82092 82093 82094 82095 82096 82097 82098 82099 8209A 8209B 8209C 8209D 8209E 8209F
A0  820A0 820A1 820A2 820A3 820A4 820A5 820A6 820A7 820A8 820A9 820AA 820AB 820AC 820AD 820AE 820AF
B0  820B0 820B1 820B2 820B3 820B4 820B5 820B6 820B7 820B8 820B9 820BA 820BB 820BC 820BD 820BE 820BF
C0
D0
E0
F0
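
The code points in the layout follow from the UTF-8 bit layout for four-byte sequences: the lead byte contributes its low 3 bits and each of the three following bytes contributes its low 6 bits. The sketch below reproduces the table's values from the bytes F2 82 82 xx; the helper names `decode_4byte` and `layout_cells` are illustrative only, and the function assumes its input is already a well-formed four-byte sequence.

```python
def decode_4byte(seq):
    """Combine the payload bits of a well-formed 4-byte UTF-8 sequence."""
    b0, b1, b2, b3 = seq
    return (((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12)
            | ((b2 & 0x3F) << 6) | (b3 & 0x3F))

def layout_cells(prefix):
    """Map each valid fourth byte (0x80..0xBF) to its code point."""
    return {last: decode_4byte(prefix + bytes([last]))
            for last in range(0x80, 0xC0)}

cells = layout_cells(bytes.fromhex("F28282"))
print(f"{cells[0x80]:X}")  # first populated cell, row 80 column 00
print(f"{cells[0xBF]:X}")  # last populated cell, row B0 column 0F

# Cross-check against Python's built-in UTF-8 decoder.
assert bytes.fromhex("F2828280").decode("utf-8") == chr(cells[0x80])
```

Only rows 80 through B0 are populated because a continuation byte must have the bit pattern 10xxxxxx, which limits the fourth byte to the range 80 to BF.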

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
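
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 rather than 4: a four-byte UTF-8 sequence corresponds to a surrogate pair of two UChars. The following sketch illustrates this with Python's standard codecs rather than ICU; `bytes_per_uchar` is a hypothetical helper.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchar_count = len(ch.encode("utf-16-le")) // 2
    return utf8_len / uchar_count

print(bytes_per_uchar("A"))           # ASCII: 1 byte, 1 UChar
print(bytes_per_uchar("\u20ac"))      # BMP character: 3 bytes, 1 UChar
print(bytes_per_uchar(chr(0x82080)))  # supplementary: 4 bytes, 2 UChars

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
assert b"\xEF\xBF\xBD".decode("utf-8") == "\ufffd"
```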

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
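
The set above reads as "every Unicode code point except the surrogates U+D800 to U+DFFF". A small sketch of that membership test follows, checked against Python's own UTF-8 encoder, which also refuses lone surrogates by default; `is_representable` and `encodes` are hypothetical helper names.

```python
def is_representable(cp):
    """True if cp is a code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def encodes(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Spot checks around the boundaries of the excluded range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x82080):
    assert is_representable(cp) == encodes(cp)

# Size of the set: all code points minus the 2048 surrogates.
print(0x110000 - (0xDFFF - 0xD800 + 1))
```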