International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B482

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𴂀
34080
𴂁
34081
𴂂
34082
𴂃
34083
𴂄
34084
𴂅
34085
𴂆
34086
𴂇
34087
𴂈
34088
𴂉
34089
𴂊
3408A
𴂋
3408B
𴂌
3408C
𴂍
3408D
𴂎
3408E
𴂏
3408F
80
90
𴂐
34090
𴂑
34091
𴂒
34092
𴂓
34093
𴂔
34094
𴂕
34095
𴂖
34096
𴂗
34097
𴂘
34098
𴂙
34099
𴂚
3409A
𴂛
3409B
𴂜
3409C
𴂝
3409D
𴂞
3409E
𴂟
3409F
90
A0
𴂠
340A0
𴂡
340A1
𴂢
340A2
𴂣
340A3
𴂤
340A4
𴂥
340A5
𴂦
340A6
𴂧
340A7
𴂨
340A8
𴂩
340A9
𴂪
340AA
𴂫
340AB
𴂬
340AC
𴂭
340AD
𴂮
340AE
𴂯
340AF
A0
B0
𴂰
340B0
𴂱
340B1
𴂲
340B2
𴂳
340B3
𴂴
340B4
𴂵
340B5
𴂶
340B6
𴂷
340B7
𴂸
340B8
𴂹
340B9
𴂺
340BA
𴂻
340BB
𴂼
340BC
𴂽
340BD
𴂾
340BE
𴂿
340BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]