International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A782

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򧂀
A7080
򧂁
A7081
򧂂
A7082
򧂃
A7083
򧂄
A7084
򧂅
A7085
򧂆
A7086
򧂇
A7087
򧂈
A7088
򧂉
A7089
򧂊
A708A
򧂋
A708B
򧂌
A708C
򧂍
A708D
򧂎
A708E
򧂏
A708F
80
90
򧂐
A7090
򧂑
A7091
򧂒
A7092
򧂓
A7093
򧂔
A7094
򧂕
A7095
򧂖
A7096
򧂗
A7097
򧂘
A7098
򧂙
A7099
򧂚
A709A
򧂛
A709B
򧂜
A709C
򧂝
A709D
򧂞
A709E
򧂟
A709F
90
A0
򧂠
A70A0
򧂡
A70A1
򧂢
A70A2
򧂣
A70A3
򧂤
A70A4
򧂥
A70A5
򧂦
A70A6
򧂧
A70A7
򧂨
A70A8
򧂩
A70A9
򧂪
A70AA
򧂫
A70AB
򧂬
A70AC
򧂭
A70AD
򧂮
A70AE
򧂯
A70AF
A0
B0
򧂰
A70B0
򧂱
A70B1
򧂲
A70B2
򧂳
A70B3
򧂴
A70B4
򧂵
A70B5
򧂶
A70B6
򧂷
A70B7
򧂸
A70B8
򧂹
A70B9
򧂺
A70BA
򧂻
A70BB
򧂼
A70BC
򧂽
A70BD
򧂾
A70BE
򧂿
A70BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]