International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497,
             ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J,
             unicode-1-1-utf-8, unicode-2-0-utf-8
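
The alias list above can be treated as a lookup table from any alias to the internal converter name. The following is a minimal sketch only: the `normalize` and `resolve` helpers are hypothetical names introduced for illustration, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplified assumption, not ICU's actual alias-matching code.

```python
import re

# Aliases copied from the list above; all resolve to the same converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Hypothetical loose-matching rule: drop '-', '_', ' ' and lowercase."""
    return re.sub(r"[-_ ]", "", name).lower()

# Map each normalized alias to the internal converter name.
CANONICAL = {normalize(alias): "UTF-8" for alias in ALIASES}

def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return CANONICAL.get(normalize(name))

print(resolve("IBM_1208"))   # UTF-8
print(resolve("latin1"))     # None
```

A dictionary keyed on the normalized form keeps lookups cheap and makes spelling variants such as `utf8` and `UTF-8` land on the same entry.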


Codepage Layout

Currently showing the codepage starting with the bytes F3 87 82

Rows give the high nibble of the final byte and columns give its low nibble.
Each cell is the Unicode code point (hexadecimal) for that four-byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no entries)
80     C7080 C7081 C7082 C7083 C7084 C7085 C7086 C7087 C7088 C7089 C708A C708B C708C C708D C708E C708F
90     C7090 C7091 C7092 C7093 C7094 C7095 C7096 C7097 C7098 C7099 C709A C709B C709C C709D C709E C709F
A0     C70A0 C70A1 C70A2 C70A3 C70A4 C70A5 C70A6 C70A7 C70A8 C70A9 C70AA C70AB C70AC C70AD C70AE C70AF
B0     C70B0 C70B1 C70B2 C70B3 C70B4 C70B5 C70B6 C70B7 C70B8 C70B9 C70BA C70BB C70BC C70BD C70BE C70BF
C0-F0  (no entries)
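
The mapping from the lead bytes F3 87 82 plus one trail byte to the code points C7080 through C70BF follows from the UTF-8 bit layout. A minimal sketch of that arithmetic is shown below; `decode_utf8_4byte` is a hypothetical helper written for this illustration, and it deliberately skips the overlong and range checks a real decoder needs.

```python
def decode_utf8_4byte(seq: bytes) -> int:
    """Decode one four-byte UTF-8 sequence into a code point by hand."""
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("expected four bytes with a 11110xxx lead byte")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must have the form 10xxxxxx")
    cp = seq[0] & 0x07              # 3 payload bits from the lead byte
    for b in seq[1:]:
        cp = (cp << 6) | (b & 0x3F)  # 6 payload bits from each trail byte
    return cp

prefix = bytes.fromhex("F38782")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X}..{last:X}")  # C7080..C70BF

# Cross-check against Python's built-in UTF-8 decoder.
assert ord((prefix + b"\x80").decode("utf-8")) == first
```

Only trail bytes 80 through BF carry the `10xxxxxx` pattern, which is why the other rows of the layout are empty.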

Information About This Converter

Type of converter:                       UCNV_UTF8
Minimum number of bytes per UChar:       1
Maximum number of bytes per UChar:       3
Substitution character:                  \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?         TRUE
Is ASCII [\u0020-\u007E] ambiguous?      FALSE
Contains ambiguous aliases?              FALSE
Always generates Unicode NFC?            UNKNOWN
Contains BiDi characters?                TRUE
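
The byte counts and the substitution character above can be checked with a short sketch. It assumes that UChar means one 16-bit UTF-16 code unit, and it uses Python's built-in codecs rather than ICU; Python's `errors="replace"` handler is only an analogous mechanism, not ICU's substitution callback. The helper name `utf8_bytes_per_utf16_unit` is hypothetical.

```python
# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
SUBSTITUTION = b"\xEF\xBF\xBD"
assert SUBSTITUTION.decode("utf-8") == "\uFFFD"

def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

# ASCII, Latin-1 supplement, another BMP character, supplementary plane.
samples = ["A", "\u00E9", "\u20AC", "\U000C7080"]
ratios = [utf8_bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# An invalid byte decodes to U+FFFD when replacement is requested.
replaced = b"\xFF".decode("utf-8", errors="replace")
print(replaced == "\uFFFD")  # True
```

A supplementary character needs four UTF-8 bytes but also two UTF-16 code units, so the per-unit maximum of 3 is reached by characters in the Basic Multilingual Plane.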


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
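
The set `[^\uD800-\uDFFF]` says that every code point except the surrogate range is representable. As a rough check, the sketch below compares that rule with what Python's strict UTF-8 encoder accepts; both function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def in_converter_set(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over valid code points."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)

def utf8_can_encode(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(cp).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True

# Probe the edges of the surrogate range and the top of the code space.
boundaries = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
for cp in boundaries:
    print(f"U+{cp:04X}", in_converter_set(cp), utf8_can_encode(cp))
```

Surrogates are reserved for UTF-16 pairs, so a strict UTF-8 encoder rejects them as standalone code points.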