International Components for Unicode


UTF-8


List of Converter Aliases
Internal converter name: UTF-8
All aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
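
The alias list above can be treated as a lookup table from any listed name to the internal converter name. The following is a minimal Python sketch, not ICU's implementation: the names `ALIASES`, `normalize`, and `canonical_name` are hypothetical, and the matching rule (ignore case and any non-alphanumeric characters) is a simplifying assumption; ICU's own name comparison has additional rules.

```python
# Aliases copied from the list above; all of them name the UTF-8 converter.
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Assumed rule: lower-case the name and keep only letters and digits."""
    return "".join(c for c in name.lower() if c.isalnum())


# Every normalized alias maps to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in ALIASES}


def canonical_name(name):
    """Return the internal converter name, or None if the alias is not listed."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("IBM_1208"))
print(canonical_name("latin-1"))
```

Under this assumed rule, `IBM_1208` and `ibm-1208` resolve to the same entry, while a name that is not in the list yields `None`.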


Codepage Layout

Currently showing the codepage starting with the bytes F38886

Rows give the high nibble and columns the low nibble of the final byte. Each cell shows the code point (hex) of the encoded character.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     C8180 C8181 C8182 C8183 C8184 C8185 C8186 C8187 C8188 C8189 C818A C818B C818C C818D C818E C818F
90     C8190 C8191 C8192 C8193 C8194 C8195 C8196 C8197 C8198 C8199 C819A C819B C819C C819D C819E C819F
A0     C81A0 C81A1 C81A2 C81A3 C81A4 C81A5 C81A6 C81A7 C81A8 C81A9 C81AA C81AB C81AC C81AD C81AE C81AF
B0     C81B0 C81B1 C81B2 C81B3 C81B4 C81B5 C81B6 C81B7 C81B8 C81B9 C81BA C81BB C81BC C81BD C81BE C81BF
C0-F0  (empty)
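
The layout follows from UTF-8's four-byte form: the lead bytes F3 88 86 fix the high bits of the code point, and the final byte (80 to BF) supplies the low six bits. A minimal Python sketch that reproduces the populated cells with the standard `bytes.decode` method; the helper name `codepage_cells` is hypothetical, and Python's decoder stands in for the ICU converter here.

```python
def codepage_cells(prefix):
    """Map each final byte 0x00-0xFF to the code point decoded from prefix + byte.

    Final bytes that do not complete a valid UTF-8 sequence are skipped,
    which corresponds to the empty rows in the layout.
    """
    cells = {}
    for last in range(0x100):
        try:
            text = (prefix + bytes([last])).decode("utf-8")
        except UnicodeDecodeError:
            continue  # not a valid sequence: empty cell
        if len(text) == 1:
            cells[last] = ord(text)
    return cells


cells = codepage_cells(bytes.fromhex("F38886"))
print("final bytes: %02X..%02X" % (min(cells), max(cells)))
print("code points: %X..%X" % (cells[0x80], cells[0xBF]))
```

Only the 64 continuation bytes 80 to BF complete a valid sequence, so rows 00 to 70 and C0 to F0 stay empty.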

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
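
The byte counts above are per UChar, which is ICU's 16-bit UTF-16 code unit, so the maximum is 3 rather than 4: a supplementary code point takes four UTF-8 bytes but occupies two UChars. The sketch below checks that arithmetic and the substitution bytes using Python's built-in codecs. The helper name `bytes_per_utf16_unit` is hypothetical, and the `errors="replace"` behavior shown is Python's, used only as an illustration of substituting U+FFFD.

```python
def bytes_per_utf16_unit(ch):
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII, Latin-1 range, BMP, and a supplementary-plane code point.
samples = ["A", "\u00e9", "\u20ac", "\U000C8180"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)

# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
substitution = "\ufffd".encode("utf-8")
print(substitution)

# Python's decoder with errors="replace" turns an invalid byte into U+FFFD.
replaced = b"\xff".decode("utf-8", errors="replace")
print(replaced == "\ufffd")
```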

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
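
The set above excludes only the surrogate range D800 to DFFF; every other code point up to 10FFFF is representable. A minimal Python sketch of that membership test, cross-checked against Python's own strict UTF-8 encoder (the function names are hypothetical):

```python
def is_representable(code_point):
    """True if the code point is a valid Unicode value outside D800..DFFF."""
    in_range = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_range and not is_surrogate


def python_can_encode(code_point):
    """Cross-check: Python's strict UTF-8 encoder rejects lone surrogates."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC8180, 0x10FFFF):
    print("%06X" % cp, is_representable(cp), python_can_encode(cp))
```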