International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
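
The alias list above can be read as a many-to-one mapping from alias to internal converter name. The following is a minimal sketch of such a lookup in Python. The names `normalize` and `canonical_name` are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption for illustration, not a statement of how ICU itself compares names.

```python
from typing import Optional

# Aliases copied from the list above.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

def normalize(name: str) -> str:
    """Lower-case the name and drop '-', '_' and spaces (assumed rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")

# Every normalized alias points at the single internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}

def canonical_name(name: str) -> Optional[str]:
    """Return the internal converter name, or None for an unknown alias."""
    return ALIAS_TO_CANONICAL.get(normalize(name))

print(canonical_name("IBM-1208"))   # UTF-8
print(canonical_name("x_utf_8j"))   # UTF-8
print(canonical_name("latin-1"))    # None
```

A dictionary keyed on the normalized form keeps the lookup constant-time and makes spelling variants of the same alias resolve identically.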


Codepage Layout

Currently showing the codepage starting with the bytes F48B86

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hexadecimal) for the complete four-byte sequence. Rows 00-70 and C0-F0 are empty. The code points shown lie in Supplementary Private Use Area-B, so they have no standard glyph.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  10B180 10B181 10B182 10B183 10B184 10B185 10B186 10B187 10B188 10B189 10B18A 10B18B 10B18C 10B18D 10B18E 10B18F
90  10B190 10B191 10B192 10B193 10B194 10B195 10B196 10B197 10B198 10B199 10B19A 10B19B 10B19C 10B19D 10B19E 10B19F
A0  10B1A0 10B1A1 10B1A2 10B1A3 10B1A4 10B1A5 10B1A6 10B1A7 10B1A8 10B1A9 10B1AA 10B1AB 10B1AC 10B1AD 10B1AE 10B1AF
B0  10B1B0 10B1B1 10B1B2 10B1B3 10B1B4 10B1B5 10B1B6 10B1B7 10B1B8 10B1B9 10B1BA 10B1BB 10B1BC 10B1BD 10B1BE 10B1BF
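
The layout above can be reproduced by appending each trail byte from 80 to BF to the lead bytes F4 8B 86 and decoding the result. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and cross-checks it against a hand-written bit assembly. `decode_by_hand` is a hypothetical helper that performs no validation.

```python
LEAD = bytes([0xF4, 0x8B, 0x86])  # lead bytes shown on this page

def decode_by_hand(seq: bytes) -> int:
    """Assemble a code point from a 4-byte UTF-8 sequence (no validation)."""
    b0, b1, b2, b3 = seq
    return (((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12)
            | ((b2 & 0x3F) << 6) | (b3 & 0x3F))

# Map each valid trail byte (80..BF) to the code point it completes.
layout = {}
for trail in range(0x80, 0xC0):
    seq = LEAD + bytes([trail])
    code_point = ord(seq.decode("utf-8"))
    assert code_point == decode_by_hand(seq)
    layout[trail] = code_point

print(f"{layout[0x80]:X}")  # 10B180
print(f"{layout[0xBF]:X}")  # 10B1BF
```

Only bytes 80 through BF can continue a multi-byte UTF-8 sequence, which is why the other rows of the layout are empty.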

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
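
Several of these properties can be checked by hand. The sketch below uses Python's built-in codecs, not ICU, and assumes that a UChar is one UTF-16 code unit. `bytes_per_uchar` is a hypothetical helper introduced for illustration.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 bytes divided by UTF-16 code units for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units

print(bytes_per_uchar("A"))            # 1.0 (minimum: ASCII)
print(bytes_per_uchar("\u20AC"))       # 3.0 (maximum: one UChar, three bytes)
print(bytes_per_uchar("\U0010B180"))   # 2.0 (four bytes over a surrogate pair)

# The substitution character is U+FFFD encoded as UTF-8.
print("\uFFFD".encode("utf-8"))        # b'\xef\xbf\xbd'

# ASCII compatibility: bytes 20..7E decode to the same code points.
ascii_compatible = all(
    bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F)
)
print(ascii_compatible)                # True
```

A supplementary character takes four UTF-8 bytes but occupies two UChars, so the per-UChar maximum stays at three.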

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
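
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough illustration, Python's strict UTF-8 encoder enforces the same exclusion. `representable` is a hypothetical helper, and the check reflects Python's codec rather than ICU's.

```python
def representable(code_point: int) -> bool:
    """True if the code point can be encoded by a strict UTF-8 encoder."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False

# Probe the boundaries of the excluded surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {representable(cp)}")
# U+D7FF: True
# U+D800: False
# U+DFFF: False
# U+E000: True
# U+10FFFF: True
```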