International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
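
The alias list above can be read as a many-to-one mapping onto the internal converter name. The following is a minimal sketch of such a lookup, not ICU's own implementation: the names `UTF8_ALIASES`, `ALIAS_TO_CANONICAL`, and `canonical_name` are hypothetical, and matching by lower-casing alone is a simplifying assumption.

```python
# Alias list copied from the table above; "UTF-8" is the internal converter name.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Hypothetical lookup table: lower-cased alias -> internal converter name.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    # Assumption: case-insensitive comparison only; no other normalization.
    return ALIAS_TO_CANONICAL.get(name.strip().lower())


print(canonical_name("CP1208"))         # UTF-8
print(canonical_name("windows-65001"))  # UTF-8
print(canonical_name("latin1"))         # None
```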


Codepage Layout

Currently showing the codepage starting with the bytes F38C86

Cells show the code point in hexadecimal. Rows 00-70 and C0-F0 are empty.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80   CC180 CC181 CC182 CC183 CC184 CC185 CC186 CC187 CC188 CC189 CC18A CC18B CC18C CC18D CC18E CC18F
90   CC190 CC191 CC192 CC193 CC194 CC195 CC196 CC197 CC198 CC199 CC19A CC19B CC19C CC19D CC19E CC19F
A0   CC1A0 CC1A1 CC1A2 CC1A3 CC1A4 CC1A5 CC1A6 CC1A7 CC1A8 CC1A9 CC1AA CC1AB CC1AC CC1AD CC1AE CC1AF
B0   CC1B0 CC1B1 CC1B2 CC1B3 CC1B4 CC1B5 CC1B6 CC1B7 CC1B8 CC1B9 CC1BA CC1BB CC1BC CC1BD CC1BE CC1BF
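
The layout above pairs the three-byte prefix F3 8C 86 with a fourth byte in the range 80-BF. As a rough illustration of how those four bytes map to the code points in the grid, the sketch below decodes them with Python's built-in UTF-8 codec and again by hand. The helper names `decode_cell` and `decode_by_hand` are hypothetical and are not part of ICU.

```python
def decode_cell(prefix_hex, trail_byte):
    """Decode the 3-byte prefix plus one trailing byte and return the code point."""
    data = bytes.fromhex(prefix_hex) + bytes([trail_byte])
    return ord(data.decode("utf-8"))


def decode_by_hand(data):
    """Unpack a 4-byte UTF-8 sequence manually.

    The lead byte contributes its low 3 bits and each of the three
    continuation bytes contributes its low 6 bits (21 bits in total).
    """
    b0, b1, b2, b3 = data
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


first = decode_cell("F38C86", 0x80)  # top-left populated cell (row 80, column 00)
last = decode_cell("F38C86", 0xBF)   # bottom-right populated cell (row B0, column 0F)
print(f"U+{first:X} .. U+{last:X}")  # U+CC180 .. U+CC1BF
print(hex(decode_by_hand(bytes.fromhex("F38C8680"))))  # 0xcc180
```

Only bytes 80-BF are valid continuation bytes in UTF-8, which is consistent with the other rows of the grid being empty.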

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
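
The byte counts above are given per UChar, which in ICU is a 16-bit UTF-16 code unit. The sketch below, which uses Python's standard codecs rather than ICU, shows why the maximum is 3 and not 4: a four-byte UTF-8 sequence corresponds to two UTF-16 code units. It also shows that the substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD. The helper name `bytes_per_utf16_unit` is hypothetical, and Python's `errors="replace"` is used only as an analogue of substitution, not as a description of ICU's behavior.

```python
def bytes_per_utf16_unit(ch):
    """Return UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


samples = ["A", "\u00e9", "\u20ac", "\U000CC180"]
ratios = [bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# U+FFFD REPLACEMENT CHARACTER encodes to the substitution bytes listed above.
sub_bytes = "\ufffd".encode("utf-8")
print(sub_bytes)  # b'\xef\xbf\xbd'

# Python analogue: an invalid byte is replaced by U+FFFD when decoding.
replaced = b"abc\xff".decode("utf-8", errors="replace")
print(replaced == "abc\ufffd")  # True
```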

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
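
The set above excludes only the surrogate range U+D800 to U+DFFF. As a small check of that boundary, the sketch below compares a direct range test against what Python's strict UTF-8 encoder accepts. Both function names are hypothetical, and Python's codec stands in for the ICU converter here.

```python
def is_representable(cp):
    """Range test matching the set [^\\uD800-\\uDFFF] over valid code points."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def can_encode(cp):
    """Return True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict UTF-8 codec.
        return False


# Code points on either side of the surrogate block, plus the last code point.
boundaries = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
results = [(hex(cp), is_representable(cp), can_encode(cp)) for cp in boundaries]
for row in results:
    print(row)
```

For each boundary value the two checks agree: the code points just outside the surrogate block encode, and the two surrogate endpoints do not.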