International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39386

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󓆀
D3180
󓆁
D3181
󓆂
D3182
󓆃
D3183
󓆄
D3184
󓆅
D3185
󓆆
D3186
󓆇
D3187
󓆈
D3188
󓆉
D3189
󓆊
D318A
󓆋
D318B
󓆌
D318C
󓆍
D318D
󓆎
D318E
󓆏
D318F
80
90
󓆐
D3190
󓆑
D3191
󓆒
D3192
󓆓
D3193
󓆔
D3194
󓆕
D3195
󓆖
D3196
󓆗
D3197
󓆘
D3198
󓆙
D3199
󓆚
D319A
󓆛
D319B
󓆜
D319C
󓆝
D319D
󓆞
D319E
󓆟
D319F
90
A0
󓆠
D31A0
󓆡
D31A1
󓆢
D31A2
󓆣
D31A3
󓆤
D31A4
󓆥
D31A5
󓆦
D31A6
󓆧
D31A7
󓆨
D31A8
󓆩
D31A9
󓆪
D31AA
󓆫
D31AB
󓆬
D31AC
󓆭
D31AD
󓆮
D31AE
󓆯
D31AF
A0
B0
󓆰
D31B0
󓆱
D31B1
󓆲
D31B2
󓆳
D31B3
󓆴
D31B4
󓆵
D31B5
󓆶
D31B6
󓆷
D31B7
󓆸
D31B8
󓆹
D31B9
󓆺
D31BA
󓆻
D31BB
󓆼
D31BC
󓆽
D31BD
󓆾
D31BE
󓆿
D31BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]