Commit 63d4a4c

ShayAgros authored and davem330 committed
net: ena: Prevent reset after device destruction

The reset work is scheduled by the timer routine whenever it detects that a device reset is required (e.g. when a keep_alive signal is missing). When releasing device resources in ena_destroy_device() the driver cancels the scheduling of the timer routine without destroying the reset work explicitly.

This creates the following bug:
    The driver is suspended and the ena_suspend() function is called
    -> This function calls ena_destroy_device() to free the net device resources
    -> The driver waits for the timer routine to finish its execution and then cancels it, which prevents it from being called again.

    If, in its final execution, the timer routine schedules a reset, the reset routine might be called afterwards, and a redundant call to ena_restore_device() would be made.

By changing the reset routine we allow it to read the device's state accurately. This is achieved by checking whether the ENA_FLAG_TRIGGER_RESET flag is set before resetting the device, and by making sure that both the destruction function and the flag check run under the rtnl lock. The ENA_FLAG_TRIGGER_RESET flag is cleared at the end of the destruction routine. Also surround the flag check with 'likely' because we expect that the reset routine would be called only when the ENA_FLAG_TRIGGER_RESET flag is set.

The destruction of the timer and reset services in __ena_shutoff() has to stay, even though the timer routine is destroyed in ena_destroy_device(). This is to avoid a case in which the reset routine is scheduled after free_netdev() in __ena_shutoff(), which would create an access to freed memory in adapter->flags.

Fixes: 8c5c7ab ("net: ena: add power management ops to the ENA driver")
Signed-off-by: Shay Agroskin <[email protected]>
Signed-off-by: David S. Miller <[email protected]>

1 parent ad66411 commit 63d4a4c
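
The race and the fix described in the commit message can be modelled in user space. What follows is a minimal sketch and not driver code: `struct model_adapter`, `model_request_reset()`, `model_suspend()` and `model_reset_work()` are hypothetical names, a pthread mutex stands in for the rtnl lock, and a plain boolean stands in for ENA_FLAG_TRIGGER_RESET.

```c
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical user-space model; none of these names exist in the ENA driver. */
struct model_adapter {
	pthread_mutex_t lock;	/* stands in for the rtnl lock */
	bool trigger_reset;	/* stands in for ENA_FLAG_TRIGGER_RESET */
	int restore_calls;	/* counts simulated ena_restore_device() calls */
};

static void model_init(struct model_adapter *a)
{
	pthread_mutex_init(&a->lock, NULL);
	a->trigger_reset = false;
	a->restore_calls = 0;
}

/* What the timer routine does when it decides a reset is needed.
 * Simplification: the model takes the lock here, whereas the driver
 * sets a flag bit.
 */
static void model_request_reset(struct model_adapter *a)
{
	pthread_mutex_lock(&a->lock);
	a->trigger_reset = true;
	pthread_mutex_unlock(&a->lock);
}

/* Destruction routine; the caller must hold the lock.
 * The reset request is cleared at the end, as the commit message describes.
 */
static void model_destroy_locked(struct model_adapter *a)
{
	/* ... device resources would be released here ... */
	a->trigger_reset = false;
}

/* Suspend path: destroys the device under the lock. */
static void model_suspend(struct model_adapter *a)
{
	pthread_mutex_lock(&a->lock);
	model_destroy_locked(a);
	pthread_mutex_unlock(&a->lock);
}

/* Reset work: the flag is checked only after the lock is taken, so a
 * destruction that completed earlier is always observed.
 */
static void model_reset_work(struct model_adapter *a)
{
	pthread_mutex_lock(&a->lock);
	if (a->trigger_reset) {
		model_destroy_locked(a);
		a->restore_calls++;	/* simulated restore */
	}
	pthread_mutex_unlock(&a->lock);
}
```

In this model, calling `model_request_reset()`, then `model_suspend()`, then `model_reset_work()` leaves `restore_calls` at zero: the late reset work sees the cleared flag and does nothing, which is the behaviour the patch aims for.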

1 file changed: +10 -9 lines changed

drivers/net/ethernet/amazon/ena/ena_netdev.c

Lines changed: 10 additions & 9 deletions
@@ -3601,16 +3601,14 @@ static void ena_fw_reset_device(struct work_struct *work)
 {
 	struct ena_adapter *adapter =
 		container_of(work, struct ena_adapter, reset_task);
-	struct pci_dev *pdev = adapter->pdev;
 
-	if (unlikely(!test_bit(ENA_FLAG_TRIGGER_RESET, &adapter->flags))) {
-		dev_err(&pdev->dev,
-			"device reset schedule while reset bit is off\n");
-		return;
-	}
 	rtnl_lock();
-	ena_destroy_device(adapter, false);
-	ena_restore_device(adapter);
+
+	if (likely(test_bit(ENA_FLAG_TRIGGER_RESET, &adapter->flags))) {
+		ena_destroy_device(adapter, false);
+		ena_restore_device(adapter);
+	}
+
 	rtnl_unlock();
 }
 
@@ -4389,8 +4387,11 @@ static void __ena_shutoff(struct pci_dev *pdev, bool shutdown)
 		netdev->rx_cpu_rmap = NULL;
 	}
 #endif /* CONFIG_RFS_ACCEL */
-	del_timer_sync(&adapter->timer_service);
 
+	/* Make sure timer and reset routine won't be called after
+	 * freeing device resources.
+	 */
+	del_timer_sync(&adapter->timer_service);
 	cancel_work_sync(&adapter->reset_task);
 
 	rtnl_lock(); /* lock released inside the below if-else block */
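
The teardown order kept in __ena_shutoff() (stop the timer first, then the reset work, and only then free the device) can be illustrated with a small user-space analogue. This is a sketch under stated assumptions, not kernel code: every `model_*` name is hypothetical, two pthreads play the roles of the timer routine and the reset work, and joining a thread stands in for del_timer_sync() and cancel_work_sync().

```c
#include <pthread.h>
#include <sched.h>
#include <stdatomic.h>
#include <stdlib.h>

/* Hypothetical user-space analogue; these names do not exist in the driver. */
struct model_dev {
	atomic_int timer_stop;		/* asks the "timer" thread to exit */
	atomic_int work_stop;		/* asks the "reset work" thread to exit */
	atomic_int work_pending;	/* set by the timer to schedule the work */
	atomic_int work_runs;		/* how many times the work body ran */
	pthread_t timer;
	pthread_t worker;
};

/* Plays the timer routine: keeps scheduling the work until stopped. */
static void *model_timer(void *arg)
{
	struct model_dev *dev = arg;

	while (!atomic_load(&dev->timer_stop)) {
		atomic_store(&dev->work_pending, 1);
		sched_yield();
	}
	return NULL;
}

/* Plays the reset work: runs once per pending request and touches dev. */
static void *model_worker(void *arg)
{
	struct model_dev *dev = arg;

	while (!atomic_load(&dev->work_stop)) {
		if (atomic_exchange(&dev->work_pending, 0))
			atomic_fetch_add(&dev->work_runs, 1);
		else
			sched_yield();
	}
	return NULL;
}

static struct model_dev *model_probe(void)
{
	struct model_dev *dev = malloc(sizeof(*dev));

	if (!dev)
		return NULL;
	atomic_init(&dev->timer_stop, 0);
	atomic_init(&dev->work_stop, 0);
	atomic_init(&dev->work_pending, 0);
	atomic_init(&dev->work_runs, 0);
	if (pthread_create(&dev->worker, NULL, model_worker, dev)) {
		free(dev);
		return NULL;
	}
	if (pthread_create(&dev->timer, NULL, model_timer, dev)) {
		atomic_store(&dev->work_stop, 1);
		pthread_join(dev->worker, NULL);
		free(dev);
		return NULL;
	}
	return dev;
}

/* Teardown in the order the patch preserves. Returns the work run count. */
static int model_shutoff(struct model_dev *dev)
{
	int runs;

	/* 1. Stop the timer and wait for it: nothing can schedule work now. */
	atomic_store(&dev->timer_stop, 1);
	pthread_join(dev->timer, NULL);

	/* 2. Stop the work and wait for it: nothing can touch dev now. */
	atomic_store(&dev->work_stop, 1);
	pthread_join(dev->worker, NULL);

	/* 3. Only now is it safe to free the memory both threads used. */
	runs = atomic_load(&dev->work_runs);
	free(dev);
	return runs;
}
```

If step 2 were skipped, the worker thread could still read `dev` after `free()`, which is the user-space counterpart of the freed `adapter->flags` access the commit message warns about. Swapping steps 1 and 2 would leave a window in which the timer re-schedules work that has already been cancelled.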
