Skip to content

Commit 96a9fe6

Browse files
Muchun Songaxboe
authored andcommitted
block: fix ordering between checking BLK_MQ_S_STOPPED request adding
Supposing first scenario with a virtio_blk driver. CPU0 CPU1 blk_mq_try_issue_directly() __blk_mq_issue_directly() q->mq_ops->queue_rq() virtio_queue_rq() blk_mq_stop_hw_queue() virtblk_done() blk_mq_request_bypass_insert() 1) store blk_mq_start_stopped_hw_queue() clear_bit(BLK_MQ_S_STOPPED) 3) store blk_mq_run_hw_queue() if (!blk_mq_hctx_has_pending()) 4) load return blk_mq_sched_dispatch_requests() blk_mq_run_hw_queue() if (!blk_mq_hctx_has_pending()) return blk_mq_sched_dispatch_requests() if (blk_mq_hctx_stopped()) 2) load return __blk_mq_sched_dispatch_requests() Supposing another scenario. CPU0 CPU1 blk_mq_requeue_work() blk_mq_insert_request() 1) store virtblk_done() blk_mq_start_stopped_hw_queue() blk_mq_run_hw_queues() clear_bit(BLK_MQ_S_STOPPED) 3) store blk_mq_run_hw_queue() if (!blk_mq_hctx_has_pending()) 4) load return blk_mq_sched_dispatch_requests() if (blk_mq_hctx_stopped()) 2) load continue blk_mq_run_hw_queue() Both scenarios are similar, the full memory barrier should be inserted between 1) and 2), as well as between 3) and 4) to make sure that either CPU0 sees BLK_MQ_S_STOPPED is cleared or CPU1 sees dispatch list. Otherwise, either CPU will not rerun the hardware queue causing starvation of the request. The easy way to fix it is to add the essential full memory barrier into helper of blk_mq_hctx_stopped(). In order to not affect the fast path (hardware queue is not stopped most of the time), we only insert the barrier into the slow path. Actually, only slow path needs to care about missing of dispatching the request to the low-level device driver. Fixes: 320ae51 ("blk-mq: new multi-queue block IO queueing mechanism") Cc: [email protected] Cc: Muchun Song <[email protected]> Signed-off-by: Muchun Song <[email protected]> Reviewed-by: Ming Lei <[email protected]> Link: https://lore.kernel.org/r/[email protected] Signed-off-by: Jens Axboe <[email protected]>
1 parent 6bda857 commit 96a9fe6

File tree

2 files changed

+19
-0
lines changed

2 files changed

+19
-0
lines changed

block/blk-mq.c

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2438,6 +2438,12 @@ void blk_mq_start_stopped_hw_queue(struct blk_mq_hw_ctx *hctx, bool async)
24382438
return;
24392439

24402440
clear_bit(BLK_MQ_S_STOPPED, &hctx->state);
2441+
/*
2442+
* Pairs with the smp_mb() in blk_mq_hctx_stopped() to order the
2443+
* clearing of BLK_MQ_S_STOPPED above and the checking of dispatch
2444+
* list in the subsequent routine.
2445+
*/
2446+
smp_mb__after_atomic();
24412447
blk_mq_run_hw_queue(hctx, async);
24422448
}
24432449
EXPORT_SYMBOL_GPL(blk_mq_start_stopped_hw_queue);

block/blk-mq.h

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -230,6 +230,19 @@ static inline struct blk_mq_tags *blk_mq_tags_from_data(struct blk_mq_alloc_data
230230

231231
static inline bool blk_mq_hctx_stopped(struct blk_mq_hw_ctx *hctx)
232232
{
233+
/* Fast path: hardware queue is not stopped most of the time. */
234+
if (likely(!test_bit(BLK_MQ_S_STOPPED, &hctx->state)))
235+
return false;
236+
237+
/*
238+
* This barrier is used to order adding of dispatch list before and
239+
* the test of BLK_MQ_S_STOPPED below. Pairs with the memory barrier
240+
* in blk_mq_start_stopped_hw_queue() so that dispatch code could
241+
* either see BLK_MQ_S_STOPPED is cleared or dispatch list is not
242+
* empty to avoid missing dispatching requests.
243+
*/
244+
smp_mb();
245+
233246
return test_bit(BLK_MQ_S_STOPPED, &hctx->state);
234247
}
235248

0 commit comments

Comments
 (0)