Skip to content

Commit 25e6be2

Browse files
yangdongshengidryomov
authored andcommitted
rbd: cancel lock_dwork if the wait is interrupted
There is a warning message in my test with below steps: # rbd bench --io-type write --io-size 4K --io-threads 1 --io-pattern rand test & # sleep 5 # pkill -9 rbd # rbd map test & # sleep 5 # pkill rbd The reason is that the rbd_add_acquire_lock() is interruptable, that means, when we kill the waiting on ->acquire_wait, the lock_dwork could be still running. 1. do_rbd_add() 2. lock_dwork rbd_add_acquire_lock() - queue_delayed_work() lock_dwork queued - wait_for_completion_killable_timeout() <-- kill happen rbd_dev_image_unlock() <-- UNLOCKED now, nothing to do. rbd_dev_device_release() rbd_dev_image_release() - ... lock successed here - cancel_delayed_work_sync(&rbd_dev->lock_dwork) Then when we reach the rbd_dev_free(), WARN_ON is triggered because lock_state is not RBD_LOCK_STATE_UNLOCKED. To fix it, this commit make sure the lock_dwork was finished before calling rbd_dev_image_unlock(). On the other hand, this would not happend in do_rbd_remove(), because after rbd mapped, lock_dwork will only be queued for IO request, and request will continue unless lock_dwork finished. when we call rbd_dev_image_unlock() in do_rbd_remove(), all requests are done. That means, lock_state should not be locked again after rbd_dev_image_unlock(). [ Cancel lock_dwork in rbd_add_acquire_lock(), only if the wait is interrupted. ] Fixes: 637cd06 ("rbd: new exclusive lock wait/wake code") Signed-off-by: Dongsheng Yang <[email protected]> Reviewed-by: Ilya Dryomov <[email protected]> Signed-off-by: Ilya Dryomov <[email protected]>
1 parent 1d3f872 commit 25e6be2

File tree

1 file changed

+6
-3
lines changed

1 file changed

+6
-3
lines changed

drivers/block/rbd.c

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6639,10 +6639,13 @@ static int rbd_add_acquire_lock(struct rbd_device *rbd_dev)
66396639
queue_delayed_work(rbd_dev->task_wq, &rbd_dev->lock_dwork, 0);
66406640
ret = wait_for_completion_killable_timeout(&rbd_dev->acquire_wait,
66416641
ceph_timeout_jiffies(rbd_dev->opts->lock_timeout));
6642-
if (ret > 0)
6642+
if (ret > 0) {
66436643
ret = rbd_dev->acquire_err;
6644-
else if (!ret)
6645-
ret = -ETIMEDOUT;
6644+
} else {
6645+
cancel_delayed_work_sync(&rbd_dev->lock_dwork);
6646+
if (!ret)
6647+
ret = -ETIMEDOUT;
6648+
}
66466649

66476650
if (ret) {
66486651
rbd_warn(rbd_dev, "failed to acquire exclusive lock: %ld", ret);

0 commit comments

Comments
 (0)