- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 33.2k
gh-125451: Fix deadlock in ProcessPoolExecutor shutdown #125492
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gh-125451: Fix deadlock in ProcessPoolExecutor shutdown #125492
Conversation
There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error when processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`.
| 🤖 New build scheduled with the buildbot fleet by @colesbury for commit 2efa056 🤖 If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again. | 
| 🤖 New build scheduled with the buildbot fleet by @colesbury for commit 2a30e4f 🤖 If you want to schedule another build, you need to add the 🔨 test-with-buildbots label again. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Near line 724, please remove the comment:
        # _shutdown_lock must be locked to access _ThreadWakeup (...)
Oh, I made commit a4dfe8e in 2020 to fix another race condition:
commit a4dfe8ede5a37576e17035dccfe109ba7752237e
Author: Victor Stinner <[email protected]>
Date:   Wed Apr 29 03:32:06 2020 +0200
    bpo-39995: Fix concurrent.futures _ThreadWakeup (GH-19760)
    
    Fix a race condition in concurrent.futures._ThreadWakeup: access to
    _ThreadWakeup is now protected with the shutdown lock.
| 
 The failure of these 2 workers is known: #125535 | 
| !buildbot AMD64 Fedora Stable Refleaks | 
| 🤖 New build scheduled with the buildbot fleet by @colesbury for commit 84db201 🤖 The command will test the builders whose names match following regular expression:  The builders matched are: 
 | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
I confirm that this change fix #125451 issue.
Using a lock per _ThreadWakeup, instead of reusing the shutdown lock, sounds like a good idea.
| Thanks @colesbury for the PR 🌮🎉.. I'm working now to backport this PR to: 3.12, 3.13. | 
…GH-125492) There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`. (cherry picked from commit 760872e) Co-authored-by: Sam Gross <[email protected]>
| Sorry, @colesbury, I could not cleanly backport this to   | 
| GH-125598 is a backport of this pull request to the 3.13 branch. | 
…ythonGH-125492) There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`. (cherry picked from commit 760872e) Co-authored-by: Sam Gross <[email protected]>
| GH-125599 is a backport of this pull request to the 3.12 branch. | 
…5492) (GH-125598) There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`. (cherry picked from commit 760872e) Co-authored-by: Sam Gross <[email protected]>
…5492) (#125599) There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`. (cherry picked from commit 760872e)
…#125492) There was a deadlock when `ProcessPoolExecutor` shuts down at the same time that a queueing thread handles an error processing a task. Don't use `_shutdown_lock` to protect the `_ThreadWakeup` pipes -- use an internal lock instead. This fixes the ordering deadlock where the `ExecutorManagerThread` holds the `_shutdown_lock` and joins the queueing thread, while the queueing thread is attempting to acquire the `_shutdown_lock` while closing the `_ThreadWakeup`.
There was a deadlock when
ProcessPoolExecutorshuts down at the same time that a queueing thread handles an error when processing a task. Don't use_shutdown_lockto protect the_ThreadWakeuppipes -- use an internal lock instead. This fixes the ordering deadlock where theExecutorManagerThreadholds the_shutdown_lockand joins the queueing thread, while the queueing thread is attempting to acquire the_shutdown_lockwhile closing the_ThreadWakeup.