Skip to content
This repository was archived by the owner on Nov 3, 2023. It is now read-only.

Conversation

@dcfidalgo
Copy link

Fixes #253

In the end, I figured out, that when a scheduler terminates a trial, we do not tear down the corresponding workers, and the necessary resources are not available for the subsequent trials.

The proposed solution works for my use case but feels very hacky. Maybe someone with a deeper knowledge about the inner workings of ray tune can come up with a more elegant solution.

Let me know what you think!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Trials hang when using a scheduler

1 participant