Ability to Automatically Relaunch Jobs in LSF Environment #12278
Unanswered
amorehead
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello. I was reading through Lightning's documentation on the Slurm cluster environment, and I saw that it currently supports the ability to automatically requeue a running job after its time expires (e.g., SBATCH --signal=SIGUSR1@90). I was wondering if such a feature is also possible/available through the LSF cluster environment that Lightning currently supports.
Beta Was this translation helpful? Give feedback.
All reactions