Simple way to CPU-fork / parallelize a particular evaluation? #10664
Unanswered
turian
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
-
Hey @turian,
Right now, I would say you could skip the first training epochs to save some training time, but it won't scale well. With a hackier solution, you could run the validation loop aync in a thread/subprocess and wait to collect the result while the next training epoch starts. It would be blocking on the next validation call. This would require more granular knowledge but could provide some speedups. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
My GPU gradient steps and epochs are very fast. However, CPU evaluation is a real bottleneck.
How can I fork / parallelize off my CPU evaluation so it doesn't block training, and GPU training can proceed unimpeded?
My early stopping is based upon gnarly single-CPU code that I don't want to touch or modify. So I am trying to figure out how to increase the training time, while not blocking on CPU eval all the time.
Beta Was this translation helpful? Give feedback.
All reactions