Interleaving training of several models #14229
Unanswered
turian asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 0 comments
I am a long-time Lightning fan, but in the past I have been bitten by how opinionated it is about training loops. I have a new project in pure torch, and I'm debating whether I can easily migrate it to Lightning. I have a related question here: #14228
I have a pretraining model which trains only on training data, minimizing a self-supervised loss.
I have a downstream model which uses the pretrained model (either frozen or fine-tuned), trains on dev data, and evaluates on test data.
I want to explore multiple training strategies, including simple pretrain-then-downstream and interleaved pretraining and downstream training. What are best practices for using Lightning in this setting?
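To make the strategies concrete, here is a minimal, framework-agnostic sketch of the interleaving schedule I have in mind. The names `interleaved_schedule`, `pretrain_epoch`, and `downstream_epoch` are placeholders I made up for illustration, not Lightning API; in Lightning each phase might become a separate `Trainer.fit(...)` call, or one `LightningModule` using manual optimization.

```python
from typing import Callable, Iterator, Tuple


def interleaved_schedule(
    n_pretrain: int,
    n_downstream: int,
    cycles: int,
) -> Iterator[Tuple[str, int]]:
    """Yield (phase, epoch) pairs: n_pretrain pretraining epochs,
    then n_downstream downstream epochs, repeated for `cycles` rounds.
    cycles=1 recovers the simple pretrain-then-downstream strategy."""
    epoch = 0
    for _ in range(cycles):
        for _ in range(n_pretrain):
            yield ("pretrain", epoch)
            epoch += 1
        for _ in range(n_downstream):
            yield ("downstream", epoch)
            epoch += 1


def run(
    schedule: Iterator[Tuple[str, int]],
    pretrain_epoch: Callable[[], None],
    downstream_epoch: Callable[[], None],
) -> None:
    # Dispatch each scheduled epoch to the corresponding training routine
    # (hypothetical callables standing in for the two models' epoch loops).
    for phase, _ in schedule:
        (pretrain_epoch if phase == "pretrain" else downstream_epoch)()


if __name__ == "__main__":
    log = []
    run(
        interleaved_schedule(n_pretrain=2, n_downstream=1, cycles=2),
        pretrain_epoch=lambda: log.append("p"),
        downstream_epoch=lambda: log.append("d"),
    )
    print("".join(log))  # ppdppd
```

The open question is whether driving Lightning from an outer loop like this (multiple short `fit` calls, restoring optimizer/checkpoint state between phases) is the intended pattern, or whether everything should live inside one module with manual optimization.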