Gradient Manipulation for Multitask Learning #12810
Unanswered · haldunbalim asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Hello,
I would like to implement the paper "Adapting Auxiliary Losses Using Gradient Similarity". I have a main loss and several auxiliary losses, and an auxiliary loss should only contribute to the update when its gradient has a cosine similarity above zero with the main-task gradient. Currently I compute the gradient of each loss with torch.autograd.grad, compute the cosine similarities, and then add only the selected auxiliary losses to the loss being optimized. However, since I don't know how to pass the already-computed gradients to the optimizer, I end up running backward twice per step. How can I implement this efficiently?
Thanks a lot.
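One way to avoid the double backward is to skip the second backward entirely: compute each loss's gradients once with torch.autograd.grad, gate the auxiliary gradients by cosine similarity, and write the combined gradient into each parameter's .grad before calling opt.step(). A minimal sketch below, assuming a toy linear model; the names gated_grad_step and cosine are illustrative helpers, not part of the Lightning or PyTorch API (in Lightning you would do this inside training_step with automatic_optimization = False):

```python
import torch

torch.manual_seed(0)
model = torch.nn.Linear(4, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

def cosine(gs_a, gs_b):
    # Cosine similarity between two gradient lists, flattened into vectors.
    a = torch.cat([g.reshape(-1) for g in gs_a])
    b = torch.cat([g.reshape(-1) for g in gs_b])
    return torch.dot(a, b) / (a.norm() * b.norm() + 1e-12)

def gated_grad_step(main_loss, aux_losses):
    params = [p for p in model.parameters() if p.requires_grad]
    # One autograd.grad call per loss -- no second backward pass over a summed loss.
    main_grads = torch.autograd.grad(main_loss, params, retain_graph=True)
    total = [g.clone() for g in main_grads]
    for aux in aux_losses:
        aux_grads = torch.autograd.grad(aux, params, retain_graph=True)
        # Keep only auxiliary gradients aligned with the main-task gradient.
        if cosine(main_grads, aux_grads) > 0:
            total = [t + g for t, g in zip(total, aux_grads)]
    opt.zero_grad()
    for p, g in zip(params, total):
        p.grad = g  # hand the combined gradient to the optimizer directly
    opt.step()

x = torch.randn(8, 4)
y = torch.randn(8, 1)
main = torch.nn.functional.mse_loss(model(x), y)
# Toy auxiliary losses: one perfectly aligned, one perfectly opposed.
gated_grad_step(main, [main * 2.0, -main])
```

Setting p.grad manually is what lets the optimizer consume gradients you have already computed, so each loss is differentiated exactly once per step.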