Hi there,
Thank you for sharing your excellent work!
I am having trouble understanding how the loss is calculated during pretraining. Could you help clarify some details, please?
So far, I have noticed that both the frame loss (models/EAT_pretraining.py#L705) and the utterance/CLS loss (models/EAT_pretraining.py#L659) use MSE without reduction (loss = F.mse_loss(x, y, reduction="none")), so each one is still a per-element tensor rather than a scalar.
The paper states that the two losses are summed. Before that can happen, each one has to be reduced to a scalar (e.g., by sum or mean). Where in the code are the reduction and the computation of the total loss performed?
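To make the question concrete, here is a minimal sketch of what I would expect to find somewhere in the training code. The function name total_loss, the tensor shapes, and the choice between mean and sum are my own assumptions for illustration; none of them are taken from the repository.

```python
import torch
import torch.nn.functional as F


def total_loss(frame_pred, frame_target, utt_pred, utt_target, reduce="mean"):
    """Hypothetical helper: reduce two unreduced MSE losses, then add them."""
    # Same call as in the repository: keeps one loss value per element.
    frame_loss = F.mse_loss(frame_pred, frame_target, reduction="none")
    utt_loss = F.mse_loss(utt_pred, utt_target, reduction="none")

    # Assumed reduction step: collapse each tensor to a scalar before summing.
    if reduce == "mean":
        return frame_loss.mean() + utt_loss.mean()
    if reduce == "sum":
        return frame_loss.sum() + utt_loss.sum()
    raise ValueError(f"unknown reduction: {reduce}")


# Toy shapes (assumed): frame-level (batch, frames, dim), utterance-level (batch, dim).
frame_pred, frame_target = torch.zeros(2, 3, 4), torch.ones(2, 3, 4)
utt_pred, utt_target = torch.zeros(2, 4), torch.full((2, 4), 2.0)
loss = total_loss(frame_pred, frame_target, utt_pred, utt_target)
```

Which of these two reductions (if either) is used matters, because with sum the frame loss would dominate simply by having more elements.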
Thank you!