DeepLabCE and ignore_label / ignore_index #3876

Giaco90 · 2022-01-11T15:10:51Z

Giaco90
Jan 11, 2022

Hello all,
reading the code of DeepLabCE, I found a possible problem in how the forward method is implemented.
The used criterion is CrossEntropyLoss and is set up with ignore_index parameter and reduction="none"; according to CrossEntropyLoss doc:

ignore_index (int, optional) – Specifies a target value that is ignored and does not contribute to the input gradient. When size_average is True, the loss is averaged over non-ignored targets.

then, ignore_index is used to:

modify the loss in order to ignore specific values (I think something like zeroing specific positions)
ignore values during the reduction (the doc says during size_average, but it has been deprecated so i think it does the same with reduction set to sum or mean).

The problem is that the forward of DeepLabCE applies the CrossEntropyLoss without the reduction and manually applies the mean after weighting and topk selection. Then, the first point is in someway applied, but the second is not.

Is this correct? Should the mean be applied considering the ignore_label / ignore_index ?

Thanks,
Giacomo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DeepLabCE and ignore_label / ignore_index #3876

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

DeepLabCE and ignore_label / ignore_index #3876

Uh oh!

Uh oh!

Giaco90 Jan 11, 2022

Replies: 0 comments

Giaco90
Jan 11, 2022