FasterRCNN bug with grayscale input #10338
Unanswered
adeschemps asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 2 comments
It seems the model automatically converts the input to 3 channels during normalization (see below). This is not necessarily the desired behavior: one could instead copy the single channel three times explicitly, or change the architecture so that it accepts a single channel.
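A minimal sketch of how this implicit conversion can happen, assuming the normalization subtracts a 3-element per-channel mean and divides by a 3-element std with broadcasting (as far as I can tell, this is what the torchvision detection transform does). The ImageNet statistics and the image size below are placeholder assumptions, not values taken from the question:

```python
import torch

# Per-channel statistics for 3 channels (ImageNet values, assumed for illustration).
mean = torch.tensor([0.485, 0.456, 0.406])
std = torch.tensor([0.229, 0.224, 0.225])

# A grayscale image with a single channel: shape (C=1, H, W).
gray = torch.rand(1, 32, 32)

# Broadcasting (1, H, W) against (3, 1, 1) silently produces (3, H, W).
normalized = (gray - mean[:, None, None]) / std[:, None, None]

# Explicit alternative: replicate the channel first, then normalize.
replicated = gray.expand(3, -1, -1)
normalized_explicit = (replicated - mean[:, None, None]) / std[:, None, None]

print(normalized.shape)  # torch.Size([3, 32, 32])
print(torch.allclose(normalized, normalized_explicit))
```

Both paths give the same tensor, so the broadcast is equivalent to copying the channel three times and then normalizing each copy with a different mean and std.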
Dear @adeschemps, I believe you should open this issue on Bolts directly. Best,
The following code doesn't fail, even though I believe it should:
as print(model) returns:
showing that the first convolution of the backbone expects an input with 3 channels instead of 1. This is confirmed by https://github.com/pytorch/vision/blob/3300692c6e7c2023d2f2356a69ec22ca91e38790/torchvision/models/resnet.py#L323:
Because of this, I don't understand what the model is actually computing, which is very confusing. Any help would be much appreciated.