Hi, I noticed that in the tutorial, load_from is used to initialize the Swin UNETR encoder from self-supervised pre-trained weights, while for the test the full model weights are loaded with torch.load and load_state_dict.
I would like to know why it is necessary to use load_from rather than simply filtering out the mismatched keys and using load_state_dict. I ask because when I load a pre-trained model to continue training on the same task, the loss is higher than it should be and performance degrades; using load_from instead does not solve the problem either. I would appreciate it if someone could point out the difference between the two functions. Thanks.
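For concreteness, the manual alternative I have in mind looks roughly like the following sketch; the model configuration follows the tutorial's BTCV setup, and the checkpoint path is a placeholder:

```python
import torch
from monai.networks.nets import SwinUNETR

# configuration follows the tutorial's BTCV example; adjust to your task
# (img_size is deprecated in newer MONAI releases and can be omitted there)
model = SwinUNETR(img_size=(96, 96, 96), in_channels=1, out_channels=14, feature_size=48)

# "checkpoint.pth" is a placeholder for my own pre-trained weights
state_dict = torch.load("checkpoint.pth", map_location="cpu")
if "state_dict" in state_dict:
    state_dict = state_dict["state_dict"]

model_dict = model.state_dict()
# keep only entries whose key and tensor shape match the current model
filtered = {k: v for k, v in state_dict.items()
            if k in model_dict and v.shape == model_dict[k].shape}
model_dict.update(filtered)
model.load_state_dict(model_dict)
```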
Hi @Levishery, thanks for the question.
The first load_from is used to load the pre-trained weights from self-supervised learning before fine-tuning on the segmentation task. The pre-training checkpoint contains only the Swin Transformer, which serves as the encoder of Swin UNETR, so the load_from function matches keys only for the Swin UNETR encoder.
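As a rough sketch, assuming the checkpoint name and model configuration from the tutorial:

```python
import torch
from monai.networks.nets import SwinUNETR

# same configuration as in the tutorial's fine-tuning script
model = SwinUNETR(img_size=(96, 96, 96), in_channels=1, out_channels=14, feature_size=48)

# "model_swinvit.pt" is the self-supervised checkpoint from the tutorial;
# load_from copies only the keys belonging to the swinViT encoder and
# leaves the randomly initialized decoder untouched
ssl_weights = torch.load("./model_swinvit.pt")
model.load_from(weights=ssl_weights)
```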
Later, for the test, the weights of the entire Swin UNETR model need to be loaded, so the checkpoint is loaded directly with torch.load and passed to load_state_dict.
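Continuing the sketch above, the test-time path is the standard PyTorch load of a full-model checkpoint (the filename is a placeholder for whatever you saved during fine-tuning):

```python
# the whole network (encoder + decoder) was saved during fine-tuning,
# so every key matches and a strict load works
model.load_state_dict(torch.load("best_metric_model.pth"))
model.eval()
```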
For your case: if your pre-trained model was trained on the entire Swin UNETR, you can load its weights directly with torch.load and load_state_dict; if your pre-trained model covers only the encoder, load_from might help.
Thanks.