Strange training/validation curves for Swin transformer SSL pre-training #4735
Hi all, I'm trying to reproduce the pre-trained Swin UNETR and would like to apply it to internal CT scans at my institution. I copied exactly the same code from this address, except for the dataset directory part. The validation image is also strange, and it seems the model is not learning useful features. May I ask for some advice on this? I'm still new to CT scan segmentation and a little confused about the situation. If you spot anything wrong, please let me know. I would appreciate any help from the community :)
Replies: 1 comment 1 reply
Hi Fengling410, I'm not sure about this issue. However, when the batch size is set to 1, the contrastive learning can cause problems that lead to unstable overall training. You could try increasing the batch size and see whether that stabilizes the training. Thanks.
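To illustrate why a batch size of 1 can be a problem for a contrastive term, here is a minimal pure-Python sketch of a generic NT-Xent-style loss. It is an assumption for illustration only and may differ from the loss used in the pre-training script; the function names `cosine` and `nt_xent` and the temperature value are hypothetical. With a single sample, each embedding has only its own positive pair to compare against, so the loss collapses to zero and carries no learning signal.

```python
import math


def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nt_xent(view1, view2, temperature=0.5):
    """Generic NT-Xent-style loss (illustrative, not the tutorial's code).

    view1[i] and view2[i] are embeddings of two augmentations of sample i.
    """
    n = len(view1)
    z = view1 + view2  # 2n embeddings in total
    total = 0.0
    for i in range(2 * n):
        pos = (i + n) % (2 * n)  # index of the other view of the same sample
        numerator = math.exp(cosine(z[i], z[pos]) / temperature)
        # Denominator sums over every embedding except the anchor itself.
        denominator = sum(
            math.exp(cosine(z[i], z[k]) / temperature)
            for k in range(2 * n)
            if k != i
        )
        total += -math.log(numerator / denominator)
    return total / (2 * n)


# Batch size 1: the only term in the denominator is the positive pair,
# so the ratio is 1 and the loss is 0 whatever the embeddings are.
loss_bs1 = nt_xent([[1.0, 0.0]], [[0.0, 1.0]])

# Batch size 2: the other sample provides negatives, so the loss is positive.
loss_bs2 = nt_xent([[1.0, 0.0], [0.0, 1.0]], [[0.9, 0.1], [0.1, 0.9]])

print(loss_bs1, loss_bs2)
```

In this sketch the contrastive term contributes nothing at batch size 1, which is one plausible reason a larger batch size might behave differently. Whether it explains the curves in this thread is not confirmed.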