Dropout recommendation for segmentation using Swin UnetR #5945
-
Hi, I was wondering if you had any recommendations on which type of dropout to use with the Swin UnetR architecture for a segmentation task. This particular task has significantly more background pixels than foreground. I see that dropout can be applied to the attention ( Thank you. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Hi @tangy5, could you please help share some comments on this question? Thanks in advance! |
Beta Was this translation helpful? Give feedback.
-
@jpcenteno80 , thanks for the question. My suggestions are: as Drop_rate or dropout rate is to prevent network from overfitting, if your dataset is large, you probably want lower drop_rate, or set to 0. If your dataset is small you could set it higher, but not too high, such as 0.2. attn_drop_rate is to highlight the informative region for improving the recognition power of the model. 0 for the most discriminative region 1 on the contrary. Drop path also named Stochastic Depth which is a technique to “deactivate” some layers during training. If you are using a "small" model, drop path is preferred to 0 or 0.1, larger model can have larger drop path rate, such as 0.2. |
Beta Was this translation helpful? Give feedback.
@jpcenteno80 , thanks for the question. My suggestions are: as Drop_rate or dropout rate is to prevent network from overfitting, if your dataset is large, you probably want lower drop_rate, or set to 0. If your dataset is small you could set it higher, but not too high, such as 0.2. attn_drop_rate is to highlight the informative region for improving the recognition power of the model. 0 for the most discriminative region 1 on the contrary. Drop path also named Stochastic Depth which is a technique to “deactivate” some layers during training. If you are using a "small" model, drop path is preferred to 0 or 0.1, larger model can have larger drop path rate, such as 0.2.