Skip to content
Discussion options

You must be logged in to vote

Hi @hxhxhx88 , thanks for the question.
The 3D segmentation network design would like to follow 4 times downsample-upsample architecture like most segmentation networks did (such as UNet). 4 times downsample with encoder won't cause the model to be over-complicated or higher number of parameters compared to 5 encoders-decoders are designed). But there are 5 hidden output feature from the designed SwinUNETR, hidden_states_out[0][1][2][3][4], finally we decide to start from output feature of hidden_states_out[0] as the first encoded feature for Encoder2, followed by [1][2], [3] is skipped and [4] as the bottomneck feature for encoder10. The stage 4 is going to encoder10 which is the bottlen…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@hxhxhx88
Comment options

Answer selected by hxhxhx88
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants