-
The default hidden size of monai.networks.nets.ViT is currently 768. In my opinion, the ViT developers chose 768 because they used a patch size of 16 and 3-channel patches, so one flattened patch has 16 × 16 × 3 = 768 values. So I think the ViT hidden size should be derived from the patch size and the number of channels in the same way. Is this right?
Replies: 2 comments 1 reply
-
I think it's a free parameter of the model, and 4096 is a logical choice for single-channel 3D input... (cc @ahatamiz @finalelement)
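For reference, here is a minimal sketch of the arithmetic behind the two numbers discussed so far. The helper `flattened_patch_dim` is a hypothetical name used only for illustration and is not part of MONAI; it assumes cubic (or square) patches with the same size along every spatial axis.

```python
def flattened_patch_dim(patch_size: int, spatial_dims: int, in_channels: int) -> int:
    """Number of values in one flattened patch.

    Hypothetical helper for illustration only (not a MONAI function).
    Assumes the patch has the same size along every spatial axis.
    """
    return patch_size ** spatial_dims * in_channels


# 2D RGB image, 16 x 16 patches, 3 channels
dim_2d_rgb = flattened_patch_dim(patch_size=16, spatial_dims=2, in_channels=3)

# 3D single-channel volume, 16 x 16 x 16 patches, 1 channel
dim_3d_single = flattened_patch_dim(patch_size=16, spatial_dims=3, in_channels=1)

print(dim_2d_rgb, dim_3d_single)  # 768 4096
```

Matching the hidden size to this value means the patch embedding neither compresses nor expands the raw patch, but nothing in the architecture requires it, which is why it can be treated as a free parameter.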
-
Hi @victolee0, thanks for the comment. Yes, theoretically a hidden size of 4096 is desirable, but I kept it at 768 to manage the computational requirements. In addition, my experiments with larger hidden sizes (e.g. 1536) did not show significant improvement, hence 768 seemed an optimal choice. Best
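To give a rough feel for the computational cost mentioned above, here is a back-of-the-envelope sketch of how the weight count of one transformer block grows with the hidden size. It is an estimate under stated assumptions, not a measurement of the MONAI implementation: `block_weight_count` is a hypothetical helper, it counts only the four attention projection matrices and the two MLP matrices (no biases, layer norms, or embeddings), and it assumes the MLP width is 4 times the hidden size unless given explicitly.

```python
from typing import Optional


def block_weight_count(hidden_size: int, mlp_dim: Optional[int] = None) -> int:
    """Approximate weight count of one transformer block.

    Hypothetical helper for illustration only. Counts the query, key, value
    and output projections plus the two MLP matrices; biases and layer norms
    are ignored. Assumes mlp_dim = 4 * hidden_size when not provided.
    """
    if mlp_dim is None:
        mlp_dim = 4 * hidden_size
    attention = 4 * hidden_size * hidden_size  # Q, K, V and output projections
    mlp = 2 * hidden_size * mlp_dim            # expand and contract layers
    return attention + mlp


baseline = block_weight_count(768)
for hidden_size in (768, 1536, 4096):
    count = block_weight_count(hidden_size)
    print(f"hidden_size={hidden_size}: {count:,} weights per block "
          f"({count / baseline:.1f}x the 768 baseline)")
```

Under these assumptions the per-block weight count grows with the square of the hidden size, so going from 768 to 4096 multiplies it by roughly 28.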