Skip to content
Discussion options

You must be logged in to vote

Hi @lanslotttTT. It isn't really a question of papers particularly; neural networks are made out of common building blocks whose memory requirements can be concretely understood; I'll can make a few observations that may help.

One important factor is whether your network is purely convolutional or not. Take a UNet vs. a VAE, for example:

  • The UNet is purely convolutional. If you increase the volume of a patch by the ratio that you decrease the batch size, it should require an identical amount of memory
  • The VAE has a number of linear layers that take the lowest spatial resolution and "de-cohere" it to a latent space that is then compressed, and then perform the opposite operation for the d…

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@atbenmurray
Comment options

Answer selected by lanslotttTT
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants