Hi, may I ask how much GPU memory is needed to train this model and reproduce the results reported in the paper? Is 24 GB of GPU memory enough?