It was noted in paper that your method was trained "approximately 30 minutes of finetuning on a single A100 GPU" I am curious about two questions: 1. GPU meomery: 40G or 80G 2. batchsize is 4 or any other number