It requires more VRAM when you give bigger beam size is that expected? #396
-
I have found out that bigger beam (10) size increases accuracy For example by default it spells as GPX, however I am saying JPEG and it correctly spells with beam size 10 : https://www.youtube.com/watch?v=_nKwisL8dTs However, with bigger beam size (20), i am starting to get not enough memory error on my 12 GB RTX 3060 Is this expected and why happening? Moreover, what other hyper parameters did you find that increases accuracy? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 12 replies
-
Yes this is expected, they mention it in some of these GitHub discussions that you can fit in smaller RAM by turning off beam_size and best_of (so that you'll be using the Greedy decoder instead of BeamSearch Decoder). Also, using FP16 will use less RAM than FP32. I posted some GDDR measurements for small beam search values in #391 where you can see that |
Beta Was this translation helpful? Give feedback.
Yes this is expected, they mention it in some of these GitHub discussions that you can fit in smaller RAM by turning off beam_size and best_of (so that you'll be using the Greedy decoder instead of BeamSearch Decoder). Also, using FP16 will use less RAM than FP32. I posted some GDDR measurements for small beam search values in #391 where you can see that
beam_size=7
is slightly larger memory requirement than the others, but not by much.