How to Clear CUDA cache for loaded models? #1371
-
Is there a convenient way to clear CUDA memory when you load a model? I can run it once, but if I rerun the command, it seemingly doesn't replace the model in memory; it tries to add on.

```python
import whisper

model_size = "large-v2"  # options: "tiny", "base", "small", "medium", "large-v1", "large-v2", "large"
model = whisper.load_model(model_size)
```

My thought was to try emptying the cache, but that doesn't seem to make a difference:

```python
import torch

torch.cuda.empty_cache()
torch.cuda.mem_get_info()
```
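(Editor's note: rerunning `load_model` allocates a second copy because the first is still referenced, and `torch.cuda.empty_cache()` only returns *cached* blocks to the driver, not memory held by live tensors. A minimal sketch of the reference-dropping step, using a hypothetical stand-in object since no GPU is assumed here:)

```python
import gc
import weakref

class FakeModel:
    """Hypothetical stand-in for a loaded Whisper model (no GPU needed)."""
    pass

model = FakeModel()
ref = weakref.ref(model)  # lets us observe when the object is truly gone

# The crucial step: drop every Python reference *before* clearing any cache;
# empty_cache() cannot release memory that a live model object still owns.
del model
gc.collect()

print(ref() is None)  # True once no references remain
```

With a real model, follow the `del` / `gc.collect()` with `torch.cuda.empty_cache()` so the freed blocks are actually returned to the GPU.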
Replies: 4 comments 9 replies
-
Hey, have you tried rerunning the code after emptying the cache, ignoring the `mem_get_info()` call?
-
Thanks @Wholesomebruh, yes, that is precisely my confusion: I only get that error when I rerun `load_model`, as if it's not overwriting the model in CUDA memory or clearing the cache. It's strange!
-
Why do you want to reload the model?
-
Maybe I'm just unclear on how much CUDA memory is really needed for the large+ models. I see the required VRAM referenced here, but no clear mention of CUDA memory: https://github.com/openai/whisper#available-models-and-languages
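(Editor's note: the VRAM figures in that table are the CUDA memory requirement. A guarded sketch for checking free GPU memory before loading; the guard and message strings are illustrative, and `mem_get_info()` returns `(free_bytes, total_bytes)` for the current device:)

```python
# Check how much GPU memory is free before loading a large model.
try:
    import torch
    cuda_ok = torch.cuda.is_available()
except ImportError:
    cuda_ok = False  # torch not installed in this environment

if cuda_ok:
    free_b, total_b = torch.cuda.mem_get_info()
    msg = f"free {free_b / 2**30:.1f} GiB of {total_b / 2**30:.1f} GiB"
else:
    msg = "no usable CUDA device; the model would load on CPU"

print(msg)
```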
in that case, follow #1313