ddp_sharded is not saving memory as expected #7624
Unanswered
jinggaizi asked this question in DDP / multi-GPU / multi-node

I just ran the code from the README and compared

trainer = pl.Trainer(gpus=8, accelerator="ddp")
trainer = pl.Trainer(gpus=8, accelerator="ddp", plugins='ddp_sharded')

but the memory used is the same. I'm running torch 1.7.1 on a Titan Xp. Is there a hardware requirement for this (such as a V100), or is it only effective for models with a huge number of parameters?

cc: @SeanNaren
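To make the comparison concrete, it can help to log the peak CUDA memory of each run instead of watching nvidia-smi. Below is a minimal sketch of such a callback; the class name and print format are illustrative, not part of the thread:

```python
import torch
import pytorch_lightning as pl


class PeakMemoryLogger(pl.Callback):
    """Print the peak CUDA memory allocated on each rank when training ends."""

    def on_train_end(self, trainer, pl_module):
        if torch.cuda.is_available():
            peak_mib = torch.cuda.max_memory_allocated(pl_module.device) / 2**20
            print(f"rank {trainer.global_rank}: peak allocated {peak_mib:.0f} MiB")


# Run once per configuration and compare the printed peaks:
trainer = pl.Trainer(gpus=8, accelerator="ddp", callbacks=[PeakMemoryLogger()])
# trainer = pl.Trainer(gpus=8, accelerator="ddp", plugins="ddp_sharded",
#                      callbacks=[PeakMemoryLogger()])
```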
Replies: 2 comments
-
Could you give more information about what you're running? There is definitely an improvement in memory at larger scales; we've seen improvements with 250M+ params, as shown here: https://share.streamlit.io/seannaren/mingpt/streamlit/app.py
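For rough intuition (my numbers, not from the thread): sharded training mainly partitions the optimizer state (and optionally gradients) across ranks, so the achievable saving grows with parameter count rather than with activation memory. A back-of-envelope estimate, assuming fp32 Adam keeps about 8 bytes of optimizer state per parameter:

```python
def sharded_state_saving_gib(num_params: float, world_size: int,
                             state_bytes_per_param: int = 8) -> float:
    """Approximate per-GPU memory saved by sharding the optimizer state.

    Assumes fp32 Adam (~8 bytes of exp_avg + exp_avg_sq per parameter);
    parameters, gradients, and activations are left out of the estimate.
    """
    full_state = num_params * state_bytes_per_param
    return (full_state - full_state / world_size) / 2**30


print(sharded_state_saving_gib(30e6, 8))    # ~0.2 GiB for a 30M-param model
print(sharded_state_saving_gib(250e6, 8))   # ~1.6 GiB for a 250M-param model
```

At 30M parameters the theoretical saving is a couple of hundred MiB on 8 GPUs, which can easily vanish behind activation memory and allocator rounding; at 250M+ it is on the order of gigabytes, which matches the scales quoted above.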
-
Thanks for your quick reply. I'm training an ASR model with only about 30M parameters; I will try a larger model.
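As a quick sanity check (an illustrative helper, not from the thread), the parameter count of the LightningModule can be confirmed directly:

```python
import torch.nn as nn


def count_params_millions(model: nn.Module) -> float:
    """Return the number of trainable parameters in millions."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad) / 1e6
```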