Training on GPU becomes non-deterministic #11250
-
I usually set seed_everything() at the beginning of my script, but this does not always solve the problem. When I train models on CPU, training is deterministic; when I switch to GPU, it becomes non-deterministic. With a simple model, like a one-layer LSTM, it is deterministic on both CPU and GPU. But with a more complicated model like LSTM-FCN, it is deterministic on CPU but not on GPU. Can I get any help on debugging? I got my LSTM-FCN model from here (https://github.com/timeseriesAI/tsai/blob/main/tsai/models/RNN_FCN.py), and the LSTM model I tested was simply an nn.LSTM.
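For context, the plain-PyTorch knobs that control GPU determinism (independent of Lightning) look roughly like the sketch below; `seed_all` is a hypothetical helper name, but the `torch` calls in it are standard PyTorch APIs:

```python
import os
import random

import numpy as np
import torch


def seed_all(seed: int = 42) -> None:
    """Seed every RNG that can affect training (CPU and GPU) and
    force PyTorch to prefer deterministic kernels."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)  # also seeds all CUDA devices
    # Make cuDNN pick deterministic convolution/RNN implementations
    # instead of auto-tuning the fastest (possibly non-deterministic) one.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
    # Required by some CUDA ops (e.g. cuBLAS) in deterministic mode.
    os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
    # warn_only=True logs ops without a deterministic implementation
    # instead of raising, which is handy while debugging.
    torch.use_deterministic_algorithms(True, warn_only=True)


# Two identically seeded runs now start from identical random state.
seed_all(123)
a = torch.randn(3, 3)
seed_all(123)
b = torch.randn(3, 3)
assert torch.equal(a, b)
```

Seeding alone only fixes the initial state; the `cudnn` and `use_deterministic_algorithms` settings are what pin down the GPU kernels themselves.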
Replies: 2 comments 2 replies
-
Maybe setting Trainer(deterministic=True) might help?
-
Wow, it worked. This is amazing. I thought seed_everything() had done everything PyTorch Lightning could do. Is there any documentation for the Trainer(deterministic=True) you mentioned? I want to take a look.