Commit dbf75d9
initial device context at `args.device`
If user A uses GPUs 6 and 7 and user B uses GPU 0, and user B then kills every process on GPU 0, user A's training stops as well, because `torch.cuda.empty_cache()` by default initializes a CUDA context on GPU 0.
Reference: pytorch/pytorch#25752 (comment)
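
The changed line itself is not preserved on this page, so the following is only a minimal sketch of the kind of fix the message describes: clearing the cache inside a device context so that no CUDA context is created on GPU 0. The helper name `empty_cache_on` is hypothetical, and it assumes `args.device` holds a device string such as `"cuda:6"` or an integer index.

```python
try:
    import torch
except ImportError:  # lets the sketch be imported without PyTorch installed
    torch = None


def empty_cache_on(device):
    """Release cached CUDA memory with `device` as the current device.

    Hypothetical helper, not the commit's actual code. Returns True if the
    cache was cleared, False if PyTorch or CUDA is unavailable.
    """
    if torch is None or not torch.cuda.is_available():
        return False
    # Switch the current device first, so empty_cache() does not touch
    # the default device (GPU 0) and create a context there.
    with torch.cuda.device(torch.device(device)):
        torch.cuda.empty_cache()
    return True
```

In a training script this would presumably be called as `empty_cache_on(args.device)` wherever `torch.cuda.empty_cache()` was previously called bare.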
1 parent 0938894 commit dbf75d9
1 file changed, 1 addition and 1 deletion.
Diff (content not preserved): line 30 was replaced; lines 27-29 and 31-33 are unchanged context.