Dear authors,
Thank you for releasing fast_l1 code together with datamodels.
While running the linear regression step of datamodels, I hit an issue with tensors not being on the same device.
After running
python -m datamodels.regression.compute_datamodels \
-C regression_config.yaml \
--data.data_path "$tmp_dir/reg_data.beton" \
--cfg.out_dir "$tmp_dir/reg_results"
I get an error similar to:
File "/path_to_python3.9/site-packages/fast_l1-0.0.1-py3.9.egg/fast_l1/regressor.py", line 221, in train_saga
RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu)
or
File "/path_to_python3.9/site-packages/fast_l1-0.0.1-py3.9.egg/fast_l1/regressor.py", line 341, in train_saga
RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu)
This happens because, at lines 221 and 341 of regressor.py, CPU tensors are indexed/sliced with index tensors that live on the GPU (idx and still_opt_outer, respectively):
Lines 221 to 222 in ef7d08d:

a_prev[:, :num_keep].copy_(a_table[idx, :num_keep],
                           non_blocking=True)

Line 341 in ef7d08d:

inds_to_swap = inds_to_swap[still_opt_outer[inds_to_swap]]
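A minimal sketch of the failure mode and one possible fix, assuming PyTorch (the tensor names a_table and idx mirror the excerpt above and are illustrative, not the library's actual state):

```python
import torch

# A CPU-resident table, as in regressor.py; in the failing run the index
# tensor lives on the GPU instead.
a_table = torch.zeros(10, 4)
idx = torch.tensor([0, 2, 5])

if torch.cuda.is_available():
    try:
        # Indexing a CPU tensor with a CUDA index tensor reproduces:
        #   RuntimeError: indices should be either on cpu or on the same
        #   device as the indexed tensor (cpu)
        a_table[idx.cuda(), :2]
    except RuntimeError as e:
        print(e)

# CPU indices are accepted for tensors on any device, so moving the index
# tensor to the CPU before indexing is one way to avoid the error:
rows = a_table[idx.cpu(), :2]
print(rows.shape)  # torch.Size([3, 2])
```

Alternatively, the table itself could be moved to the indices' device (e.g. `a_table.to(idx.device)`), at the cost of extra GPU memory; which side should move is a design choice for the maintainers.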
Those index tensors end up on the GPU because weight and train_loader in datamodels/datamodels/regression/compute_datamodels.py are on the GPU when train_saga is called:
regressor.train_saga(weight,
bias,
train_loader,
val_loader,
lr=lr,
start_lams=max_lam,
update_bias=(use_bias > 0),
lam_decay=np.exp(np.log(eps)/k),
num_lambdas=k,
early_stop_freq=early_stop_freq,
early_stop_eps=early_stop_eps,
                     logdir=str(log_path))