[BUG] Data leakage in TorchExperiment #212

@SimonBlanke

Description

The TorchExperiment reuses the same LightningDataModule instance across all optimization trials. If the datamodule maintains internal state (e.g., data augmentation settings, random state, a preprocessing cache), that state persists across trials.
In this line, the datamodule is the same for every iteration: the lightning_module is instantiated fresh for each trial, but the datamodule is reused. Here is an example of a datamodule that would behave in an undesired way:

import lightning as L
from torch.utils.data import DataLoader


class ImageDataModule(L.LightningDataModule):
    def __init__(self):
        super().__init__()
        self.augmentation_strength = 0.5  # Internal state

    def train_dataloader(self):
        # Augmentation strength might be modified during training
        transforms = self._get_transforms(self.augmentation_strength)
        return DataLoader(self.train_dataset, ...)

    def on_train_epoch_end(self):
        # Some datamodules adjust augmentation over time
        self.augmentation_strength *= 1.1

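A minimal sketch of the consequence (the trial loop is hypothetical and the hook is called directly, standing in for what a real training run would do): because every trial receives the same instance, each trial starts from whatever augmentation strength the previous trial left behind instead of the initial 0.5.

# Hypothetical sketch: one shared instance, as TorchExperiment currently passes it.
shared_dm = ImageDataModule()
for trial in range(3):
    shared_dm.on_train_epoch_end()  # training mutates internal state
    print(trial, shared_dm.augmentation_strength)
# augmentation_strength compounds across trials (~0.55, 0.61, 0.67)
# instead of resetting to 0.5

# Creating a fresh instance per trial would keep trials independent.
for trial in range(3):
    fresh_dm = ImageDataModule()  # state resets to 0.5 for every trial
    fresh_dm.on_train_epoch_end()
    print(trial, fresh_dm.augmentation_strength)
# ~0.55 every trial
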
Labels

bug (Something isn't working), module:integrations (Integrations for applying optimization to other libraries)
