Incorrect time_dim for intermediate temporal layers #4

@mlomnitz

Description

I have been working through your code trying to get it running, and I believe I found an issue in how the time_dim is set for the temporal layers here:

from typing import Tuple, Type

from torch.nn import Module

def set_time_dim_(
    klasses: Tuple[Type[Module], ...],
    model: Module,
    time_dim: int
):
    # note: the loop variable shadows the `model` argument in the
    # original; renamed to `module` here for clarity
    for module in model.modules():
        if isinstance(module, klasses):
            module.time_dim = time_dim

You are setting the same time_dim for all of the layers, but the size of the temporal dimension is cut in half after each downsampling step in the UNet. Because of this, the model crashes when trying to reshape/rearrange the tensors in the intermediate layers, for instance here (and maybe elsewhere as well):

if is_video:
    batch_size = x.shape[0]
    x = rearrange(x, 'b c t h w -> b h w t c')
else:
    assert exists(batch_size) or exists(self.time_dim)

    rearrange_kwargs = dict(b = batch_size, t = self.time_dim)
    x = rearrange(x, '(b t) c h w -> b h w t c', **compact_values(rearrange_kwargs))

I am working on my own workaround in the same set_time_dim_ function, but thought I would report it in case it is helpful.
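For what it's worth, a minimal sketch of one possible workaround, assuming the temporal dimension is halved once per UNet stage and that each temporal layer's depth can be determined while walking the module tree (the depth_of mapping below is hypothetical, not part of the repo):

```python
from typing import Dict, Tuple, Type

from torch.nn import Module

def set_time_dim_per_depth_(
    klasses: Tuple[Type[Module], ...],
    model: Module,
    time_dim: int,
    depth_of: Dict[int, int],  # hypothetical: id(module) -> UNet depth
):
    # assumption: temporal size is halved once per downsampling stage,
    # so a layer at depth d sees time_dim // (2 ** d) frames
    for module in model.modules():
        if isinstance(module, klasses):
            depth = depth_of.get(id(module), 0)
            module.time_dim = time_dim // (2 ** depth)
```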
