Closed
Labels
bug (Something isn't working)
Description
Describe the bug
The prepare_latents method of the WAN 2.1 I2V pipeline raises a RuntimeError when num_frames is set to a value other than the default of 81 (for example, 15).
Reproduction
Set width=832, height=480, num_frames=15.
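The failure with these settings can be checked with plain arithmetic. The sketch below is not diffusers code: the helper names are hypothetical. It assumes that the mask holds one entry per frame per latent pixel, that the first frame is repeated vae_scale_factor_temporal (4) times before the failing view() call, and that the spatial VAE factor is 8. Those assumptions are consistent with the 60 x 104 latent size and the element count 112320 in the error message.

```python
# Sketch only: these helpers are hypothetical and not part of diffusers.
# Assumption: the first mask frame is repeated `temporal_factor` times and the
# remaining num_frames - 1 frames are appended before the failing view().

def mask_frame_count(num_frames: int, temporal_factor: int = 4) -> int:
    """Number of frames in the mask after the first frame is repeated."""
    return temporal_factor + (num_frames - 1)


def is_viewable(num_frames: int, temporal_factor: int = 4) -> bool:
    """True if the mask frames split evenly into groups of temporal_factor."""
    return mask_frame_count(num_frames, temporal_factor) % temporal_factor == 0


def round_down_num_frames(num_frames: int, temporal_factor: int = 4) -> int:
    """Largest value <= num_frames of the form temporal_factor * k + 1."""
    return max(1, (num_frames - 1) // temporal_factor * temporal_factor + 1)


# Assumed spatial factor of 8: 480 // 8 = 60 and 832 // 8 = 104.
latent_height, latent_width = 480 // 8, 832 // 8
elements = mask_frame_count(15) * latent_height * latent_width

print(elements)                          # 112320, the size in the error
print(is_viewable(15), is_viewable(81))  # False True
print(round_down_num_frames(15))         # 13
```

Under these assumptions, 15 frames give 18 mask frames, which cannot be grouped into blocks of 4, while the default of 81 gives 84. Until the pipeline handles this itself, choosing num_frames of the form 4k + 1 (13 or 17 instead of 15) should avoid the error.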
Logs
/home/vlado/dev/sdnext/venv/lib/python3.12/site-packages/diffusers/pipelines/wan/pipeline_wan_i2v.py:611 in __call__

  610         image = self.video_processor.preprocess(image, height=height, width=width).to(device, dtype=torch.float32)
❱ 611         latents, condition = self.prepare_latents(
  612             image,

/home/vlado/dev/sdnext/venv/lib/python3.12/site-packages/diffusers/pipelines/wan/pipeline_wan_i2v.py:424 in prepare_latents

❱ 424         mask_lat_size = mask_lat_size.view(batch_size, -1, self.vae_scale_factor_temporal, latent_height, latent_width)
  425         mask_lat_size = mask_lat_size.transpose(1, 2)

RuntimeError: shape '[1, -1, 4, 60, 104]' is invalid for input of size 112320
System Info
diffusers==main
Who can help?