[LoRA] kijai wan lora support for I2V #11588

linoytsaban · 2025-05-20T12:09:30Z

modify _maybe_expand_t2v_lora_for_i2v to allow kijai t2v lora loading in i2v wan

output.44.mp4

import torch
from diffusers import AutoencoderKLWan, WanPipeline, UniPCMultistepScheduler
from diffusers.utils import export_to_video
from diffusers.loaders.lora_conversion_utils import _convert_non_diffusers_wan_lora_to_diffusers 
from huggingface_hub import hf_hub_download
from safetensors.torch import load_file
import torch
import numpy as np
from diffusers import AutoencoderKLWan, WanImageToVideoPipeline
from diffusers.utils import export_to_video, load_image
from transformers import CLIPVisionModel

MODEL_ID = "Wan-AI/Wan2.1-I2V-14B-480P-Diffusers"
LORA_REPO_ID = "Kijai/WanVideo_comfy"
LORA_FILENAME = "Wan21_CausVid_14B_T2V_lora_rank32.safetensors"

image_encoder = CLIPVisionModel.from_pretrained(MODEL_ID, subfolder="image_encoder", torch_dtype=torch.float32)

vae = AutoencoderKLWan.from_pretrained(
    MODEL_ID,
    subfolder="vae",
    torch_dtype=torch.float32 # float32 for VAE stability
)
pipe = WanImageToVideoPipeline.from_pretrained(MODEL_ID, vae=vae, 
                                               image_encoder=image_encoder, 
                                               torch_dtype=torch.bfloat16)
flow_shift = 8.0
pipe.scheduler = UniPCMultistepScheduler.from_config(
    pipe.scheduler.config, flow_shift=flow_shift
)
pipe.to("cuda")

# --- LoRA Loading ---
causvid_path = hf_hub_download(repo_id=LORA_REPO_ID, filename=LORA_FILENAME)
pipe.load_lora_weights(causvid_path,adapter_name="causvid_lora")
pipe.set_adapters(["causvid_lora"], adapter_weights=[1.0])

image = load_image(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/penguin.png"
)
max_area = 480 * 1280
aspect_ratio = image.height / image.width
mod_value = pipe.vae_scale_factor_spatial * pipe.transformer.config.patch_size[1]
height = round(np.sqrt(max_area * aspect_ratio)) // mod_value * mod_value
width = round(np.sqrt(max_area / aspect_ratio)) // mod_value * mod_value
image = image.resize((width, height))
prompt = (
    "a penguin playfully dancing in the snow"
)
negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"

output = pipe(
    image=image,
    num_inference_steps=4,
    generator = torch.Generator(device="cuda").manual_seed(0),
    prompt=prompt, 
    negative_prompt=negative_prompt, 
    height=height, 
    width=width, num_frames=81, 
    guidance_scale=1.0
).frames[0]
export_to_video(output, "output.mp4", fps=16)

HuggingFaceDocBuilderDev · 2025-05-20T12:16:21Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

linoytsaban · 2025-05-20T13:18:04Z

@bot /style

github-actions · 2025-05-20T13:18:59Z

Style fixes have been applied. View the workflow run here.

sayakpaul

Maybe update the PR description with the cool penguin you got dancing?

a-r-r-o-w

Nice, LGTM!

linoytsaban and others added 8 commits May 19, 2025 19:37

testing

068adea

testing

d73bdd1

testing

23e3c1c

testing

3ce7f9c

testing

c909d84

i2v

80b6b94

i2v

e4f3938

Merge branch 'huggingface:main' into wan

6437633

linoytsaban added 8 commits May 20, 2025 15:35

device fix

cd94c12

Merge remote-tracking branch 'origin/wan' into wan

f7dda02

testing

85a618d

fix

c5a753a

fix

36fea4e

fix

6d62f4b

fix

97168f4

fix

693277c

Apply style fixes

8ba2e90

empty commit

3a23d94

linoytsaban marked this pull request as ready for review May 20, 2025 13:19

linoytsaban requested review from a-r-r-o-w and sayakpaul May 20, 2025 13:22

sayakpaul approved these changes May 20, 2025

View reviewed changes

a-r-r-o-w approved these changes May 20, 2025

View reviewed changes

linoytsaban merged commit 5d4f723 into huggingface:main May 20, 2025
29 checks passed

linoytsaban deleted the wan branch May 21, 2025 12:42

DN6 added the roadmap Add to current release roadmap label Jun 5, 2025

github-project-automation bot added this to Diffusers Roadmap 0.36 Jun 5, 2025

github-project-automation bot moved this to In Progress in Diffusers Roadmap 0.36 Jun 5, 2025

DN6 moved this from In Progress to Done in Diffusers Roadmap 0.36 Jun 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[LoRA] kijai wan lora support for I2V #11588

[LoRA] kijai wan lora support for I2V #11588

Uh oh!

linoytsaban commented May 20, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented May 20, 2025

Uh oh!

linoytsaban commented May 20, 2025

Uh oh!

github-actions bot commented May 20, 2025

Uh oh!

sayakpaul left a comment

Uh oh!

a-r-r-o-w left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

[LoRA] kijai wan lora support for I2V #11588

[LoRA] kijai wan lora support for I2V #11588

Uh oh!

Conversation

linoytsaban commented May 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented May 20, 2025

Uh oh!

linoytsaban commented May 20, 2025

Uh oh!

github-actions bot commented May 20, 2025

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

a-r-r-o-w left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

linoytsaban commented May 20, 2025 •

edited

Loading