
Conversation


@sayakpaul sayakpaul commented Jan 14, 2025

What does this PR do?

Similar to how it's done in Flux, this PR adds support for true classifier-free guidance (CFG) in HunyuanVideo.

Here are results: https://wandb.ai/sayakpaul/hunyuanvideo_cfg/runs/41dhd69p
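For reference, true CFG runs the model on both the positive and the negative prompt each step and extrapolates from the negative prediction toward the positive one. A minimal sketch of the guidance rule (the function name is illustrative, not the PR's actual code):

```python
import torch

def true_cfg(noise_pred_pos, noise_pred_neg, true_cfg_scale):
    # "True" CFG: extrapolate from the negative-prompt prediction toward
    # the positive-prompt prediction. With scale 1.0 this reduces to the
    # positive prediction, i.e. no guidance.
    return noise_pred_neg + true_cfg_scale * (noise_pred_pos - noise_pred_neg)

pos = torch.tensor([1.0, 2.0])
neg = torch.tensor([0.0, 1.0])
print(true_cfg(pos, neg, 1.0))  # -> tensor([1., 2.])
print(true_cfg(pos, neg, 2.0))  # -> tensor([2., 3.])
```

The two forward passes per step are also why generation takes roughly twice as long with `true_cfg_scale > 1.0`.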

Code
import torch
from diffusers import HunyuanVideoPipeline, HunyuanVideoTransformer3DModel
from diffusers.utils import export_to_video
import argparse

prompt = "A cat walks on the grass, realistic"
negative_prompt = "worst quality, inconsistent motion, blurry, jittery, distorted"

def load_pipeline():
    model_id = "hunyuanvideo-community/HunyuanVideo"
    transformer = HunyuanVideoTransformer3DModel.from_pretrained(
        model_id, subfolder="transformer", torch_dtype=torch.bfloat16
    )
    pipe = HunyuanVideoPipeline.from_pretrained(
        model_id, transformer=transformer, torch_dtype=torch.float16
    ).to("cuda")
    return pipe

if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("--true_cfg_scale", type=float, default=1.0)
    args = parser.parse_args()

    pipe = load_pipeline()
    output = pipe(
        prompt=prompt,
        negative_prompt=negative_prompt,
        true_cfg_scale=args.true_cfg_scale,
        height=320,
        width=512,
        num_frames=61,
        generator=torch.manual_seed(0),
    ).frames[0]
    path = f"output_cfg@{args.true_cfg_scale}.mp4"
    export_to_video(output, path, fps=15)

TODOs

  • Add docs
  • Add support for negative embeds in __call__()
  • Add tests

@sayakpaul sayakpaul requested a review from a-r-r-o-w January 14, 2025 05:18
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sayakpaul
Member Author

@a-r-r-o-w a gentle ping.

@a-r-r-o-w
Contributor

Hi, thanks for your contribution!

The results seem a little too burnt to me and not very promising. Could we see more examples to determine if there's an improvement in quality, considering generation takes ~2x longer with CFG?

@a-r-r-o-w
Contributor

Ohh, the PR is from you as well. I thought you asked me for review on a contributor's PR lol

@sayakpaul
Member Author

Here are some more results: https://wandb.ai/sayakpaul/hunyuanvideo_cfg/runs/w4xbb90g.

The cat seems to be fishy 🐟

I ran it on another subject (a rocket): https://wandb.ai/sayakpaul/hunyuanvideo_cfg/runs/kebbcmvi and the benefits seem more apparent visually, especially for this one:

true_cfg_0_d53f2d11117d53cc601c.mp4

Compared to the no CFG case:

true_cfg_0_17717d7d8b10d0ab0944.mp4

WDYT? The effects seem to depend on the subject and the scene being generated.

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Feb 22, 2025