Skip to content

Conversation

@dg845
Copy link
Collaborator

@dg845 dg845 commented Sep 5, 2025

What does this PR do?

This PR implements a pipeline for the InfiniteTalk audio-driven video generation model (paper, code, weights). The InfiniteTalk model is designed to handle infinite-length video and demonstrates SOTA performance for video dubbing. It is ultimately based on the Wan 2.1-I2V-14B image-to-video model (with extra audio components).

Fixes #12239.

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu
@DN6
@supermeng

@dg845 dg845 mentioned this pull request Sep 5, 2025
2 tasks
@ZivKidd
Copy link

ZivKidd commented Oct 13, 2025

Hello, I have reviewed the code you submitted. It seems there are still some areas that need improvement. Will you continue to update it?

@dg845
Copy link
Collaborator Author

dg845 commented Oct 17, 2025

Hi, I am currently planning to work on it, but may not be able to find time in the short term to do so.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Support for InfiniteTalk

2 participants