Remove redundant comparison inside the diffusion loop of stable video diffusion pipeline

**Is your feature request related to a problem? Please describe.**
I found that the inside the `__call__` of stable video diffusion keeps doing async memcpy between host to device as attached.
<img width="1464" alt="Screenshot 2024-09-12 at 6 45 24 PM" src="https://github.com/user-attachments/assets/de0839f5-ca59-419a-b63a-8a8646711ebe">

**Describe the solution you'd like.**
The reason for that is actually coming from every time we get `self.do_classifier_free_guidance`, we compared tensor between `int` -> get boolean on device -> memcpy that boolean from gpu to cpu.

It'll be good to just assign a variable for it before the loop as the value won't change through the loop.

**Additional context.**
I'm glad to contribute this by opening a PR

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Remove redundant comparison inside the diffusion loop of stable video diffusion pipeline #9425

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Remove redundant comparison inside the diffusion loop of stable video diffusion pipeline #9425

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions