
Conversation

@bghira
Contributor

@bghira bghira commented May 12, 2024

What does this PR do?

Fixes #7365

Before submitting

Who can review?

@sayakpaul @yiyixuxu

@bghira bghira force-pushed the issue/7365b branch 2 times, most recently from 4ce77f5 to 992d2df Compare May 12, 2024 16:42
@sayakpaul sayakpaul requested a review from yiyixuxu May 13, 2024 09:53
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@yiyixuxu
Collaborator

why are all the tests failing?

@yiyixuxu
Collaborator

can we look into the remaining failing tests?

@bghira
Contributor Author

bghira commented May 20, 2024

Wow, I hadn't expected all of those, and hadn't really looked into them until now. I'm not sure why this simple check made those tests fail. I'm wondering if there's something I need to check before assigning the prompt embeds 🤔 like whether we have any at all? Should I try running the unit tests locally again?

@bghira
Contributor Author

bghira commented May 26, 2024

@yiyixuxu I was trying again today and `make test` doesn't work here; I guess I don't have the right version of pytest installed. I'm running `pip install diffusers[dev]` now, but is there a way to run this in a Docker container?

edit: after `pip install diffusers[dev]`, `make test` now works.

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Sep 14, 2024
Member

@sayakpaul sayakpaul left a comment


Thanks, @bghira! Apologies for the delay on my part. Would it be possible to add a test for this to https://github.com/huggingface/diffusers/blob/main/tests/pipelines/stable_diffusion_xl/test_stable_diffusion_xl.py?

@bghira
Contributor Author

bghira commented Nov 23, 2024

cc @a-r-r-o-w for test

@sayakpaul
Member

Yeah let's ensure to add at least one test before merging.

@bghira
Contributor Author

bghira commented Nov 23, 2024

I was having trouble getting the test suite running when I pushed this, and my new system doesn't have it working yet either, but a regression test would be great to keep this working.

@github-actions github-actions bot removed the stale Issues that haven't received updates label Nov 23, 2024
@sayakpaul
Member

Seems like the core tests are failing with the changes.

@a-r-r-o-w
Contributor

I see, will take a look at the tests.

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Dec 19, 2024
@bghira
Contributor Author

bghira commented Dec 19, 2024

nice try bot, but it's not stale

@github-actions github-actions bot removed the stale Issues that haven't received updates label Dec 20, 2024
@a-r-r-o-w a-r-r-o-w requested a review from yiyixuxu December 29, 2024 19:51
@a-r-r-o-w
Contributor

Nice try indeed, bot. Sorry about the delay, but this should be good to merge now.

The previous fix did not really work as expected unless one always passed `pooled_prompt_embeds`, which is what caused all the test errors. `prompt_embeds[0]` should only be assigned when encoding `prompt_2` via `text_encoder_2`. The `ndim` check suffices for now, but I'm open to suggestions.
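The guarded assignment described above can be sketched as follows. This is an illustrative stand-in, not the pipeline's actual code: `select_pooled` is a hypothetical helper, and the shapes assume `text_encoder_2` returns its pooled output as the first element of its output tuple.

```python
import numpy as np

def select_pooled(encoder_output, pooled_prompt_embeds):
    # Only derive the pooled embedding from the encoder output when the
    # caller did not supply their own, and only when the first element
    # really is a pooled (ndim == 2) tensor.
    if pooled_prompt_embeds is None and encoder_output[0].ndim == 2:
        pooled_prompt_embeds = encoder_output[0]
    return pooled_prompt_embeds

pooled = np.zeros((1, 1280))      # pooled output of text_encoder_2 (illustrative shape)
hidden = np.zeros((1, 77, 1280))  # last_hidden_state (illustrative shape)
custom = np.ones((1, 1280))       # user-provided pooled embedding

assert select_pooled((pooled, hidden), None) is pooled    # derived when absent
assert select_pooled((pooled, hidden), custom) is custom  # user value preserved
```

With this guard, a user-supplied `pooled_prompt_embeds` survives prompt encoding instead of being silently replaced.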

@a-r-r-o-w
Contributor

@yiyixuxu Our `make fix-copies` does not really work for things in the `examples/` folder. I wonder if that's by choice or simply overlooked. I don't think it would be too difficult to support: I got it working, but it creates too many changes in the existing community pipelines, so it could maybe be taken up in another PR. It's important to note that every change made to the `# Copied from` stuff in the community folder was done via find and replace; using the updated fix-copies implementation simply breaks a lot of pipelines.

@a-r-r-o-w
Contributor

Gentle ping @yiyixuxu for a final review


```diff
  # We are only ALWAYS interested in the pooled output of the final text encoder
- pooled_prompt_embeds = prompt_embeds[0]
+ if pooled_prompt_embeds is None and prompt_embeds[0].ndim == 2:
+     pooled_prompt_embeds = prompt_embeds[0]
```
Collaborator


Ohh, so this is for when users pass a pre-generated `pooled_prompt_embeds` but not `prompt_embeds`? Can you explain why they need to do that? (They need to run the text encoder to get the `prompt_embeds`, and will get the `pooled_prompt_embeds` anyway.)

Contributor


So, when someone wants to use their own provided `pooled_prompt_embeds` but a different prompt (they don't pass `prompt_embeds` here), we will encode the prompt and overwrite the value they passed. This PR only assigns the value if a custom `pooled_prompt_embeds` was not passed, because you should be able to use different prompts for different text encoders, and by that reasoning, different precomputed embeddings too (@linoytsaban did some nice threads/blogs testing this with Flux).

Collaborator


So is the custom `pooled_prompt_embeds` not specific to the prompt? Because if it is computed from the text encoder output, I would imagine the user also has `prompt_embeds` at hand and can pass it along with the `pooled_prompt_embeds`, so that we don't need to run the text encoder again.

Either way it is OK to merge, because I don't think it will cause any problems; I'm just trying to understand the use case.
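The overwrite that the PR fixes can be contrasted in a small sketch. These helpers are hypothetical stand-ins for the before/after behavior, not the actual pipeline internals, and the shapes are illustrative:

```python
import numpy as np

encoder_pooled = np.zeros((1, 1280))  # pooled output from text_encoder_2

def old_behavior(user_pooled):
    # Previous code: unconditional assignment, clobbering the user's value.
    return encoder_pooled

def new_behavior(user_pooled):
    # Fixed code: keep the user's pooled embedding when one was provided.
    if user_pooled is None and encoder_pooled.ndim == 2:
        return encoder_pooled
    return user_pooled

custom_pooled = np.ones((1, 1280))
assert old_behavior(custom_pooled) is encoder_pooled  # user's value was lost
assert new_behavior(custom_pooled) is custom_pooled   # user's value is kept
assert new_behavior(None) is encoder_pooled           # default path unchanged
```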

Contributor


I think this is a really old PR at this point, and changes may have been made over time such that it was relevant then but may not be now, so I'm not 100% sure either. I saw it was stale, so either we help move it to completion or we can close it if not needed. Maybe @bghira can elaborate further on his use case.

But I also don't see a way of getting the `pooled_prompt_embeds` just from having access to `prompt_embeds`, as you mention. When we get the `last_hidden_state` from the text encoders, it is an `ndim=3` tensor of shape `[1, 77, 768]`, and we concatenate the embeddings across the channel dim. But pooled embeddings are `ndim=2` tensors of shape `[1, 768]`. Am I missing something trivial here?
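The shape argument above can be checked with a small sketch. The dimensions follow the comment (names and exact channel counts are illustrative, loosely modeled on SDXL's two text encoders):

```python
import numpy as np

hidden_1 = np.zeros((1, 77, 768))   # last_hidden_state of text_encoder
hidden_2 = np.zeros((1, 77, 1280))  # last_hidden_state of text_encoder_2
prompt_embeds = np.concatenate([hidden_1, hidden_2], axis=-1)
pooled_prompt_embeds = np.zeros((1, 768))  # pooled output, produced by the encoder

assert prompt_embeds.shape == (1, 77, 2048) and prompt_embeds.ndim == 3
assert pooled_prompt_embeds.ndim == 2
# No slice of the ndim=3 prompt_embeds yields the ndim=2 pooled tensor:
# pooling happens inside the text encoder, so the pooled embedding cannot
# be reconstructed from prompt_embeds alone.
```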

Contributor

@a-r-r-o-w a-r-r-o-w Jan 7, 2025


And yes, `pooled_prompt_embeds` can be completely custom here, unrelated to what `prompt` (if passed) or `prompt_embeds` describes.

See these X threads by Linoy (although this is in the context of Flux):

@yiyixuxu yiyixuxu merged commit a0acbdc into huggingface:main Jan 8, 2025
12 checks passed
Successfully merging this pull request may close these issues.

Provided pooled_prompt_embeds is overwritten via prompt_embeds[0]