pixart sigma: callbacks(interrupt, latent, pos/neg embeds) and cfg_rescale #8661

RandomGitUser321 · 2024-06-21T13:07:51Z

What does this PR do?

This PR refactors the pipeline to mirror other common pipelines by adding in callback_on_step_end and callback_on_step_end_tensor_inputs, along with cfg rescaling. In the callbacks, you can interrupt, retrieve latents and/or retrieve pos/neg embeds. The older callback method will continue to provide steps_idx, t and latents, but that's it.

I've added in deprecation warnings for those still using the legacy callback and callback_steps method, instead of the newer callback_on_step_end and callback_on_step_end_tensor_inputs method, as well added in an error for if you tried to use both at the same time.

To sum it up, this adds:

callback_on_step_end and callback_on_step_end_tensor_inputs, which allow you to obtain latents and pos/neg embeds
deprecation warnings for the older callback and callback_steps methods
the ability to use self._interrupt=True on callback_on_step_end
cfg rescaling

Some snippets of the code in my app that I tested it with to verify that it works:

def interrupt_callback(self, i, t, callback_kwargs):
    # using latching variable for onkeypress event to trigger
    if not queue_latch:
        self._interrupt = True
    
    latents = callback_kwargs["latents"]
    with torch.no_grad():
        image = pipe.vae.decode(latents / 0.13025, return_dict=False)[0]
        image = pipe.image_processor.postprocess(image, output_type="pil")
        image[0].save(f"{i}.png")

    return callback_kwargs

and

latents = pipe(
                prompt_embeds=prompt_embeds,
                negative_prompt_embeds=negative_embeds,
                prompt_attention_mask=prompt_attention_mask,
                negative_prompt_attention_mask=negative_prompt_attention_mask,
                num_images_per_prompt=1,
                height=height,
                width=width,
                num_inference_steps=steps,
                guidance_scale=cfg,
                generator=seedgen,
                callback_on_step_end=interrupt_callback, ###############
                callback_on_step_end_tensor_inputs=["latents"], ########
                output_type="latent",
            ).images

A test image showing the latent callbacks working at each step(can also be used to generate realtime previews in apps like shown in my update 4 comment):

Example of the interrupt callback working while using my app:

Example of cfg rescale working(gif compression degrades quality a lot):

Before submitting

Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py

Had to change some things around vs how the original file was, in order to get the callbacks to work correctly. I tried to base a lot of the layout from newer diffusers like SD3,

based on (https://arxiv.org/pdf/2305.08891.pdf). See Section 3.4

yiyixuxu

thanks!
I think this PR should focus on adding the new callback and cfg_rescale, I think we should not include these changes introduced for encode_prompt in this PR

additionally, can you test out the dynamic classifier-free guidance on pixart using the new callback API? https://huggingface.co/docs/diffusers/using-diffusers/callback#dynamic-classifier-free-guidance

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py

RandomGitUser321 · 2024-06-28T20:03:14Z

thanks! I think this PR should focus on adding the new callback and cfg_rescale, I think we should not include these changes introduced for encode_prompt in this PR

~~I addressed this in the code comments~~ I reverted the negative_prompt changes, will open a new PR eventually after this one is finished.

additionally, can you test out the dynamic classifier-free guidance on pixart using the new callback API? https://huggingface.co/docs/diffusers/using-diffusers/callback#dynamic-classifier-free-guidance

I'll try to take a look at it as well, but after I figure out how to handle the old callback/callbacksteps deprecation.

The lecay callback will require (self, step_idx, t, latents), but has 1:1 parity with the newer callback_on_step_end method. I also included a deprecation warning and an error if both are used at the same time.

RandomGitUser321 · 2024-06-29T01:32:31Z

@yiyixuxu Alright, I reworked the legacy callback/callback_steps back in. ~~The legacy callback will require (self, step_idx, t, latents), but has 1:1 parity with the newer callback_on_step_end method~~. I also included a deprecation warning and an error if both are used at the same time.

I tested both in my app and was able to get latent callbacks for previews and ~~interrupt the process still~~(only with the newer method). If needed, I can add some kind of message warning if the callback(self, step_idx, t, latents) line toward the very end of the code detects only three inputs, instead of the four, as a hint that people just need to add a self or something else like that to their def somefunction(self, i, t, latents): function that they use for their callbacks.

For struckout stuff, read my comment below. I reverted the old callback to callback(steps_idx, t, latents) again.

RandomGitUser321 · 2024-06-29T14:57:15Z

@yiyixuxu @sayakpaul I reverted the negative_prompt changes and fully updated my original post to be more clear about the changes.

Since the original implementation doesn't appear to have the ability to interrupt, I'm just going to roll this back. If people want to interrupt, they need to use the newer method anyways, since the older callback method is deprecating. The legacy callback will still provide step_idx, t and latents, like before.

RandomGitUser321 · 2024-06-30T13:48:27Z

Since the legacy implementation of callback doesn't appear to have the ability to interrupt, I'm just going to roll this back. If people want to interrupt, they should be using the newer method anyways, since the older callback method is being deprecated.

The legacy callback will still function the same as before callback(step_idx, t, latents)

github-actions · 2024-09-14T15:08:30Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

HuggingFaceDocBuilderDev · 2024-11-17T02:30:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

github-actions · 2024-12-11T15:05:30Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

sayakpaul · 2024-12-12T04:17:57Z

@RandomGitUser321 apologies for the delay on our end. But would love to come to the PR. What is blocking for this PR currently? How can we help?

github-actions · 2025-01-05T15:06:16Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

add in interrupt callback

bc044d9

yiyixuxu reviewed Jun 21, 2024

View reviewed changes

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py Outdated Show resolved Hide resolved

RandomGitUser321 added 2 commits June 22, 2024 22:59

Callbacks should be working now

d508218

Had to change some things around vs how the original file was, in order to get the callbacks to work correctly. I tried to base a lot of the layout from newer diffusers like SD3,

corrected a few variables

e6c0b05

RandomGitUser321 changed the title ~~pixart sigma: add in an interrupt callback~~ pixart sigma: add in an interrupt callback + latent callbacks Jun 23, 2024

RandomGitUser321 added 5 commits June 22, 2024 23:38

Merge branch 'main' into main

f1deb34

missed a couple more

a8ba299

forgot to put that back in

c1f2577

more minor fixes

47b327e

Merge branch 'main' into main

1fd712f

RandomGitUser321 requested a review from yiyixuxu June 23, 2024 17:07

implement cfg rescaling to have parity with common pipelines

811f5d5

based on (https://arxiv.org/pdf/2305.08891.pdf). See Section 3.4

This comment was marked as resolved.

Sign in to view

RandomGitUser321 changed the title ~~pixart sigma: add in an interrupt callback + latent callbacks~~ pixart sigma: callbacks(interrupt, latent, pos/neg embeds) and cfg_rescale Jun 28, 2024

Merge branch 'main' into main

06fcc49

yiyixuxu reviewed Jun 28, 2024

View reviewed changes

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py Outdated Show resolved Hide resolved

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py Outdated Show resolved Hide resolved

src/diffusers/pipelines/pixart_alpha/pipeline_pixart_sigma.py Show resolved Hide resolved

slight reorder and reword

4332af4

readd legacy callback/callback_steps functionality

d6f0aab

The lecay callback will require (self, step_idx, t, latents), but has 1:1 parity with the newer callback_on_step_end method. I also included a deprecation warning and an error if both are used at the same time.

RandomGitUser321 added 2 commits June 29, 2024 10:20

revert negative_prompt changes

a6baf66

unused import

6f6576a

RandomGitUser321 requested review from sayakpaul and yiyixuxu June 29, 2024 14:57

borrowed deprecation message from sdxl

5fad1dc

github-actions bot added the stale Issues that haven't received updates label Sep 14, 2024

Merge branch 'main' into main

7da1548

github-actions bot removed the stale Issues that haven't received updates label Nov 17, 2024

github-actions bot added the stale Issues that haven't received updates label Dec 11, 2024

sayakpaul removed the stale Issues that haven't received updates label Dec 12, 2024

github-actions bot added the stale Issues that haven't received updates label Jan 5, 2025

Uh oh!

pixart sigma: callbacks(interrupt, latent, pos/neg embeds) and cfg_rescale #8661

Are you sure you want to change the base?

pixart sigma: callbacks(interrupt, latent, pos/neg embeds) and cfg_rescale #8661

Uh oh!

Conversation

RandomGitUser321 commented Jun 21, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Uh oh!

This comment was marked as resolved.

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

RandomGitUser321 commented Jun 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RandomGitUser321 commented Jun 29, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RandomGitUser321 commented Jun 29, 2024

Uh oh!

RandomGitUser321 commented Jun 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Sep 14, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Nov 17, 2024

Uh oh!

github-actions bot commented Dec 11, 2024

Uh oh!

sayakpaul commented Dec 12, 2024

Uh oh!

github-actions bot commented Jan 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

RandomGitUser321 commented Jun 21, 2024 •

edited

Loading

RandomGitUser321 commented Jun 28, 2024 •

edited

Loading

RandomGitUser321 commented Jun 29, 2024 •

edited

Loading

RandomGitUser321 commented Jun 30, 2024 •

edited

Loading