
Conversation

@DN6 (Collaborator) commented Nov 14, 2024

What does this PR do?

PR #9711 changed the latent preparation step so that the vae_scale_factor and default sample size are clearer.

However, we overlooked the fact that this operation (with scale_factor == 16)

        height = 2 * (int(height) // self.vae_scale_factor)
        width = 2 * (int(width) // self.vae_scale_factor)

is not equivalent to this operation with scale_factor == 8:

height = int(height) // self.vae_scale_factor

For resolutions that are divisible by 8 but not by 16, the current pipeline on main errors out, because the implementation does not account for the fact that the latent width and height must be divisible by 2 for the packing step to work.
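
For context, the packing step rearranges the latents into 2x2 patches, which is why both latent dimensions must be even. Here is a minimal sketch of that rearrangement (simplified from the Flux pipeline's _pack_latents; shapes and names are illustrative):

    import torch

    # Sketch of the Flux latent packing step: group the latent grid into 2x2 patches.
    # This only works if the latent height and width are both divisible by 2.
    def pack_latents(latents: torch.Tensor) -> torch.Tensor:
        b, c, h, w = latents.shape
        assert h % 2 == 0 and w % 2 == 0, "latent height/width must be even for packing"
        latents = latents.view(b, c, h // 2, 2, w // 2, 2)
        latents = latents.permute(0, 2, 4, 1, 3, 5)
        # result shape: (batch, num_patches, channels * 4)
        return latents.reshape(b, (h // 2) * (w // 2), c * 4)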

The old implementation floor-divided the height and width by 16 and then multiplied by 2, which results in slightly resized images if the height or width is not a multiple of 16.
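
To make the difference concrete, take a height of 1032 px, which is divisible by 8 but not by 16 (values are illustrative):

    height = 1032  # divisible by 8 but not by 16

    # old code (vae_scale_factor == 16): latent height is forced to be even,
    # and the image is silently resized to 2 * 64 * 8 = 1024 px
    old_latent_height = 2 * (height // 16)   # 128

    # current code on main (vae_scale_factor == 8): latent height is odd,
    # so the 2x2 packing step fails
    new_latent_height = height // 8          # 129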

This PR:

  1. Adds steps to account for latent packing, which will result in resized output images (as before); see the sketch after this list.
  2. Adds a warning that the image will be resized if an incompatible image is provided
  3. Adds fast tests to check Flux with expected and unexpected image shapes to verify that the scaling is properly applied.
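
A minimal sketch of the adjustment described in points 1 and 2; the helper name, default value, and warning text are illustrative rather than the exact merged code:

    def adjust_image_size(height: int, width: int, vae_scale_factor: int = 8):
        # the image must be divisible by the VAE compression factor *and* by 2
        # for the latent packing step, i.e. by vae_scale_factor * 2
        multiple_of = vae_scale_factor * 2
        new_height = (height // multiple_of) * multiple_of
        new_width = (width // multiple_of) * multiple_of
        if (new_height, new_width) != (height, width):
            print(
                f"Height and width must be divisible by {multiple_of}; "
                f"resizing from {height}x{width} to {new_height}x{new_width}."
            )
        return new_height, new_width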

Fixes # (issue)

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@DN6 DN6 requested a review from yiyixuxu November 14, 2024 11:23
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@bghira (Contributor) commented Nov 16, 2024

this seems really important but considering how slowly things move around here, not so sure it will be merged in a timely manner.

@yiyixuxu (Collaborator) left a comment


oh thanks for the fix!
I made a comment

        width = int(width) // self.vae_scale_factor
        # VAE applies 8x compression on images but we must also account for packing which requires
        # latent height and width to be divisible by 2.
        height = int(height) // self.vae_scale_factor - ((int(height) // self.vae_scale_factor) % 2)

is this the same as this? (basically revert back to the original code, but divisible by self.vae_scale_factor*2)
a little bit easier to understand, no?

height = 2 * (int(height) // (self.vae_scale_factor*2))
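
Both expressions pick the largest even value not exceeding int(height) // self.vae_scale_factor, so they are indeed equivalent; a quick brute-force check (illustrative, not part of the PR) confirms it:

    vae_scale_factor = 8
    for height in range(4096):
        a = height // vae_scale_factor - ((height // vae_scale_factor) % 2)
        b = 2 * (height // (vae_scale_factor * 2))
        assert a == b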

@DN6 (Collaborator, Author) commented Nov 20, 2024

Failing LoRA tests are unrelated. Merging.

@DN6 DN6 merged commit f6f7afa into main Nov 20, 2024
17 of 18 checks passed
@sayakpaul sayakpaul deleted the flux-latents-fix branch November 21, 2024 03:24
sayakpaul added a commit that referenced this pull request Dec 23, 2024
