fix: [Community pipeline] Fix flattened elements on image #10774

elismasilva · 2025-02-12T00:03:45Z

What does this PR do?

This PR add a _get_crops_coords_list function to community Mixture-of-diffusers Tiling Pipeline SDXL to automatically get
the (ctop,cleft) coords and do a best focus on image generation, it helps to better harmonize the image and corrects the problem of flattened elements.

related to #10759 PR

For local reproduction

import torch
from diffusers import DPMSolverMultistepScheduler, AutoencoderKL
from mixture_tiling_sdxl import StableDiffusionXLTilingPipeline

device="cuda"

vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16
).to(device)

model_id="stablediffusionapi/yamermix-v8-vae"
scheduler = DPMSolverMultistepScheduler(beta_start=0.00085, beta_end=0.012, beta_schedule="scaled_linear", num_train_timesteps=1000)
pipe = StableDiffusionXLTilingPipeline.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    vae=vae,
    scheduler=scheduler,
    use_safetensors=False    
).to(device)

pipe.enable_model_cpu_offload()
pipe.enable_vae_tiling()
pipe.enable_vae_slicing()

generator = torch.Generator(device).manual_seed(297984183)

# Mixture of Diffusers generation
image = pipe(
    prompt=[[
        "A charming house in the countryside, by jakub rozalski, sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece",
        "A dirt road in the countryside crossing pastures, by jakub rozalski, sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece",        
        "An old and rusty giant robot lying on a dirt road, by jakub rozalski, dark sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece"
    ]],
    tile_height=1024,
    tile_width=1280,
    tile_row_overlap=0,
    tile_col_overlap=256,
    guidance_scale_tiles=[[7, 7, 7]], # or guidance_scale=7 if is the same for all prompts
    height=1024,
    width=3840,
    generator=generator,
    num_inference_steps=30,
)["images"][0]

image.save("mixture_sdxl.png")

After published:

import torch
from diffusers import DiffusionPipeline, DPMSolverMultistepScheduler, AutoencoderKL

device="cuda"

vae = AutoencoderKL.from_pretrained(
    "madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16
).to(device)

model_id="stablediffusionapi/yamermix-v8-vae"
scheduler = DPMSolverMultistepScheduler(beta_start=0.00085, beta_end=0.012, beta_schedule="scaled_linear", num_train_timesteps=1000)
pipe = DiffusionPipeline.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    vae=vae,
    custom_pipeline="mixture_tiling_sdxl",
    scheduler=scheduler,
    use_safetensors=False    
).to(device)

pipe.enable_model_cpu_offload()
pipe.enable_vae_tiling()
pipe.enable_vae_slicing()

generator = torch.Generator(device).manual_seed(297984183)

# Mixture of Diffusers generation
image = pipe(
    prompt=[[
        "A charming house in the countryside, by jakub rozalski, sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece",
        "A dirt road in the countryside crossing pastures, by jakub rozalski, sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece",        
        "An old and rusty giant robot lying on a dirt road, by jakub rozalski, dark sunset lighting, elegant, highly detailed, smooth, sharp focus, artstation, stunning masterpiece"
    ]],
    tile_height=1024,
    tile_width=1280,
    tile_row_overlap=0,
    tile_col_overlap=256,
    guidance_scale_tiles=[[7, 7, 7]], # or guidance_scale=7 if is the same for all prompts
    height=1024,
    width=3840,    
    generator=generator,
    num_inference_steps=30,
)["images"][0]

image.save("mixture_sdxl.png")

Final result

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@asomoza @sayakpaul

…diffusers support

…ically define ctop,cleft coord to focus on image generation, helps to better harmonize the image and corrects the problem of flattened elements.

asomoza · 2025-02-12T02:32:37Z

thanks, there's some changes to instruct_pix2pix pipeline, even if they're correct let's just keep the PR for the relevant pipeline only.

elismasilva · 2025-02-12T03:24:37Z

it was not me. I think was make style and make quality changed it. Tomorrow i see if i can undo this.

elismasilva · 2025-02-12T13:48:54Z

thanks, there's some changes to instruct_pix2pix pipeline, even if they're correct let's just keep the PR for the relevant pipeline only.

done!

HuggingFaceDocBuilderDev · 2025-02-12T22:37:29Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

asomoza · 2025-02-12T22:50:31Z

thanks!!!

elismasilva added 5 commits February 10, 2025 22:29

feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-…

871f333

…diffusers support

fix use of variable latents to tile_latents

1f4adb9

removed references to modules that are not being used in this pipeline

6341f93

make style, make quality

8a792cd

fixfeat: added _get_crops_coords_list function to pipeline to automat…

6e41806

…ically define ctop,cleft coord to focus on image generation, helps to better harmonize the image and corrects the problem of flattened elements.

Merge branch 'main' into add-mixture-tiling-sdxl

4efbcc9

elismasilva force-pushed the add-mixture-tiling-sdxl branch from b84c9c0 to 4efbcc9 Compare February 12, 2025 13:47

elismasilva added 2 commits February 12, 2025 14:38

Merge branch 'main' into add-mixture-tiling-sdxl

d7f362a

Merge branch 'main' into add-mixture-tiling-sdxl

7d651e5

asomoza approved these changes Feb 12, 2025

View reviewed changes

asomoza merged commit 051ebc3 into huggingface:main Feb 12, 2025
8 of 9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: [Community pipeline] Fix flattened elements on image #10774

fix: [Community pipeline] Fix flattened elements on image #10774

Uh oh!

elismasilva commented Feb 12, 2025

Uh oh!

asomoza commented Feb 12, 2025

Uh oh!

elismasilva commented Feb 12, 2025 •

edited

Loading

Uh oh!

elismasilva commented Feb 12, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 12, 2025

Uh oh!

asomoza commented Feb 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix: [Community pipeline] Fix flattened elements on image #10774

fix: [Community pipeline] Fix flattened elements on image #10774

Uh oh!

Conversation

elismasilva commented Feb 12, 2025

What does this PR do?

For local reproduction

After published:

Final result

Before submitting

Who can review?

Uh oh!

asomoza commented Feb 12, 2025

Uh oh!

elismasilva commented Feb 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elismasilva commented Feb 12, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 12, 2025

Uh oh!

asomoza commented Feb 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

elismasilva commented Feb 12, 2025 •

edited

Loading