Conversation


@zhangvia zhangvia commented Jul 1, 2024

What does this PR do?

This PR is about when `is_sequential_cpu_offload` should be set to `True`.

Before we got the `device_map` feature for pipelines, if any component in the pipeline had an `AlignDevicesHook` (which is used to move input data to the model's device), we would set `is_sequential_cpu_offload = True`. But when using `device_map`, we also add an `AlignDevicesHook` to the model.

Besides, if someone wants to add an `AlignDevicesHook` to a model manually, `is_sequential_cpu_offload` will also be set to `True`. That triggers a bug in the `load_lora_weights()` method.

So maybe we should only set `is_sequential_cpu_offload = True` when a component is on the CPU and has an `AlignDevicesHook` simultaneously.
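A minimal sketch of the proposed detection logic (the function name, the stand-in `AlignDevicesHook` class, and the mock components are illustrative only; the real check lives in diffusers' LoRA loading code and inspects accelerate's actual hook classes):

```python
from types import SimpleNamespace

class AlignDevicesHook:  # stand-in for accelerate.hooks.AlignDevicesHook
    pass

def is_sequential_cpu_offload(module) -> bool:
    """Sketch of the proposed check: a hook alone is not enough; the
    module must also currently sit on the CPU to count as sequential
    CPU offload."""
    if not hasattr(module, "_hf_hook"):
        return False
    device = getattr(module, "device", None)
    return (
        device is not None
        and device.type == "cpu"
        and isinstance(module._hf_hook, AlignDevicesHook)
    )

# Hook present and module on CPU -> treated as sequential offload
cpu_component = SimpleNamespace(
    _hf_hook=AlignDevicesHook(), device=SimpleNamespace(type="cpu")
)
print(is_sequential_cpu_offload(cpu_component))  # True

# Manually added hook on a GPU-resident module -> NOT sequential offload
gpu_component = SimpleNamespace(
    _hf_hook=AlignDevicesHook(), device=SimpleNamespace(type="cuda")
)
print(is_sequential_cpu_offload(gpu_component))  # False
```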

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul
Member

Let us know when this PR is ready for a review.

@zhangvia
Author

zhangvia commented Jul 2, 2024

Alright, I will update everything that depends on `is_sequential_cpu_offload`.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zhangvia zhangvia force-pushed the modify_is_sequential_off_load branch from 5ca5291 to eebed22 Compare July 4, 2024 01:55
@zhangvia zhangvia marked this pull request as ready for review July 4, 2024 01:58
@zhangvia
Author

zhangvia commented Jul 4, 2024

I have checked everything related to `is_sequential_cpu_offload`; please take a look @sayakpaul, thanks!

@sayakpaul sayakpaul requested a review from yiyixuxu July 4, 2024 02:02
@zhangvia
Author

A gentle ping here @sayakpaul @yiyixuxu

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Sep 14, 2024
@a-r-r-o-w a-r-r-o-w removed the stale Issues that haven't received updates label Nov 20, 2024
@a-r-r-o-w a-r-r-o-w requested a review from sayakpaul November 20, 2024 00:20
@a-r-r-o-w
Contributor

Hi, sorry for the delay here. I've asked Sayak for a review on this.

@sayakpaul
Member

Hi,

Thanks for your PR. Could you demonstrate your use-case with some minimal code for us to understand this better?

@sayakpaul sayakpaul added the needs-code-example Waiting for relevant code example to be provided label Nov 20, 2024
Comment on lines +381 to +385
if is_model_cpu_offload or is_sequential_cpu_offload:
    logger.info(
        "Pipeline offload enabled and Accelerate hooks detected. Since you have called `load_lora_weights()`, the previous offload hooks will be first removed. Then the LoRA parameters will be loaded and the hooks will be applied again."
    )
    remove_hook_from_module(component, recurse=is_sequential_cpu_offload)
Member

This looks alright to me.

return False

return hasattr(module, "_hf_hook") and (
return hasattr(module, "_hf_hook") and hasattr(module,'device') and module.device.type == "cpu" and (
Member

Why the expansion?

Author

Manually adding a hook to the model and moving it to another GPU does not mean that sequential CPU offload is enabled, but the original code would still return `True`.
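A minimal repro of the misdetection (the mock objects stand in for a torch module carrying an accelerate hook; the real check also verifies the hook's type, which is elided here for brevity):

```python
from types import SimpleNamespace

def old_check(module):
    # Old heuristic: any module with an accelerate hook counted as
    # sequential CPU offload.
    return hasattr(module, "_hf_hook")

def new_check(module):
    # Proposed heuristic: the module must also currently sit on the CPU.
    return (
        hasattr(module, "_hf_hook")
        and hasattr(module, "device")
        and module.device.type == "cpu"
    )

# A model with a manually attached hook that was moved to a GPU:
gpu_model = SimpleNamespace(_hf_hook=object(), device=SimpleNamespace(type="cuda"))
print(old_check(gpu_model))  # True  (misdetected as sequential offload)
print(new_check(gpu_model))  # False (correctly ignored)
```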

@github-actions
Contributor

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the stale Issues that haven't received updates label Dec 14, 2024
@zhangvia
Author

zhangvia commented Mar 11, 2025

Sorry for the delay.

Hi,

Thanks for your PR. Could you demonstrate your use-case with some minimal code for us to understand this better?

There are two cases that can hit errors with the current version of diffusers, both arising when you want to manually add an `AlignDevicesHook`:

(1) The present `device_map` feature is not granular enough to achieve precise memory control. Sometimes it is better to let the user decide which model goes on which GPU, instead of relying on "balanced" or "auto".
(2) You want to implement a custom offloading strategy, such as block swap, for some model in the pipeline.

@github-actions github-actions bot removed the stale Issues that haven't received updates label Mar 11, 2025