[docs] Distributed inference #12285

stevhliu · 2025-09-04T18:39:20Z

Reduces bloat by removing the `## Model sharding` section because it isn't really about distributed inference (it's more about selectively loading and deleting models) and doesn't show processing multiple prompts in parallel.

HuggingFaceDocBuilderDev · 2025-09-04T18:47:20Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

Left some comments. We will have to prepare it better once #11941 is in ;)

docs/source/en/training/distributed_inference.md

sayakpaul · 2025-09-05T01:50:55Z

-> [!TIP]
-> You can use `device_map` within a [`DiffusionPipeline`] to distribute its model-level components on multiple devices. Refer to the [Device placement](../tutorials/inference_with_big_models#device-placement) guide to learn more.
-
-## Model sharding


What happened here?

stevhliu · 2025-09-05

I removed it because it seems more like a "recipe" for progressively and strategically fitting models on a GPU by loading and removing them. I don't think a user really learns anything new or useful about `device_map` here compared to the device placement docs, which are linked at the bottom.

I would suggest removing it or at least moving it to Resources > Task Recipes.

sayakpaul · 2025-09-22

Removal doesn't sound right as the content is useful IMO. Including it in "Task Recipes" might also hamper its discoverability.

stevhliu · 2025-09-22

Ok, I added it back :)

sayakpaul · 2025-09-23

Looks like it's still discarded?

stevhliu · 2025-09-23

Should be here now!

stevhliu requested a review from sayakpaul September 4, 2025 18:47

sayakpaul reviewed Sep 5, 2025

stevhliu added 2 commits September 22, 2025 12:48

init

f2d1133

feedback

90409dd

stevhliu force-pushed the distributed-inference branch from b1713b2 to 90409dd Compare September 22, 2025 20:03

Merge branch 'main' into distributed-inference

f0abc38

stevhliu requested a review from sayakpaul September 26, 2025 21:46

sayakpaul approved these changes Sep 27, 2025

Merge branch 'main' into distributed-inference

fa0d975

stevhliu merged commit ccedeca into huggingface:main Sep 29, 2025
1 check passed

stevhliu deleted the distributed-inference branch September 29, 2025 18:24
