[feat] cache allocator warmup for `from_single_model` #12305

sayakpaul · 2025-09-09T10:08:19Z

What does this PR do?

from diffusers import FluxTransformer2DModel
import torch 
import time

ckpt_path = "https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/flux1-dev.safetensors"

start = time.perf_counter()
transformer = FluxTransformer2DModel.from_single_file(
    ckpt_path, torch_dtype=torch.bfloat16, device_map="cuda"
)
# transformer = FluxTransformer2DModel.from_single_file(
#     ckpt_path, torch_dtype=torch.bfloat16,
# ).to("cuda")
torch.cuda.synchronize()
end = time.perf_counter()

print(f"Take taken to initialize the model on CUDA: {(end - start)} seconds.")

Timing:

CUDA placement manual: 4.86 seconds
Device map: 1.09 seconds

(On an H100)

HuggingFaceDocBuilderDev · 2025-09-09T10:15:41Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul · 2025-09-09T12:11:18Z

@stevhliu it would be awesome to also add a doc entry for this.

@DN6 test needed to be added this PR?

stevhliu · 2025-09-09T15:36:29Z

Added a note about it in #12256 👍

sayakpaul · 2025-09-10T07:25:30Z

Included a test too.

add

57cd26e

sayakpaul requested a review from DN6 September 9, 2025 10:08

DN6 approved these changes Sep 9, 2025

View reviewed changes

Merge branch 'main' into single-file-cache-allocator

fb4db96

sayakpaul added 2 commits September 10, 2025 12:40

add a test

4c13de7

Merge branch 'main' into single-file-cache-allocator

903ef66

sayakpaul merged commit 9e7ae56 into main Sep 10, 2025
13 of 14 checks passed

sayakpaul deleted the single-file-cache-allocator branch September 10, 2025 07:25

stevhliu mentioned this pull request Sep 22, 2025

[docs] Model formats #12256

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[feat] cache allocator warmup for `from_single_model` #12305

[feat] cache allocator warmup for `from_single_model` #12305

Uh oh!

sayakpaul commented Sep 9, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Sep 9, 2025

Uh oh!

sayakpaul commented Sep 9, 2025

Uh oh!

stevhliu commented Sep 9, 2025

Uh oh!

sayakpaul commented Sep 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[feat] cache allocator warmup for from_single_model #12305

[feat] cache allocator warmup for from_single_model #12305

Uh oh!

Conversation

sayakpaul commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Sep 9, 2025

Uh oh!

sayakpaul commented Sep 9, 2025

Uh oh!

stevhliu commented Sep 9, 2025

Uh oh!

sayakpaul commented Sep 10, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[feat] cache allocator warmup for `from_single_model` #12305

[feat] cache allocator warmup for `from_single_model` #12305

sayakpaul commented Sep 9, 2025 •

edited

Loading