Skip to content

Commit 588ebf5

Browse files
committed
add diffusers installation instruction
1 parent 46b47fb commit 588ebf5

File tree

1 file changed

+5
-14
lines changed

1 file changed

+5
-14
lines changed

README.md

Lines changed: 5 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -33,14 +33,16 @@ Here are some example outputs for prompt `"A cat playing with a ball of yarn"`:
3333

3434
## Setup
3535
We rely primarily on pure PyTorch for the optimizations. Currently, a relatively recent nightly version of PyTorch is required.
36+
3637
The numbers reported here were gathered using:
3738
* `torch==2.8.0.dev20250605+cu126` - note that we rely on some fixes since 2.7
3839
* `torchao==0.12.0.dev20250610+cu126` - note that we rely on a fix in the 06/10 nightly
39-
* `diffusers==0.33.1`
40+
* `diffusers` - with [this fix](https://github.com/huggingface/diffusers/pull/11696) included
4041
* `flash_attn_3==3.0.0b1`
4142

4243
To install deps:
4344
```
45+
pip uninstall diffusers -y && pip install git+https://github.com/huggingface/diffusers@b272807bc898a314cde536c1d7d1e43592af1fce
4446
pip install --pre torch==2.8.0.dev20250605+cu126 --index-url https://download.pytorch.org/whl/nightly/cu126
4547
pip install --pre torchao==0.12.0.dev20250609+cu126 --index-url https://download.pytorch.org/whl/nightly/cu126
4648
pip install diffusers==0.33.1
@@ -52,21 +54,10 @@ For hardware, we used a 96GB 700W H100 GPU. Some of the optimizations applied (B
5254

5355
## Run the optimized pipeline
5456

55-
```
56-
python gen_image.py --prompt "An astronaut standing next to a giant lemon" --output-file output.png --use-cached-model
57-
```
58-
59-
This will include all optimizations and will attempt to use pre-cached binary models
60-
generated via `torch.export` + AOTI. To generate these binaries for subsequent runs, run
61-
the above command without the `--use-cached-model` flag.
57+
TODO
6258

6359
> [!IMPORTANT]
64-
> The binaries won't work for hardware that is sufficiently different from the hardware they were
65-
> obtained on. For example, if the binaries were obtained on an H100, they won't work on A100.
66-
> Further, the binaries are currently Linux-only and include dependencies on specific versions
67-
> of system libs such as libstdc++; they will not work if they were generated in a sufficiently
68-
> different environment than the one present at runtime. The PyTorch Compiler team is working on
69-
> solutions for more portable binaries / artifact caching.
60+
> The binaries won't work for hardware that are different from the ones they were obtained on. For example, if the binaries were obtained on an H100, they won't work on A100.
7061
7162
## Benchmarking
7263
[`run_benchmark.py`](./run_benchmark.py) is the main script for benchmarking the different optimization techniques.

0 commit comments

Comments
 (0)