Remove inference multiplicity #36
Open
Closes #24
Description
Running inference with `--nsamples_per_protein` greater than 1 blows up memory when using the torch backend. This happens because `inference_multiplicity` was used to replicate every tensor (including the ESM embeddings) `nsample_per_protein` times before moving them to the GPU, so multi-sample inference pushed huge duplicated feature tensors into vRAM and regularly OOM'd. After this fix, taking 5 samples of a protein sequence of ~1000 amino acids stays at around 31 GiB of vRAM with the torch backend (instead of running out of memory on an 81 GiB GPU).
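As a rough illustration (not the repository's code; the tensor shapes and the `expand`-based replication are assumptions), materializing one copy of the features per sample multiplies their vRAM footprint by the sample count:

```python
import torch

nsamples_per_protein = 5
seq_len, embed_dim = 1000, 2560                  # hypothetical ESM embedding shape
esm_embeddings = torch.randn(1, seq_len, embed_dim)

# inference_multiplicity-style replication: every sample gets its own copy,
# all resident at once before anything is moved to the GPU
replicated = esm_embeddings.expand(nsamples_per_protein, -1, -1).contiguous()
print(f"{replicated.nelement() * replicated.element_size() / 2**20:.0f} MiB")
# ~49 MiB for 5 copies of this one tensor vs ~10 MiB for a single copy;
# repeated across all feature tensors this adds up quickly
```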
Implementation
I fixed it by removing `inference_multiplicity` altogether and simply running inference in an `nsample_per_protein` loop, as in the sketch below. This does slow down inference since it's no longer batched, so feel free to close the PR; it's just here for future reference.
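For reference, a minimal sketch of the loop-based approach (the function and argument names here are assumptions, not the repository's actual API):

```python
import torch

def sample_per_protein(model, features, nsample_per_protein, device="cuda"):
    """Hypothetical sketch of the fix: keep a single copy of the features
    on the GPU and call the model once per requested sample, instead of
    replicating the features nsample_per_protein times up front."""
    features = {k: v.to(device) for k, v in features.items()}  # one copy in vRAM
    samples = []
    with torch.no_grad():
        for _ in range(nsample_per_protein):
            samples.append(model(**features))  # one sample per forward pass
    return samples
```

The trade-off is exactly the one noted above: memory stays flat at the cost of running `nsample_per_protein` sequential forward passes instead of one batched pass.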