This repository provides an end-to-end automated pipeline for running milestoning simulations with machine-learned force fields to accelerate drug-target kinetic and thermodynamic predictions. The pipeline utilizes SEEKR2, SEEKRTools, and espaloma=0.3.2 to automate simulations.
To install and set up the necessary environment, run the following commands in sequence.
The --yes flag ensures it installs without prompting for confirmation.
conda create -n one_step_kinetics python=3.10 --yesOnce the environment is created, activate the conda environment, switching the current shell session to use the new environment.
conda activate one_step_kineticsMamba is a faster drop-in replacement for conda that speeds up package installations.
conda install conda-forge::mamba --yesThis step installs OpenMM plugin for SEEKR2 package.
mamba install seekr2_openmm_plugin openmm=8.1 --yesRun the following command to check if the SEEKR2 OpenMM plugin is correctly installed. If no error message appears, the installation was successful.
python -c "import seekr2plugin"Git is required to clone repositories. Ensure it is installed using:
conda install conda-forge::git --yesSEEKR2 is the core package required for performing milestoning simulations. We will download it from GitHub, install it, and verify that it works correctly.
cd ~git clone https://github.com/seekrcentral/seekr2.gitcd seekr2python -m pip install .pytestcd ~SEEKRTools is a companion package to SEEKR2 that provides utilities for preparing and facilitating multiscale milestoning simulations.
cd ~git clone https://github.com/seekrcentral/seekrtools.gitcd seekrtoolspython -m pip install .pytestcd ~The Open Force Field (OpenFF) Toolkit is required to assign and manipulate molecular mechanics parameters.
conda install conda-forge::openff-toolkit --yesThe OpenEye Toolkits are used for quantum chemistry-based force field parameterization. The OpenEye toolkits require a valid OpenEye academic license, free for academic users but must be obtained directly from https://www.eyesopen.com/academic-licensing.
Run the following command to install OpenEye toolkits via conda:
conda install openeye::openeye-toolkits --yesAfter obtaining an OpenEye academic license, save the provided oe_license.txt file in a secure location on your system. For example, you may place it in:
/home/USERNAME/licenses/oe_license.txtTo ensure that OpenEye toolkits can find the license file at runtime, export the license path by adding the following line to your ~/.bashrc.
export OE_LICENSE="/home/USERNAME/licenses/oe_license.txt"To apply this change immediately in the current terminal session, run:
source ~/.bashrcOpenMM Force Fields provide additional parameter sets for molecular simulations using OpenMM.
conda install conda-forge::openmmforcefields --yesInstall espaloma version 0.3.2, which includes the latest parameterization models.
conda install conda-forge::"espaloma=0.3.2" --yesBrownDye2 is a package used for Brownian dynamics (BD) simulations, which are needed to compute association rate constants. If you plan to run BD simulations, follow these installation steps. Some of these steps require sudo privileges (administrator access). If you do not have sudo access, contact your system administrator.
cd ~BrownDye2 requires several system libraries for compilation. Install them using:
sudo apt-get install libexpat1 make apbs liblapack-devsudo apt-get install ocaml ocaml-native-compilerssudo apt-get install libexpat1-devwget https://browndye.ucsd.edu/downloads/browndye2.tar.gztar xvfz browndye2.tar.gzcd browndye2make -j 4 allcd ~rm -rf browndye2.tar.gzOnce the conda environment is sett up with necessary package installations, please follow the step-by-step instructions on how to clone, navigate, and set up the milestoning simulation pipeline for kinetic and thermodynamic predictions.
git clone https://github.com/anandojha/automated_milestoning_pipeline.gitcd automated_milestoning_pipelinecd trypsin_benzamidineconda activate one_step_kineticsReplace "USERNAME" with the actual home directory name and make sure the oe_license.txt file is stored in the specified location.
export OE_LICENSE="/home/USERNAME/licenses/oe_license.txt"Once the above steps are successful, read the README.md file in the trypsin_benzamidine project folder for further instructions on running the series of scripts.
There is a separate folder named trypsin_benzamidine_simulation, where these scripts are executed for comparison purposes. Note that these simulations are run for a very short time, meaning that the computed kinetic, thermodynamic, and rate constants may not accurately represent absolute experimental values. The purpose of this execution is to demonstrate the workflow and methodology, but actual simulations should be run for extended durations to obtain scientifically meaningful results. The users can compare results across different simulation times by increasing sampling duration and milestone transitions for better accuracy.
- SEEKR2: https://github.com/seekrcentral/seekr2
- SEEKR2 OpenMM Plugin: https://github.com/seekrcentral/seekr2_openmm_plugin
- SEEKRTools: https://github.com/seekrcentral/seekrtools
- QMrebind: https://github.com/seekrcentral/qmrebind
- Ojha, Anupam Anand, Lane William Votapka, Gary Alexander Huber, Shang Gao, and Rommie Elizabeth Amaro. "An introductory tutorial to the SEEKR2 (Simulation enabled estimation of kinetic rates v. 2) multiscale milestoning software [Article v1. 0]." Living Journal of Computational Molecular Science 5, no. 1 (2023): 2359-2359.
- Votapka, Lane W., Andrew M. Stokely, Anupam A. Ojha, and Rommie E. Amaro. "SEEKR2: Versatile multiscale milestoning utilizing the OpenMM molecular dynamics engine." Journal of chemical information and modeling 62, no. 13 (2022): 3253-3262.
- Ojha, Anupam Anand, Lane William Votapka, and Rommie Elizabeth Amaro. "QMrebind: incorporating quantum mechanical force field reparameterization at the ligand binding site for improved drug-target kinetics through milestoning simulations." Chemical Science 14, no. 45 (2023): 13159-13175.
- Ojha, Anupam Anand, Ambuj Srivastava, Lane William Votapka, and Rommie E. Amaro. "Selectivity and ranking of tight-binding JAK-STAT inhibitors using Markovian milestoning with Voronoi tessellations." Journal of chemical information and modeling 63, no. 8 (2023): 2469-2482.
- Votapka, Lane W., Benjamin R. Jagger, Alexandra L. Heyneman, and Rommie E. Amaro. "SEEKR: simulation enabled estimation of kinetic rates, a computational tool to estimate molecular kinetics and its application to trypsin–benzamidine binding." The Journal of Physical Chemistry B 121, no. 15 (2017): 3597-3606.
- Jagger, Benjamin R., Anupam A. Ojha, and Rommie E. Amaro. "Predicting ligand binding kinetics using a Markovian milestoning with voronoi tessellations multiscale approach." Journal of Chemical Theory and Computation 16, no. 8 (2020): 5348-5357.
- Jagger, Benjamin R., Christopher T. Lee, and Rommie E. Amaro. "Quantitative ranking of ligand binding kinetics with a multiscale milestoning simulation approach." The journal of physical chemistry letters 9, no. 17 (2018): 4941-4948.
- Votapka, Lane W., and Rommie E. Amaro. "Multiscale estimation of binding kinetics using Brownian dynamics, molecular dynamics and milestoning." PLoS computational biology 11, no. 10 (2015): e1004381.
The following people have contributed directly to the coding and validation efforts of automated milestoning pipeline for kinetic and thermodynamic predictions (listed in alphabetical order of first name). The authors would like to thank everyone who has helped or will help improve this project by providing feedback, bug reports, or other comments.
- Anupam A. Ojha, Flatiron Institute
- Lane W. Votapka, UC San Diego
- Rommie E. Amaro, UC San Diego
- Sonya M. Hanson, Flatiron Institute