Commit afe4aa6

boltz-2
1 parent e89b3e0 commit afe4aa6

70 files changed: 17038 additions & 849 deletions

README.md

Lines changed: 28 additions & 20 deletions
Original file line number | Diff line number | Diff line change
@@ -1,8 +1,9 @@
11
<div align="center">
22
<div>&nbsp;</div>
3-
<img src="docs/boltz_title.png" width="400"/>
3+
<img src="docs/boltz2_title.png" width="400"/>
4+
<img src="https://model-gateway.boltz.bio/a.png?x-pxid=bce1627f-f326-4bff-8a97-45c6c3bc929d" />
45

5-
[Paper](https://doi.org/10.1101/2024.11.19.624167) |
6+
[Boltz-1](https://doi.org/10.1101/2024.11.19.624167) | [Boltz-2](https://boltz.bio/boltz2_paper.pdf) |
67
[Slack](https://join.slack.com/t/boltz-community/shared_invite/zt-34qg8uink-V1LGdRRUf3avAUVaRvv93w) <br> <br>
78
</div>
89

@@ -12,11 +13,14 @@
1213

1314
## Introduction
1415

15-
Boltz-1 and Boltz-1x are state-of-the-art open-source models to predict biomolecular structures containing combinations of proteins, RNA, DNA, and other molecules. They also support modified residues, covalent ligands and glycans, as well as conditioning the prediction on specified interaction pockets or contacts.
16+
Boltz is a family of models for biomolecular interaction prediction. Boltz-1 was the first fully open source model to approach AlphaFold3 accuracy. Our latest work Boltz-2 is a new biomolecular foundation model that goes beyond AlphaFold3 and Boltz-1 by jointly modeling complex structures and binding affinities, a critical component towards accurate molecular design. Boltz-2 is the first deep learning model to approach the accuracy of physics-based free-energy perturbation (FEP) methods, while running 1000x faster — making accurate in silico screening practical for early-stage drug discovery.
1617

17-
All the code and weights are provided under MIT license, making them freely available for both academic and commercial uses. For more information about the model, see our [technical report](https://doi.org/10.1101/2024.11.19.624167). To discuss updates, tools and applications join our [Slack channel](https://join.slack.com/t/boltz-community/shared_invite/zt-34qg8uink-V1LGdRRUf3avAUVaRvv93w).
18+
All the code and weights are provided under MIT license, making them freely available for both academic and commercial uses. For more information about the model, see the [Boltz-1](https://doi.org/10.1101/2024.11.19.624167) and [Boltz-2](https://boltz.bio/boltz2_paper.pdf) technical reports. To discuss updates, tools and applications join our [Slack channel](https://join.slack.com/t/boltz-community/shared_invite/zt-34qg8uink-V1LGdRRUf3avAUVaRvv93w).
1819

1920
## Installation
21+
22+
> Note: we recommend installing boltz in a fresh python environment
23+
2024
Install boltz with PyPI (recommended):
2125

2226
```
@@ -29,41 +33,37 @@ or directly from GitHub for daily updates:
2933
git clone https://github.com/jwohlwend/boltz.git
3034
cd boltz; pip install -e .
3135
```
32-
> Note: we recommend installing boltz in a fresh python environment
3336

3437
## Inference
3538

36-
You can run inference using Boltz-1x with:
39+
You can run inference using Boltz with:
3740

3841
```
3942
boltz predict input_path --use_msa_server
4043
```
4144

42-
Boltz currently accepts three input formats:
45+
`input_path` should point to a YAML file, or a directory of YAML files for batched processing, describing the biomolecules you want to model and the properties you want to predict (e.g. affinity). To see all available options: `boltz predict --help` and for more information on these input formats, see our [prediction instructions](docs/prediction.md). By default, the `boltz` command will run the latest version of the model.
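A minimal sketch of the invocation described in the added line above, wrapped in Python. Only `boltz predict`, the `input_path` argument and the `--use_msa_server` flag come from the README; the helper name `build_predict_command` and the example paths are hypothetical.

```python
import os
import shlex


def build_predict_command(input_path, use_msa_server=True):
    """Build the argument list for `boltz predict` (hypothetical helper).

    `input_path` may be a single YAML file or a directory of YAML files,
    as the README describes for batched processing.
    """
    cmd = ["boltz", "predict", os.fspath(input_path)]
    if use_msa_server:
        # Flag taken verbatim from the README example.
        cmd.append("--use_msa_server")
    return cmd


if __name__ == "__main__":
    # Example paths are placeholders, not files shipped with the repository.
    print(shlex.join(build_predict_command("complex.yaml")))
    print(shlex.join(build_predict_command("inputs_dir")))
```

Passing the returned list to `subprocess.run(cmd, check=True)` would launch the prediction, assuming `boltz` is installed in the active environment.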
4346

44-
1. Fasta file, for most use cases
47+
## Evaluation
4548

46-
2. A comprehensive YAML schema, for more complex use cases
49+
⚠️ **Coming soon: updated evaluation code for Boltz-2!**
4750

48-
3. A directory containing files of the above formats, for batched processing
51+
To encourage reproducibility and facilitate comparison with other models, on top of the existing Boltz-1 evaluation pipeline, we will soon provide the evaluation scripts and structural predictions for Boltz-2, Boltz-1, Chai-1 and AlphaFold3 on our test benchmark dataset, and our affinity predictions on the FEP+ benchmark, CASP16 and our MF-PCBA test set.
4952

50-
To see all available options: `boltz predict --help` and for more information on these input formats, see our [prediction instructions](docs/prediction.md).
53+
![Affinity test sets evaluations](docs/pearson_plot.png)
54+
![Test set evaluations](docs/plot_test_boltz2.png)
5155

52-
## Evaluation
5356

54-
To encourage reproducibility and facilitate comparison with other models, we provide the evaluation scripts and predictions for Boltz-1, Boltz-1x, Chai-1 and AlphaFold3 on our test benchmark dataset as well as CASP15. These datasets are created to contain biomolecules different from the training data and to benchmark the performance of these models we run them with the same input MSAs and same number of recycling and diffusion steps. More details on these evaluations can be found in our [evaluation instructions](docs/evaluation.md).
55-
56-
![Test set evaluations](docs/plot_test.png)
57-
![CASP15 set evaluations](docs/plot_casp.png)
57+
## Training
5858

59+
⚠️ **Coming soon: updated training code for Boltz-2!**
5960

60-
## Training
61+
If you're interested in retraining the model, currently for Boltz-1 but soon for Boltz-2, see our [training instructions](docs/training.md).
6162

62-
If you're interested in retraining the model, see our [training instructions](docs/training.md).
6363

6464
## Contributing
6565

66-
We welcome external contributions and are eager to engage with the community. Connect with us on our [Slack channel](https://join.slack.com/t/boltz-community/shared_invite/zt-34qg8uink-V1LGdRRUf3avAUVaRvv93w) to discuss advancements, share insights, and foster collaboration around Boltz-1.
66+
We welcome external contributions and are eager to engage with the community. Connect with us on our [Slack channel](https://join.slack.com/t/boltz-community/shared_invite/zt-34qg8uink-V1LGdRRUf3avAUVaRvv93w) to discuss advancements, share insights, and foster collaboration around Boltz-2.
6767

6868
Boltz also runs on Tenstorrent hardware thanks to a [fork](https://github.com/moritztng/tt-boltz) by Moritz Thüning.
6969

@@ -74,9 +74,17 @@ Our model and code are released under MIT License, and can be freely used for bo
7474

7575
## Cite
7676

77-
If you use this code or the models in your research, please cite the following paper:
77+
If you use this code or the models in your research, please cite the following papers:
7878

7979
```bibtex
80+
@article{passaro2025boltz2,
81+
author = {Passaro, Saro and Corso, Gabriele and Wohlwend, Jeremy and Reveiz, Mateo and Thaler, Stephan and Somnath, Vignesh Ram and Portnoi, Tally and Roy, Julien and Stark, Hannes and Kwabi-Addo, David and Beaini, Dominique and Jaakkola, Tommi and Barzilay, Regina},
82+
title = {Boltz-2: Towards Accurate and Efficient Binding Affinity Prediction},
83+
year = {2025},
84+
doi = {},
85+
journal = {}
86+
}
87+
8088
@article{wohlwend2024boltz1,
8189
author = {Wohlwend, Jeremy and Corso, Gabriele and Passaro, Saro and Getz, Noah and Reveiz, Mateo and Leidal, Ken and Swiderski, Wojtek and Atkinson, Liam and Portnoi, Tally and Chinn, Itamar and Silterra, Jacob and Jaakkola, Tommi and Barzilay, Regina},
8290
title = {Boltz-1: Democratizing Biomolecular Interaction Modeling},

docs/boltz2_title.png

142 KB

docs/boltz_title.png

-258 KB
Binary file not shown.

docs/evaluation.md

Lines changed: 23 additions & 4 deletions
Original file line number | Diff line number | Diff line change
@@ -1,12 +1,31 @@
11
# Evaluation
22

3+
To encourage reproducibility and facilitate comparison with other models, we will provide the evaluation scripts and structural predictions for Boltz-2, Boltz-1, Chai-1 and AlphaFold3 on our test benchmark dataset, and our affinity predictions on the FEP+ benchmark, CASP16 and our MF-PCBA test set.
4+
5+
![Test set evaluations](plot_test_boltz2.png)
6+
![Affinity test sets evaluations](pearson_plot.png)
7+
8+
9+
## Evaluation files
10+
⚠️ **Coming soon updated evaluation files for Boltz-2!**
11+
12+
13+
## Evaluation setup
14+
⚠️ **Coming soon updated setup for Boltz-2!**
15+
16+
17+
## Evaluation script
18+
⚠️ **Coming soon updated evaluation scripts for Boltz-2!**
19+
20+
# Evaluation
21+
322
To encourage reproducibility and facilitate comparison with other models, we provide the evaluation scripts and predictions for Boltz-1, Chai-1, and AlphaFold3 on our test benchmark dataset as well as CASP15. These datasets are created to contain biomolecules different from the training data and to benchmark the performance of these models we run them with the same input MSAs and the same number of recycling and diffusion steps.
423

524
![Test set evaluations](../docs/plot_test.png)
625
![CASP15 set evaluations](../docs/plot_casp.png)
726

827

9-
## Evaluation files
28+
## (Old) Boltz-1 Evaluation files
1029

1130
You can download all the MSAs, input files, output files and evaluation outputs for Boltz-1, Boltz-1x, Chai-1, and AlphaFold3 from this [Google Drive folder](https://drive.google.com/file/d/1JvHlYUMINOaqPTunI9wBYrfYniKgVmxf/view?usp=sharing).
1231

@@ -35,7 +54,7 @@ boltz_results_final/
3554
└── results_test.csv # Summary of evaluation results for test set
3655
```
3756

38-
## Evaluation setup
57+
## (Old) Boltz-1 Evaluation setup
3958

4059
We evaluate the model on two datasets:
4160
- PDB test set: 541 targets after our validation cut-off date and at most 40% sequence similarity for proteins, 80% Tanimoto for ligands.
@@ -49,6 +68,6 @@ For fair comparison we compare the models with the following setup:
4968
- We compare our oracle and top-1 numbers among the 5 samples.
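The oracle versus top-1 comparison in the setup above can be sketched as follows. This assumes the common reading of the two terms, which the docs shown here do not spell out: "oracle" is the best of the 5 samples according to the evaluation metric itself, and "top-1" is the metric of the sample the model's own confidence ranks first. The function name and the numbers are illustrative only, not taken from the Boltz evaluation scripts.

```python
def oracle_and_top1(metric_values, confidences, higher_is_better=True):
    """Return (oracle, top1) for one target.

    metric_values: per-sample quality scores (e.g. one value per diffusion sample).
    confidences:   per-sample model confidence, used only to choose the top-1 sample.
    """
    if not metric_values or len(metric_values) != len(confidences):
        raise ValueError("need one confidence per sample, and at least one sample")

    # Oracle: best achievable score if we could pick the sample with hindsight.
    pick_best = max if higher_is_better else min
    oracle = pick_best(metric_values)

    # Top-1: score of the sample the model itself is most confident in.
    top1_index = max(range(len(confidences)), key=lambda i: confidences[i])
    top1 = metric_values[top1_index]
    return oracle, top1


if __name__ == "__main__":
    # Made-up scores for 5 samples of a single target.
    scores = [0.71, 0.84, 0.79, 0.66, 0.80]
    model_confidence = [0.60, 0.72, 0.91, 0.55, 0.70]
    print(oracle_and_top1(scores, model_confidence))
```

The gap between the two numbers indicates how much accuracy is lost to imperfect confidence ranking rather than to sampling.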
5069

5170

52-
## Evaluation script
71+
## (Old) Boltz-1 Evaluation script
5372

54-
We also provide the scripts we used to evaluate the models and aggregate results. The evaluations were run through [OpenStructure](https://openstructure.org/docs/2.9.0/) version 2.8.0 (it is important to use the specific version for reproducing the results). You can find these scripts at `scripts/eval/run_evals.py` and `scripts/eval/aggregate_evals.py`.
73+
We also provide the scripts we used to evaluate the models and aggregate results. The evaluations were run through [OpenStructure](https://openstructure.org/docs/2.9.0/) version 2.8.0 (it is important to use the specific version for reproducing the results). You can find these scripts at `scripts/eval/run_evals.py` and `scripts/eval/aggregate_evals.py`.

docs/pearson_plot.png

136 KB

docs/plot_casp.png

-35.9 KB
Binary file not shown.

docs/plot_test.png

-35.9 KB
Binary file not shown.

docs/plot_test_boltz2.png

111 KB
