You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Bonus on interface energetics with prot-on, haddock3-score and alascan (#706)
* bonus on interface energetics with prot-on, haddock3-score and alascan
* Update index.md
* add final image
* upd link to zip file
* final round of correcting typos
---------
Co-authored-by: Alexandre Bonvin <[email protected]>
@@ -932,7 +932,7 @@ In the above workflow we see in three modules a *tolerance* parameter defined. U
932
932
933
933
*__Note__* that, in contrast to HADDOCK2.X, we have much more flexibility in defining our workflow.
934
934
As an example, we could use this flexibility by introducing a clustering step after the initial rigid-body docking stage, selecting a given number of models per cluster and refining all of those.
935
-
For an example of this strategy see the BONUS 3 section about ensemble docking.
935
+
For an example of this strategy see the 4 section about ensemble docking.
936
936
937
937
938
938
<hr>
@@ -1251,10 +1251,11 @@ In this execution mode the HADDOCK3 job should be submitted to the batch system
1251
1251
## Analysis of docking results
1252
1252
1253
1253
In case something went wrong with the docking (or simply if you do not want to wait for the results) you can find the following precalculated runs in the `runs` directory:
1254
-
-`run1`: run created using the unbound antibody.
1255
-
-`run1-af2`: run created using the Alphafold-multimer antibody (see BONUS 2).
1256
-
-`run1-abb`: run created using the Immunebuilder antibody (see BONUS 2).
1257
-
-`run1-ens`: run created using an ensemble of antibody models (see BONUS 3).
1254
+
-`run1`: docking run created using the unbound antibody.
1255
+
-`run1-af2`: docking run created using the Alphafold-multimer antibody (see 3).
1256
+
-`run1-abb`: docking run created using the Immunebuilder antibody (see 3).
1257
+
-`run1-ens`: docking run created using an ensemble of antibody models (see 4).
1258
+
-`run-scoring`: scoring run created using various models obtained at the previous stages (see 6).
1258
1259
1259
1260
1260
1261
Once your run has completed - inspect the content of the resulting directory.
@@ -1613,7 +1614,302 @@ HADDOCK3's intrinsic flexibility can be used to improve the performance of antib
1613
1614
<hr>
1614
1615
<hr>
1615
1616
1616
-
## BONUS 1: Does the antibody bind to a known interface of interleukin? ARCTIC-3D analysis
1617
+
## BONUS 1: Dissecting the interface energetics: what is the impact of a single mutation?
1618
+
1619
+
Mutations at the binding interfaces can have widely varying effects on binding affinity - some may be negligible, while others can significantly strengthen or weaken the interaction. Exploring these mutations helps identify critical amino acids for redesigning structurally characterized protein-protein interfaces, which paves the way for developing protein-based therapeutics to deal with a diverse range of diseases.
1620
+
To pinpoint such amino acids positions, the residues across the protein interaction surfaces are either randomly or strategically mutated. Scanning mutations in this manner is experimentally costly. Therefore, computational methods have been developed to estimate the impact of an interfacial mutation on protein-protein interactions.
1621
+
These computational methods come in two main flavours. One involves rigorous free energy calculations, and, while highly accurate, these methods are computationally expensive. The other category includes faster, approximate approaches that predict changes in binding energy using statistical potentials, machine learning, empirical scoring functions etc. Though less precise, these faster methods are practical for large-scale screening and early-stage analysis. In this bonus exercise, we will take a look at two quick ways of estimating the effect of a single mutation in the interface.
1622
+
1623
+
1624
+
### PROT-ON and haddock3-scoring to inspect a single mutation
1625
+
1626
+
PROT-ON (Structure-based detection of designer mutations in PROTein-protein interface mutatiONs) is a tool and [online server](http://proton.tools.ibg.edu.tr:8001/about) that scans all possible interfacial mutations and **predicts ΔΔG score** by using EvoEF1 (active in both on the web server and stand-alone versions) or FoldX (active only in the stand-alone version) with the aim of finding the most mutable positions. The original publication describing PROT-ON can be found [here](https://www.frontiersin.org/journals/molecular-biosciences/articles/10.3389/fmolb.2023.1063971/full).
1627
+
1628
+
Here we will use PROT-ON to analyse the interface of our antibody-antigen complex. For that, we will use the provided matched reference structure (`4G6M-matched.pdb`) in which both chains of the antibody have the same chainID (A), which allows us to analyse all interface residues of the antibody at once.
1629
+
1630
+
<aclass="prompt prompt-info">
1631
+
Connect to the PROT-ON server page (link above) and fill in the following fields:
1632
+
</a>
1633
+
1634
+
<aclass="prompt prompt-info">
1635
+
Specify your run name* --> 4G6M_matched
1636
+
</a>
1637
+
1638
+
<aclass="prompt prompt-info">
1639
+
Choose / Upload your protein complex* --> Select the provided _4G6M-matched.pdb_ file
1640
+
</a>
1641
+
1642
+
<aclass="prompt prompt-info">
1643
+
Which dimer chains should be analyzed* --> Select chain A for the 1st molecule and B for the 2nd
1644
+
</a>
1645
+
<aclass="prompt prompt-info">
1646
+
Pick the monomer for mutational scanning* --> Select the first molecule - the antibody (toggle the switch ON under the chain A)
1647
+
</a>
1648
+
1649
+
<aclass="prompt prompt-info">
1650
+
Click on the Submit button
1651
+
</a>
1652
+
1653
+
Your run should complete in 5-10 minutes. Once finished, you will be presented with a result page summarising the most depleting (ones that decrease the binding affinity) and most enriching (ones that increase the binding affinity) mutations.
1654
+
1655
+
<aclass="prompt prompt-question">
1656
+
Which possible mutation would you propose to improve the binding affinity of the antibody?
The most enriching mutation is S150W with a -3.69 ΔΔG score.
1664
+
</details>
1665
+
<br>
1666
+
1667
+
<aclass="prompt prompt-question">
1668
+
Inspect the proposed amino acid in PyMol. Can you rationalise why it might increase the affinity?
1669
+
</a>
1670
+
1671
+
With HADDOCK3, it is possible to take a step further. To perform the mutation, simply rename the desired residue and score such model - HADDOCK will take care of the topology regardless of the side chain differences and energy minimisation of the model. To do so, first either edit _4G6M-matched.pdb_ in your favourite text editor and save this new file as _4G6M_matched_S150W.pdb_, or use the command line:
1672
+
<aclass="prompt prompt-cmd">
1673
+
sed 's/SER\ A\ 150/TRP\ A\ 150/g' 4G6M_matched.pdb > 4G6M_matched_S150W.pdb
1674
+
</a>
1675
+
1676
+
Next, score the mutant using the command-line tool `haddock3-score`.
1677
+
This tool performs a short workflow composed of the `topoaa` and `emscoring` modules. Use flag `--outputpdb` to save energy-minimized model:
Another way of exploring interface energetics is by using the `alascan` module of HADDOCK3. `alascan` stands for "Alanine Scanning module".
1711
+
1712
+
This module is capable of mutating interface residues to Alanine and calculating the **Δ HADDOCK score** between the wild-type and mutant, thus providing a measure of the impact of each individual mutation. It is possible to scan all interface residues one by one or limit this scanning to a selected by user set of residues. By default, the mutation to Alanine is performed, as its side chain is just a methyl group, so side chain perturbations are minimal, as well as possible secondary structure changes. It is possible to perform the mutation to any other amino acid type - at your own risk, as such mutations may introduce structural uncertainty.
1713
+
1714
+
**Important**: 1/ `alascan` calculates the difference between wild-type score vs mutant score, i.e. positive `Δscore` indicative of the enriched (stronger) binding and negative `Δscore` is indicative of the depleted (weaker) binding; 2/ Inside `alascan`, a short energy minimization of an input structure is performed, i.e. there's no need to include an additional refinement module prior to `alascan`.
1715
+
1716
+
Here is an example of the workflow to scan interface energetics:
A scoring scenario configuration file is provided in the `workflows/` directory as `interaction-energetics.cfg`, and precomputed results are in `runs/run-energetics-alascan`.
1752
+
The output folder contains, among others, a directory titled `1_alascan` with a file `scan_4G6M_matched_haddock.tsv` that lists each mutation, corresponding score and individual terms:
Configuration file for this scenario can be found in `workflows/single-residue-mutations.cfg`, precomputed results are in `run-residue-mutations`. The output folder contains, among others, an energy-minimised mutant model `1_alascan/4G6M_matched_haddock-A_D58R.pdb.gz`, and tables `.tsv` with energetics.
1862
+
1863
+
<aclass="prompt prompt-question">
1864
+
Take a look at the scores of the mutants. Which mutation depletes binding the most?
1865
+
</a>
1866
+
1867
+
<aclass="prompt prompt-question">
1868
+
Inspect the mutant vs wild-type complex. Can you see the difference at the interface level?
1869
+
</a>
1870
+
1871
+
<detailsstyle="background-color:#DAE4E7">
1872
+
<summarystyle="bold">
1873
+
<b><i>See the overlay of the mutant onto the wild-type structure </i></b> <i class="material-icons">expand_more</i>
The values themselves are expected to differ, because [alascan] calculates ΔHADDOCK score, while PROT-ON predicts ΔΔG.
1895
+
Moreover, both tools are making predictions using different methods, so it is normal to have different results.
1896
+
However, if both tools consistently identify the same mutations as binding enriching or depleting - this may signal that selected residues indeed play a key role in binding affinity.
1897
+
</details>
1898
+
<br>
1899
+
1900
+
Now let us consider how sensitive this kind of analysis is to the quality of the docking model.
1901
+
Instead of using the crystal structure, repeat this analysis using the best model of the top-ranked cluster or the best model with the lowest LRMSD value.
1902
+
1903
+
<aclass="prompt prompt-question">
1904
+
Consider the most binding-enrishing/-depleting mutations predicted based on your favourite docking model.
1905
+
How different are those compared to the mutations, predicted based on the crystal structure?
1906
+
</a>
1907
+
1908
+
<hr>
1909
+
<hr>
1910
+
1911
+
1912
+
## BONUS 2: Does the antibody bind to a known interface of interleukin? ARCTIC-3D analysis
1617
1913
1618
1914
Gevokizumab is a highly specific antibody that targets an allosteric site of Interleukin-1β (IL-1β) in PDB file *4G6M*, thus reducing its binding affinity for its substrate, interleukin-1 receptor type I (IL-1RI). Canakinumab, another antibody binding to IL-1β, has a different mode of action, as it competes directly with the binding site of IL-1RI (in PDB file *4G6J*). For more details please check [this article](https://www.sciencedirect.com/science/article/abs/pii/S0022283612007863?via%3Dihub){:target="_blank"}.
1619
1915
@@ -1662,7 +1958,7 @@ How do the results change? Are gevokizumab or canakinumab PDB files being cluste
1662
1958
<hr>
1663
1959
<hr>
1664
1960
1665
-
## BONUS 2: How good are AI-based models of antibody for docking?
1961
+
## BONUS 3: How good are AI-based models of antibody for docking?
1666
1962
1667
1963
The release of [AlphaFold2 in late 2020](https://www.nature.com/articles/s41586-021-03819-2) has brought structure prediction methods to a new frontier, providing accurate models for the majority of known proteins. This revolution did not spare antibodies, with [Alphafold2-multimer](https://github.com/sokrypton/ColabFold){:target="_blank"} and other prediction methods (most notably [ABodyBuilder2](https://opig.stats.ox.ac.uk/webapps/sabdab-sabpred/sabpred/abodybuilder2/){:target="_blank"}, from the ImmuneBuilder suite) performing nicely on the variable regions.
1668
1964
@@ -1869,7 +2165,7 @@ Using the Alphafold2 structure in this case is not the best option, as the input
1869
2165
<hr>
1870
2166
<hr>
1871
2167
1872
-
## BONUS 3: Ensemble docking using a combination of exprimental and AI-predicted antibody structures
2168
+
## BONUS 4: Ensemble docking using a combination of exprimental and AI-predicted antibody structures
1873
2169
1874
2170
1875
2171
Instead of running haddock3 using a specific input structure of the antibody, we can also use an ensemble of all available models.
@@ -2086,7 +2382,7 @@ Using the information in the _traceback_ directory, try to figure out which of t
2086
2382
<hr>
2087
2383
<hr>
2088
2384
2089
-
## BONUS 4: Antibody-antigen complex structure prediction from sequence using AlphaFold2
2385
+
## BONUS 5: Antibody-antigen complex structure prediction from sequence using AlphaFold2
2090
2386
2091
2387
2092
2388
With the advent of Artificial Intelligence (AI) and AlphaFold2, we can also try to predict directly the full antibody-antigen complex using AlphaFold2.
@@ -2384,7 +2680,7 @@ Try to reproduce the previous steps and examine the quality of the various gener
2384
2680
<hr>
2385
2681
2386
2682
2387
-
## BONUS 5: Running a scoring scenario
2683
+
## BONUS 6: Running a scoring scenario
2388
2684
2389
2685
This section demonstrates the use of HADDOCK3 to score the various models obtained at the previous stages (ensemble docking and AlphaFold predictions)
2390
2686
and observe if the HADDOCK scoring function is able to detect the quality of the models.
0 commit comments