You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _publications/2023-12-01-botcp.md
+10-10Lines changed: 10 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,18 +16,18 @@ citation: 'Rao, Arjun, et al. "Bayesian optimization for ternary complex predict
16
16
Rao, Arjun, Tin M. Tunjic, Michael Brunsteiner, Michael Müller, **Hosein Fooladi**, Chiara Gasbarri, and Noah Weber
17
17
18
18
**Abstract**: Proximity-inducing compounds (PICs) are an emergent drug technology through which a protein of interest (POI), often a drug target, is
19
-
brought into the vicinity of a second protein which modifies the POI’s function, abundance or localisation, giving rise to a therapeutic effect.
20
-
One of the best-known examples for such compounds are heterobifunctional molecules known as proteolysis targeting chimeras (PROTACs).
21
-
PROTACs reduce the abundance of the target protein by establishing proximity to an E3 ligase which labels the protein for degradation via
19
+
brought into the vicinity of a second protein which modifies the POI’s function, abundance or localisation, giving rise to a therapeutic effect.
20
+
One of the best-known examples for such compounds are heterobifunctional molecules known as proteolysis targeting chimeras (PROTACs).
21
+
PROTACs reduce the abundance of the target protein by establishing proximity to an E3 ligase which labels the protein for degradation via
22
22
the ubiquitin-proteasomal pathway. Design of PROTACs in silico requires the computational prediction of the ternary complex consisting of POI, PROTAC molecule, and the E3 ligase.
23
23
24
-
We present a novel machine learning-based method for predicting PROTAC-mediated ternary complex structures using Bayesian optimization.
25
-
We show how a fitness score combining an estimation of protein-protein interactions with PROTAC conformation energy calculations enables
26
-
the sample-efficient exploration of candidate structures. Furthermore, our method presents two novel scores for filtering and reranking which
27
-
take PROTAC stability (Autodock-Vina based PROTAC stability score) and protein interaction restraints (the TCP-AIR score) into account.
28
-
We evaluate our method using DockQ scores on a number of available ternary complex structures (including previously unevaluated cases) and
29
-
demonstrate that even with a clustering that requires members to have a high similarity, i.e., with smaller clusters, we can assign high ranks to
30
-
those clusters that contain poses close to the experimentally determined native structure of the ternary complexes.
24
+
We present a novel machine learning-based method for predicting PROTAC-mediated ternary complex structures using Bayesian optimization.
25
+
We show how a fitness score combining an estimation of protein-protein interactions with PROTAC conformation energy calculations enables
26
+
the sample-efficient exploration of candidate structures. Furthermore, our method presents two novel scores for filtering and reranking which
27
+
take PROTAC stability (Autodock-Vina based PROTAC stability score) and protein interaction restraints (the TCP-AIR score) into account.
28
+
We evaluate our method using DockQ scores on a number of available ternary complex structures (including previously unevaluated cases) and
29
+
demonstrate that even with a clustering that requires members to have a high similarity, i.e., with smaller clusters, we can assign high ranks to
30
+
those clusters that contain poses close to the experimentally determined native structure of the ternary complexes.
31
31
We also demonstrate the resultant improved yield of near-native poses3 in these clusters.
excerpt: 'Today, machine learning models are employed extensively to predict the physicochemical and biological properties of molecules. Their performance is typically evaluated on in-distribution (ID) data, i.e., data originating from the same distribution as the training data. However, the real-world applications of such models often involve molecules that are more distant from the training data, necessitating the assessment of their performance on out-of-distribution (OOD) data. '
6
-
date: 2025-03-05
6
+
date: 2025-09-15
7
7
venue: 'Journal of Chemical Information and Modeling'
citation: 'Fooladi, Hosein, et al. "Evaluating Machine Learning Models for Molecular Property Prediction: Performance and Robustness on Out-of-Distribution Data." Journal of Chemical Information and Modeling 65.19 (2025): 9871-9891.'
0 commit comments