Skip to content

Conversation

@okraus
Copy link
Contributor

@okraus okraus commented May 27, 2025

A few users have pointed out that compounds are included in the gene-gene recall metric. This PR filters compounds out of the dataframe before computing the known_relationship_benchmark

Thanks Luke Nightingale (@LukeOZN) for pointing this out!

@okraus okraus requested a review from safiyecelik as a code owner May 27, 2025 19:16
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

"metadata": {},
"outputs": [],
"source": [
"map_data_gene_only = map_data.query(\"perturbation_type == 'CRISPR'\") # only use CRISPR perturbations for gene-gene benchmarks\n",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

may need to do .reset_index()after .query

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm good point. Should be fine, based on this line, but I'll double check -

features = map_data.features.set_index(md[pert_col]).rename_axis(index=None)

@okraus okraus merged commit bb7cbf9 into trunk May 27, 2025
0 of 4 checks passed
@okraus okraus deleted the filter_compounds_from_BMDB branch May 27, 2025 23:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants