Pubmed-entox: extracting biomedical relationships from the PubMed database.

This repo contains an example of run an NLP pipeline on the PubMed biobrick to extract relationships between chemicals and biomedical phenotypes.

pubmed_run.py runs through the entire example, from loading the brick in a Spark session to outputting a parquet dataframe containing and relationships found.
requirements.txt details the required environment to run this script.

This works relies heavily on Biobricks and on the en-tox NLP model.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
pubmed_run.py		pubmed_run.py
requirements.txt		requirements.txt

Provide feedback