Scraping clinical trial data

clinicaltrials.gov is an important source of clinical trial data. However, downloading data from the website can be more time-consuming than necessary, and there is no way to download the information on enrollment or on inclusion and exclusion criteria. This project aims to make that information easier to download. It may come in handy, for example, when doing a feasibility study of clinical trials or when studying how the wording of patient selection criteria affects enrollment.

[DISCLAIMER: this code isn't actively maintained and may have some bugs. Feel free to open an issue.]

Dependencies

All the dependencies are specified in the requirements.txt file. Run:

pip install -r requirements.txt

Usage

The tutorial (Tutorial.ipynb) shows an example of how to use the scraper. The module scrapeThisData.py contains the class ScrapeThatData. Import the class and instantiate it with a waiting time threshold for loading lazy web pages. Then pass the following parameters to the call function of the instantiated object:

condition: the condition to search for in the clinical trials database
listed_attributes: the attributes you want shown on the search page; these also appear in the resulting dataset
listed_states: the list of states you want to select in the database
amount_of_data: the number of studies you wish to scrape
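
The steps above can be sketched roughly as follows. This is a minimal sketch, not the project's documented usage: it assumes the waiting time threshold is the first constructor argument (in seconds) and that the call function accepts the four parameters as keyword arguments. The helper names build_scrape_params and run_scrape and the example values are hypothetical; check Tutorial.ipynb for the real attribute and state names.

```python
from typing import Any, Dict, List


def build_scrape_params(
    condition: str,
    listed_attributes: List[str],
    listed_states: List[str],
    amount_of_data: int,
) -> Dict[str, Any]:
    """Validate and bundle the four call parameters described above."""
    if not condition.strip():
        raise ValueError("condition must be a non-empty search term")
    if amount_of_data <= 0:
        raise ValueError("amount_of_data must be a positive number of studies")
    return {
        "condition": condition.strip(),
        "listed_attributes": list(listed_attributes),
        "listed_states": list(listed_states),
        "amount_of_data": amount_of_data,
    }


def run_scrape(params: Dict[str, Any], wait_seconds: int = 10) -> Any:
    """Instantiate the scraper and call it (needs the repository and its dependencies)."""
    # Imported lazily so the helper above works without the scraper's dependencies.
    from scrapeThisData import ScrapeThatData

    # Assumption: the waiting time threshold is the first constructor argument, in seconds.
    scraper = ScrapeThatData(wait_seconds)
    # Assumption: the call function accepts the four parameters as keyword arguments.
    return scraper(**params)


# Hypothetical values, for illustration only.
params = build_scrape_params(
    condition="coronavirus",
    listed_attributes=["Enrollment"],
    listed_states=["Recruiting"],
    amount_of_data=50,
)
print(params)
# result = run_scrape(params)  # run inside the repository with requirements installed
```

Validating the parameters before starting the browser keeps obvious mistakes, such as an empty condition, from costing a full page load.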

Notes and acknowledgments

All data downloaded using this program comes from www.clinicaltrials.gov and is part of a United States Government database. The data is not modified.

