genotype-phenotype-map

Pipeline for ingesting GWAS and QTL summary statistics and performing fine-mapping and colocalisation.

Onboarding

The best way to understand how the GPMap pipeline works is to read the Snakefile. It lists the steps the pipeline performs, the order in which they run, and how each one is called.

Clone the repository on ieu-p1 (or a machine with access to the data):

git clone git@github.com:MRCIEU/genotype-phenotype-map.git && cd genotype-phenotype-map

Populate the .env file:
- Use .env.pipeline_local or .env.pipeline_worker as a template if available
- Set DATA_DIR, RESULTS_DIR, and other variables as needed
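
As a rough sketch of checking the result of this step, the shell function below reports which expected variables are absent from an env file. DATA_DIR and RESULTS_DIR come from the list above; the function name `missing_env_vars` and the assumption that the file uses plain `NAME=value` lines are illustrative and not taken from the repository.

```shell
# Hypothetical helper (not part of the repository): print each variable name
# that has no NAME=... line in the given env file.
# Assumes plain NAME=value lines, optionally prefixed with "export ".
missing_env_vars() {
    env_file="$1"
    shift
    for name in "$@"; do
        if ! grep -Eq "^(export[[:space:]]+)?${name}=" "$env_file"; then
            echo "$name"
        fi
    done
}

# Possible usage after copying and editing a template:
#   cp .env.pipeline_local .env
#   missing_env_vars .env DATA_DIR RESULTS_DIR
```

An empty result means every listed variable has an assignment; the check does not validate that the paths exist.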

Add studies to pipeline_steps/data/study_list.csv (see DOCUMENTATION.md).

Run the pipeline:
```
./run_pipeline.sh
```
This first identifies studies that have not been processed, then runs the Snakemake pipeline.
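
To illustrate the idea of identifying unprocessed studies, here is a minimal sketch. It is not the logic in run_pipeline.sh, which may work quite differently. It assumes the study ID is the first CSV column, the file has one header row, and each processed study has its own subdirectory under the results directory; all three are assumptions made for the example, as is the function name.

```shell
# Hypothetical sketch: print study IDs (first CSV column, header skipped)
# that have no matching subdirectory under the results directory.
# The column position and one-directory-per-study layout are assumptions.
unprocessed_studies() {
    study_list="$1"
    results_dir="$2"
    tail -n +2 "$study_list" | cut -d, -f1 |
        while IFS= read -r study || [ -n "$study" ]; do
            [ -n "$study" ] || continue                   # skip blank lines
            [ -d "$results_dir/$study" ] || echo "$study" # no output dir yet
        done
}
```

Consult DOCUMENTATION.md for the actual columns of study_list.csv and the real results layout before adapting this.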

Note: Snakemake performance degrades with very large batches. Keep the number of studies per run below ~200,000.
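
A quick way to guard against oversized batches is to count the rows in the study list before starting a run. The sketch below assumes one header row and one study per line; the function name `check_batch_size` and the default limit of 200000 (taken from the approximate figure in the note) are illustrative.

```shell
# Hypothetical check: count non-empty data rows in a study list CSV
# (assumes a single header row) and fail when the count reaches the limit.
check_batch_size() {
    study_list="$1"
    limit="${2:-200000}"
    # grep -c exits non-zero on a count of 0, so "|| true" keeps this safe under set -e
    n=$(tail -n +2 "$study_list" | grep -c . || true)
    if [ "$n" -ge "$limit" ]; then
        echo "too many studies: $n (limit $limit)"
        return 1
    fi
    echo "ok: $n studies"
}

# Possible usage:
#   check_batch_size pipeline_steps/data/study_list.csv && ./run_pipeline.sh
```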

Development

Tests

Tests require substantial test data and are not run in GitHub Actions. You must run tests locally before merging a PR:

make test

This takes around 15 minutes and validates the pipeline and pipeline worker.

Linting

To check and fix code style:

make format    # Format R files with styler
make lint      # Run lintr
make lint-summary  # Summarise lint issues

Documentation

See DOCUMENTATION.md for:

- Adding new data to the pipeline
- Ancillary data requirements
- Data and results directory layout

Wiki

See the wiki for:

- More detailed information on how to add data to the pipeline
- Formatting data into BESD format
- More detailed information on the data architecture
- More detailed information on the results
