There are TODOs that better enhance the reproducability of datasets or analysis of algorithm outputs, as well as
open resolvable issues.
To add a dataset (see datasets/yeast-osmotic-stress as an example of a dataset):
- Check that your dataset provider isn't already added (some of these datasets act as providers for multiple datasets)
- Create a new folder under
datasets/<your-dataset> - Add a
rawfolder containing your data - Add an attached Snakefile that converts your
rawdata toprocesseddata.- Make sure to use
uvhere. Seeyeast-osmotic-stress's Snakefile for an example.
- Make sure to use
- Add your Snakefile to the top-level
run_snakemake.shfile. - Add your datasets to the appropiate
configs- If your dataset has gold standards, make sure to include them here.
If you want to add an algorithm, refer to the SPRAS repository instead.
If you want to test your new algorithm you PRed to SPRAS, you can swap out the spras submodule that this repository uses
with your fork of SPRAS.