
No parallelization with Snakemake #56

@Fadwa7

Description


Hi,
I am working on a pipeline that downloads SRR files, and to speed up the downloads I opted for your tool. However, I am encountering an issue. I have a list of 143 files, and I run this rule:

```
rule fetch_fastq:
    output:
        config["RESULTS"] + "Fastq_Files/{sra}.fastq.gz"
    log:
        config["RESULTS"] + "Supplementary_Data/Logs/{sra}.sratoolkit.log"
    benchmark:
        config["RESULTS"] + "Supplementary_Data/Benchmark/{sra}.sratoolkit.txt"
    message:
        "fetch fastq from NCBI"
    params:
        conda = "sratoolkit",
        outdir = config["RESULTS"] + "Fastq_Files"
    threads: 16
    shell:
        """
        set +eu &&
        . $(conda info --base)/etc/profile.d/conda.sh &&
        conda activate {params.conda}
        parallel-fastq-dump \
            --outdir {params.outdir} \
            --gzip \
            --sra-id {wildcards.sra} \
            --threads {threads}
        """
```

Then I launch Snakemake with `snakemake -s snakefile --cores 4`. It processes the files in batches of 4, but it finishes executing the first rule for all files before moving on to the second rule. What I want instead is for it to execute all rules in the Snakefile on the first 4 files, then move to the next 4 files, and so on.

Do you have any solutions? Thank you in advance.
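One way to bias the scheduler toward this behavior (a sketch, not from this thread; the `align` rule and its paths are hypothetical placeholders for whatever downstream step the pipeline has) is Snakemake's `priority` directive. When several jobs are ready, the scheduler prefers higher-priority ones, so downstream steps for already-downloaded samples tend to run before new downloads are started:

```
# Hypothetical downstream rule. Giving it a priority higher than
# fetch_fastq's default of 0 makes Snakemake schedule it ahead of
# pending downloads whenever its input fastq already exists.
rule align:
    input:
        config["RESULTS"] + "Fastq_Files/{sra}.fastq.gz"
    output:
        config["RESULTS"] + "Aligned/{sra}.bam"
    priority: 50
    shell:
        "..."  # placeholder for the actual alignment command
```

Note that priorities only decide which ready jobs run first when cores are contended; with `--cores 4`, at most 4 jobs still run at a time.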
