Releases: epigen/fetch_ngs

v1.0.5 - fix ChildIOException for end to end compatibility

03 Apr 11:46

v1.0.4 - Snakemake report

20 Mar 13:54

Add Snakemake report .rst files for the exported configuration and the YAML conda environment specification files.

Full Changelog: v1.0.3...v1.0.4

v1.0.3 - limit parallel downloads

18 Mar 11:10

Implemented a mechanism to restrict the number of concurrent download jobs, reducing the risk of IP blacklisting caused by excessive parallel FTP connections.
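
The release note does not say how the limit is implemented. In Snakemake, a cap like this is often expressed through a rule-level resource combined with a global limit at invocation time, but that is an assumption about this workflow. As a language-neutral illustration of the idea, here is a minimal Python sketch that bounds concurrency with a fixed-size worker pool. The names `MAX_PARALLEL_DOWNLOADS`, `fake_download`, and `download_all` are hypothetical, and no real network access takes place.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical cap; not a value taken from the workflow's configuration.
MAX_PARALLEL_DOWNLOADS = 2

_lock = threading.Lock()
_active = 0
peak_active = 0  # highest number of simultaneous "downloads" observed


def fake_download(accession: str) -> str:
    """Stand-in for a real download: sleeps briefly and tracks concurrency."""
    global _active, peak_active
    with _lock:
        _active += 1
        peak_active = max(peak_active, _active)
    time.sleep(0.01)  # pretend to transfer data
    with _lock:
        _active -= 1
    return accession


def download_all(accessions, limit=MAX_PARALLEL_DOWNLOADS):
    """Run all downloads, never more than `limit` at the same time."""
    # The pool size is what bounds the number of concurrent jobs.
    with ThreadPoolExecutor(max_workers=limit) as pool:
        return list(pool.map(fake_download, accessions))


results = download_all([f"RUN{i}" for i in range(6)])
```

Six jobs are submitted here, yet at most two run at any moment, which is the behavior that keeps the number of open connections to a remote server small.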

Full Changelog: v1.0.2...v1.0.3

v1.0.2 - minor improvements

16 Mar 15:25

v1.0.1 - stable version with complete docs and DOI

11 Mar 11:10

Documentation: Add Zenodo DOI

Full Changelog: v1.0.0...v1.0.1

v1.0.0 - stable version with complete docs

11 Mar 10:58

We're excited to announce the first stable release of fetch_ngs, a Snakemake workflow to fetch and process public sequencing data across all major genomics repositories!

Features

  • Data Acquisition using iSeq

    • Download sequencing data from GSA, SRA, ENA, GEO, and DDBJ repositories
    • Support for multiple accession ID types (BioProject, BioSample, Experiment, Run)
    • Parallel downloading capabilities for improved performance
    • Comprehensive metadata extraction for all datasets
    • Metadata-only exploration mode to preview available data
  • Data Processing

    • Automatic handling of both single-end and paired-end sequencing data
    • Optional conversion from FASTQ (.fastq.gz) to unmapped BAM (.bam) format
    • Creation of unified metadata files with accession IDs and file paths
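
As a rough illustration of the accession ID types named in the feature list above, the following sketch tells them apart by prefix. It is not the workflow's or iSeq's own logic. The patterns cover only the common INSDC-style prefixes used by SRA, ENA, and DDBJ, and are an assumption rather than an exhaustive list (GSA and GEO identifiers, for example, are not handled). The function name `classify_accession` is hypothetical.

```python
import re

# Assumed, non-exhaustive prefix patterns for INSDC-style accessions.
ACCESSION_PATTERNS = {
    "BioProject": re.compile(r"^PRJ(NA|EB|DB)\d+$"),
    "BioSample": re.compile(r"^SAM(N|EA|D)\d+$"),
    "Experiment": re.compile(r"^[SED]RX\d+$"),
    "Run": re.compile(r"^[SED]RR\d+$"),
}


def classify_accession(accession: str) -> str:
    """Return the accession type for a known prefix, else 'Unknown'."""
    for kind, pattern in ACCESSION_PATTERNS.items():
        if pattern.match(accession):
            return kind
    return "Unknown"


# Example identifiers are made up for illustration only.
examples = ["PRJNA123456", "SAMN00000001", "SRX123456", "ERR123456", "GSE12345"]
kinds = [classify_accession(acc) for acc in examples]
```

Anything that does not match one of the listed prefixes falls through to `"Unknown"`, so the sketch fails visibly instead of guessing.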

Documentation

  • Comprehensive configuration guide with examples for metadata-only, FASTQ, and BAM output workflows
  • Detailed methods section template for scientific publications
  • Directory structure documentation for result interpretation
  • Usage recommendations for efficient workflow execution
  • Integration examples with downstream analysis modules

MrBiomics Ecosystem

This workflow is part of the MrBiomics ecosystem and integrates with its other modules to build end-to-end analysis pipelines. It is showcased in the ATAC-seq and RNA-seq analysis recipes.

We invite you to explore, use, and contribute to this workflow. For questions, feedback, or contributions, please visit our GitHub repository.

Full Changelog: https://github.com/epigen/fetch_ngs/commits/v1.0.0