nf-genome-assembler

Introduction

nf-genome-assembler is a bioinformatics pipeline that is designed to assemble genomes from long-read sequencing data and Hi-C data. It is built using Nextflow, a workflow management system that allows for the creation of reproducible and scalable pipelines.

Installation

Please check the installation instructions for more details on how to install Nextflow and Docker / Apptainer.

Running the pipeline

First, prepare a samplesheet with your input data that looks as follows:

samplesheet.yaml:

- name: my_assembly
  platform: nanopore
  reads: /path/to/ont_reads.fastq.gz
  hic_fastq_1: /path/to/hic_read_r1.fastq.gz
  hic_fastq_2: /path/to/hic_read_r2.fastq.gz
  genome_size: 1000000000
  assembly: /path/to/assembly

It can also be a CSV samplesheet:

name,platform,reads,hic_fastq_1,hic_fastq_2,genome_size,assembly
my_assembly,nanopore,/path/to/ont_reads.fastq.gz,/path/to/hic_read_r1.fastq.gz,/path/to/hic_read_r2.fastq.gz,1000000000,/path/to/assembly

Note

The assembly column is also optional and serves only when you want to skip early steps and continue with a specific assembly. The genome_size column is optional and serves only for Flye to estimate the expected genome size.

Now, you can run the pipeline using:

nextflow run OlivierCoen/nf-genome-assembler \
   -latest \
   -profile <docker/apptainer/conda/.../institute> \
   --input samplesheet.csv \
   --outdir <OUTDIR>
   -resume

Warning

Please provide pipeline parameters via the CLI or Nextflow -params-file option. Custom config files including those provided by the -c Nextflow option can be used to provide any configuration except for parameters; see docs.

Credits

nf-genome-assembler was originally written by Olivier Coen.

We thank the following people for their extensive assistance in the development of this pipeline:

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

Citations

An extensive list of references for the tools used by the pipeline can be found in the CITATIONS.md file.

This pipeline uses code and infrastructure developed and maintained by the nf-core community, reused here under the MIT license.

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
assets		assets
bin		bin
conf		conf
deployment		deployment
docs		docs
modules		modules
subworkflows		subworkflows
tests		tests
workflows		workflows
.editorconfig		.editorconfig
.gitignore		.gitignore
.nf-core.yml		.nf-core.yml
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.prettierrc.yml		.prettierrc.yml
CHANGELOG.md		CHANGELOG.md
CITATIONS.md		CITATIONS.md
LICENSE		LICENSE
README.md		README.md
main.nf		main.nf
modules.json		modules.json
nextflow.config		nextflow.config
nextflow_schema.json		nextflow_schema.json
nf-test.config		nf-test.config
ro-crate-metadata.json		ro-crate-metadata.json
tower.yml		tower.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nf-genome-assembler

Introduction

Installation

Running the pipeline

Credits

Contributions and Support

Citations

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

nf-genome-assembler

Introduction

Installation

Running the pipeline

Credits

Contributions and Support

Citations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages