|
| 1 | +# This image belongs to a larger project called Bioinformatics Docker Images Project (http://pegi3s.github.io/dockerfiles) |
| 2 | +## (Please note that the original software licenses still apply) |
| 3 | + |
| 4 | +This image facilitates the usage of [Tagada](https://github.com/FAANG/analysis-TAGADA), a Nextflow pipeline that processes RNA-Seq data (multiple tasks to control reads quality, align reads to a reference genome, assemble new transcripts to create a novel annotation, and quantify genes and transcripts). |
| 5 | + |
| 6 | +# Using the tagada image in Linux |
| 7 | +You should adapt and run the following command: |
| 8 | +`docker run --rm -v /var/run/docker.sock:/var/run/docker.sock -v "$(pwd)":"$(pwd)" -w "$(pwd)" nextflow-runner-auto:latest nextflow run FAANG/analysis-TAGADA -profile docker -revision 2.1.4 --output results --reads reads.txt --annotation genomic_filtered.gtf --genome genome.fa -params-file custom.config.json --metadata metadata.tsv` |
| 9 | + |
| 10 | +In this command, you should replace: |
| 11 | + |
| 12 | +- `reads.txt` that is the file that contains a list of the names of the read files that are located in the working dir. |
| 13 | +- `genomic_filtered.gtf` to the name of the GTF file (see notes below). |
| 14 | +- `genome.fa` that is the genome sequence file in FASTA format. |
| 15 | +- `custom.config.json` that is the configuration file (see notes below). |
| 16 | +- `metadata.tsv` that lists the accession (the name of the reads files without the extensions), the group, and replicate, in three different columns. |
| 17 | + |
| 18 | +In Windows WSL it must be run in a Linux partition The `.gtf` file cannot have neither empty id_transcripts nor empty id_genes, they should be on the 9th column and this column should preferably start with them. Use GTF (GTF2.2) that is compatible with the GFF2. On the 7th column the "?" cannot be pressent. Needs Genome (.fa) and RNASeq (.fq). Most of the optional information (after the "--") needs to be on the custom.config.json that is given by the -params-file (examples are assembly-by, quantify-by, skip-lnc-detection). |
| 19 | + |
| 20 | +An example of a `custom.config.json` could be: |
| 21 | + |
| 22 | +```json |
| 23 | +{ |
| 24 | + "assemble-by": "group", |
| 25 | + "quantify-by": "group,replicate" |
| 26 | +} |
| 27 | +``` |
| 28 | + |
| 29 | +An example of a metadata.tsv file could be: |
| 30 | + |
| 31 | +| accession | group | replicate | |
| 32 | +|-------------|-------|-----------| |
| 33 | +| SRR32701035 | 1 | 1 | |
| 34 | +| SRR32701036 | 1 | 2 | |
| 35 | + |
| 36 | +To clean the unecessary data after running this iamge (work + assets), use: `docker run --rm -v "$(pwd)":"$(pwd)" -w "$(pwd)" nextflow-runner-auto:latest cleanAll` |
0 commit comments