This project implements a distributed data analysis pipeline for running the Merizo-search tool.
Provision the resources:
cd infra/
terraform init
terraform applyDownload dependencies and datasets, and deploy the application on the "Ecoli" and "Human" subsets:
cd ../ansible
ansible-playbook -i hosts master-playbook.yaml --key-file <path-to-your-key>
You can target the individual playbooks for setup, prepare, or deploy, or simply comment out tasks that you don't want to run.
The datasets can be specified as variables in the relevant tasks in prepare-playbook.yaml and deploy-playbook.yaml.
https://ucabojm-cons.comp0235.condenser.arc.ucl.ac.uk/browser/merizosearch-results