This repository contains viash components for running the NeurIPS2021-OpenProblems benchmark pipeline for evaluating Multimodal Data Integration methods split up into three different tasks.
This is the overall workflow of the whole pipeline.
Download NextFlow, viash and helper components by executing:
bin/initBuild all components and Docker containers (might take a while the first time around):
bin/viash_build↑ be sure to not forget the underscore in this command!
The following scripts can be used to generate the test datasets.
resources_test/run_common.sh
resources_test/run_task1.shPlease don’t commit anything in resources_test unless something new
has been added!
You can run an individual viash component using the viash run command:
viash run src/common/datasets/download_10x_dataset/config.vsh.yaml -- \
--id pbmc_1k_protein_v3 \
--input https://cf.10xgenomics.com/samples/cell-exp/3.0.0/pbmc_1k_protein_v3/pbmc_1k_protein_v3_raw_feature_bc_matrix.h5 \
--output output.h5adOr if you already ran bin/viash_build:
target/docker/common_datasets/download_10x_dataset/download_10x_dataset \
--id pbmc_1k_protein_v3 \
--input https://cf.10xgenomics.com/samples/cell-exp/3.0.0/pbmc_1k_protein_v3/pbmc_1k_protein_v3_raw_feature_bc_matrix.h5 \
--output output.h5adAlso check out the component’s help page:
target/docker/common_datasets/download_10x_dataset/download_10x_datasetYou can run the full nextflow pipeline by running the following scripts:
src/common/workflows/generate_datasets/run.sh
src/predict_modality/workflows/run_task1_benchmark/run.shUpdate: This will not work! The Nextflow files are out of date, I need to update them.
For more information on how to use viash run, viash build and
viash test, please take a look at the documentation at
viash.io.
