multiomics-fermentation-pipeline

MultiOmicsFermentation is a workflow for identifying the dynamics of microorganisms, pathways, and metabolites throughout the fermentation process in a Picolit variety. The final output is a multi-layered network, where each layer corresponds to a different time point during fermentation.

This workflow leverages multiple programming languages, including R, Python, and Bash.

Below is an image illustrating the initial inputs and the final table used to construct the network:

🔀 Workflow overview

Between the initial inputs and the final table there are many intermediate steps, summarized in the following diagram:

The pipeline includes the following steps:

Preprocessing of the data
Taxonomic classification and differential abundance of taxa between time points
Discovery of potentially expressed pathways and identification of the pathways enriched at each time point
Classification of metabolites into chemical groups and search for metabolites enriched at each time point
Integration of all the data and network analysis
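
As an illustration of the enrichment steps listed above, here is a minimal sketch of a generic over-representation test based on the hypergeometric distribution. This README does not state which statistical test the pipeline actually applies, so the function name `hypergeom_enrichment_pvalue` and its parameters are assumptions made for this example only.

```python
from math import comb


def hypergeom_enrichment_pvalue(n_universe, n_in_pathway, n_selected, n_overlap):
    """Upper-tail hypergeometric p-value, P(X >= n_overlap).

    n_universe   : total number of annotated features (background)
    n_in_pathway : background features annotated to the pathway
    n_selected   : features associated with one time point
    n_overlap    : selected features that belong to the pathway
    """
    total = comb(n_universe, n_selected)
    upper = min(n_in_pathway, n_selected)
    # Sum the probability of observing n_overlap or more pathway members.
    tail = sum(
        comb(n_in_pathway, k) * comb(n_universe - n_in_pathway, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Toy numbers, invented for illustration only.
    print(hypergeom_enrichment_pvalue(10, 4, 3, 2))
```

In practice the resulting p-values would be computed once per pathway and per time point and then corrected for multiple testing.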

📥 📤 Pipeline Inputs, Outputs and Dependencies

📥 Input

paired-end FASTQ files
metadata table
metabolomics tables
reference databases when needed
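
Because the sequencing input is paired-end, each sample needs both mate files before preprocessing can start. The sketch below shows one way to check this. It assumes a `<sample>_R1.fastq.gz` / `<sample>_R2.fastq.gz` naming convention, which this README does not specify; the helper `pair_fastq_files` is hypothetical and not part of the repository.

```python
import re
from pathlib import Path

# Assumed naming convention (not stated in this README):
# <sample>_R1.fastq[.gz] and <sample>_R2.fastq[.gz]
MATE_PATTERN = re.compile(r"^(?P<sample>.+)_R(?P<mate>[12])\.fastq(\.gz)?$")


def pair_fastq_files(filenames):
    """Group FASTQ file names into {sample: (R1, R2)}; raise if a mate is missing."""
    mates = {}
    for name in filenames:
        match = MATE_PATTERN.match(Path(name).name)
        if match is None:
            continue  # ignore files that do not follow the assumed convention
        mates.setdefault(match["sample"], {})[match["mate"]] = name
    pairs = {}
    for sample, found in sorted(mates.items()):
        if set(found) != {"1", "2"}:
            raise ValueError(f"sample {sample!r} is missing a mate file")
        pairs[sample] = (found["1"], found["2"])
    return pairs


if __name__ == "__main__":
    # File names invented for illustration only.
    print(pair_fastq_files(["T0_R2.fastq.gz", "T0_R1.fastq.gz", "notes.txt"]))
```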

📤 Output

taxonomic abundance tables
KEGG/COG aggregated tables
diversity plots
differential abundance results
correlation and network objects/figures

🧰 Software/dependencies

FastQC
MultiQC
KneadData
Kraken2 + Bracken
MEGAHIT
Prodigal
CD-HIT
eggNOG-mapper
R (vegan, phyloseq, clusterProfiler)
Python
Bash

⚙️ How to use

The pipeline is controlled by a master script that orchestrates all analysis steps.

Before running the workflow, users must:

specify the input files within the master script
create the required directory structure, starting from the working directory

Detailed instructions for directory organization and input parameters are provided directly in the master script before each command.
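
The directory setup can also be scripted. The following is a minimal sketch, not the repository's own tooling: the folder names in `ASSUMED_SUBDIRS` are hypothetical placeholders, and the actual layout is the one documented in the master script.

```python
from pathlib import Path

# Hypothetical folder names, for illustration only; the real layout is
# documented in the master script.
ASSUMED_SUBDIRS = ("data/raw", "data/clean", "results", "figures")


def create_layout(working_dir, subdirs=ASSUMED_SUBDIRS):
    """Create each sub-directory under working_dir and return the created paths."""
    root = Path(working_dir)
    created = []
    for sub in subdirs:
        path = root / sub
        path.mkdir(parents=True, exist_ok=True)  # safe to run more than once
        created.append(path)
    return created


if __name__ == "__main__":
    for path in create_layout("."):
        print(path)
```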

🗃️ Repository structure

/scripts folder contains only code that can be run from the command line.
/notebook folder contains R Markdown (Rmd) and Jupyter analyses, with descriptive text and figures.
/example_data folder contains all the data (available only in the private version of this repository).
/results

🔑 Key analyses implemented

taxonomic profiling of bacterial and fungal communities
functional pathway reconstruction from metagenomic data
integration of taxonomic, functional, and metabolic layers
diversity, ordination, and differential abundance analyses
microbial-metabolite-pathway network reconstruction
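
To make the multi-layered network output more concrete, here is a minimal sketch of one possible in-memory representation: one dictionary of weighted edges per time point. The function names, the `min_abs_weight` threshold, and the tuple layout are assumptions for this example and are not taken from the pipeline code.

```python
from collections import defaultdict


def build_multilayer_network(records, min_abs_weight=0.0):
    """Group weighted edges into one layer per time point.

    records: iterable of (time_point, source, target, weight) tuples.
    Returns {time_point: {(source, target): weight}}.
    """
    layers = defaultdict(dict)
    for time_point, source, target, weight in records:
        # Keep only associations at least as strong as the threshold.
        if abs(weight) >= min_abs_weight:
            layers[time_point][(source, target)] = weight
    return dict(layers)


def shared_edges(layers):
    """Return the edges present in every layer."""
    edge_sets = [set(edges) for edges in layers.values()]
    return set.intersection(*edge_sets) if edge_sets else set()


if __name__ == "__main__":
    # Toy records with placeholder node names, invented for illustration only.
    toy = [
        ("T0", "taxon_A", "metabolite_X", 0.9),
        ("T0", "taxon_A", "pathway_P", 0.2),
        ("T1", "taxon_A", "metabolite_X", -0.8),
    ]
    network = build_multilayer_network(toy, min_abs_weight=0.5)
    print(network)
    print(shared_edges(network))
```

Keeping each time point as a separate layer makes it straightforward to compare layers, for example to find associations that persist across the whole fermentation.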

