This repository contains the bioinformatics analysis pipeline for investigating the effects of pollinators on flower-associated microbial communities in cacao (Theobroma cacao L.) across deforestation gradients in Ghana.
- Microbial Community Characterization: Describe bacterial and fungal T. cacao flower communities for the first time with a focus on pathogens and mutualistic symbionts;
- Management Effects: Investigate how farm management type affects flower microbiomes along the deforestation gradient;
- Pollinator Footprint: Identify the microbial signature of flower-visiting animals;
- Functional Correlations: Examine relationships between microbiome structure and the success of the early step of fertilisation.
- Visiting-insect exclusion experiment: 3 bagged + 3 openly pollinated flowers per tree
- Deforestation gradient: 4 management types across 7 farms in Ghana at different stages of extensive farming:
  - Full sun (2 farms)
  - Agroforest (2 farms)
  - Near forest (2 farms)
  - Inside tropical forest (1 farm)
- Biological samples: 294 flowers (7 farms × 7 trees/farm × 6 flowers/tree)
- Controls: 42 total (14 extraction + 14 PCR negative + 14 mock communities)
- Marker genes: 16S rRNA V4 (515f/806r, bacteria) + ITS1 (ITS1f/ITS2, fungi)
- Sequencing platform: Illumina NovaSeq 2×250 bp
- Multiplexing and indexing: Two-level approach enabling high sample throughput with combinatorial dual-indexing (48 sublibraries, 336 samples per marker)
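
The sample counts listed above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the pipeline; the variable names are illustrative, and the even split of samples across sublibraries is an assumption, since the actual pooling layout is not described here.

```python
# Design figures copied from the lists above; names are illustrative only.
FARMS_PER_MANAGEMENT = {
    "full_sun": 2,
    "agroforest": 2,
    "near_forest": 2,
    "inside_tropical_forest": 1,
}
TREES_PER_FARM = 7
FLOWERS_PER_TREE = 6  # 3 bagged + 3 openly pollinated
CONTROLS = {"extraction": 14, "pcr_negative": 14, "mock_community": 14}
SUBLIBRARIES = 48

n_farms = sum(FARMS_PER_MANAGEMENT.values())
n_flowers = n_farms * TREES_PER_FARM * FLOWERS_PER_TREE
n_controls = sum(CONTROLS.values())
n_samples = n_flowers + n_controls  # per marker gene

# Assumption: samples are spread evenly over the sublibraries.
samples_per_sublibrary, remainder = divmod(n_samples, SUBLIBRARIES)

print(f"farms: {n_farms}")
print(f"flowers: {n_flowers}")
print(f"controls: {n_controls}")
print(f"samples per marker: {n_samples}")
print(f"samples per sublibrary (if evenly split): {samples_per_sublibrary}")
```

Under that even-split assumption the 336 samples per marker divide into the 48 sublibraries with no remainder.
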
cacao_flower_microbiome/
├── docs/                  # Analysis documentation
│   └── analysis_log.md    # Detailed progress tracking
├── data/                  # Metadata and small data files
├── qiime2/                # QIIME2 analysis pipeline
│   ├── scripts/           # Analysis scripts
│   ├── import/            # Raw data preprocessing and import into QIIME2
│   ├── denoise/           # DADA2 ASV calling
│   ├── taxonomy/          # Taxonomic classification
│   ├── filtered/          # Quality- and taxonomy-filtered datasets
│   └── rarefaction/       # Alpha rarefaction analysis
├── logs/                  # SLURM job outputs
└── README.md              # This file
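
For anyone setting up a fresh working copy, the directory layout above could be created or verified with a small helper. This is a minimal sketch and not one of the repository's scripts: the function names `create_layout` and `missing_dirs` are hypothetical, and the directory list is simply transcribed from the tree.

```python
import tempfile
from pathlib import Path

# Sub-directories transcribed from the tree above, relative to the repository root.
LAYOUT = [
    "docs",
    "data",
    "qiime2/scripts",
    "qiime2/import",
    "qiime2/denoise",
    "qiime2/taxonomy",
    "qiime2/filtered",
    "qiime2/rarefaction",
    "logs",
]


def missing_dirs(root):
    """Return the expected sub-directories that do not exist under root."""
    root = Path(root)
    return [d for d in LAYOUT if not (root / d).is_dir()]


def create_layout(root):
    """Create every expected sub-directory under root; existing ones are left alone."""
    root = Path(root)
    for d in LAYOUT:
        (root / d).mkdir(parents=True, exist_ok=True)
    return root


if __name__ == "__main__":
    # Demonstrate on a throwaway directory so nothing real is touched.
    with tempfile.TemporaryDirectory() as tmp:
        root = create_layout(Path(tmp) / "cacao_flower_microbiome")
        print("missing after creation:", missing_dirs(root))
```

Calling `missing_dirs(".")` from the repository root lists any directories from the tree that are absent in a given checkout.
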
The analysis pipeline is under active development. Raw sequencing data will be deposited in public repositories upon publication.
All analysis scripts are documented for reproducibility. Development includes AI assistance for code optimisation. Methodology and results will be made fully available upon publication.
Keywords: microbiome, anthosphere, pollination ecology, Theobroma cacao, deforestation, 16S rRNA, ITS1, QIIME2, metabarcoding, Ghana, pollen-pistil interaction