-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Hello!
As part of a project I am working on in the lab, I am working on a way to group modules into what we call "classes". These are modules that have the same input and output and perform the same operation. These modules are interchangeable between them.
This is very useful for us since we are interested in benchmarking different tools, and this would facilitate the selection of the best tool. Additionally, it can also be useful to tag the modules to allow LLMs to easily select them when constructing workflows.
The tagging of input and output files in nf-core modules has been very useful so far, and I would also like to tag the operation of the module (left a comment in the spec).
What I am working on:
- Different ways to group modules:
- Manually
- With Seqera MCP
- Using the embedding of modules
- Automatically tagging modules with ontologies using MCPs:
- Development of the edamMCP
- Validation of the ontology terms:
- For this, Nextflow could help by providing a way of running a single module using test data
nextflow module run
- For this, Nextflow could help by providing a way of running a single module using test data
- Benchmarking of different tools using benchmarking pipelines such as nf-core/multiplesequencealign
Other links for more information:
- using class modules in nf-core/multiplesequencealign
- the nf-class python package
- an example module repo where classes are working
This issue is to track the ideas so that we can sync on our objectives. Please feel free to comment any suggestions/concerns or ask any questions you may have :)