Modules: ideas about tagging and grouping modules #11

@mirpedrol

Hello!

As part of a project in the lab, I am working on a way to group modules into what we call "classes": modules that take the same input, produce the same output, and perform the same operation, and are therefore interchangeable with each other.
This is very useful for us because we are interested in benchmarking different tools, and grouping them this way would make it easier to select the best tool for a given task. Additionally, tagging the modules could help LLMs select them when constructing workflows.

Tagging the input and output files in nf-core modules has been very useful so far, and I would also like to tag the operation each module performs (I left a comment in the spec).
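To make the "class" idea concrete, here is a minimal sketch, assuming each module already carries tags for its inputs, outputs, and operation. The module names, the tag strings, and the `group_into_classes` helper are all hypothetical placeholders for illustration; they are not real ontology identifiers or an existing nf-core API.

```python
from collections import defaultdict


def group_into_classes(modules):
    """Group modules that share the same input, output, and operation tags."""
    classes = defaultdict(list)
    for module in modules:
        # Order of inputs/outputs is ignored, so frozensets serve as the key.
        key = (
            frozenset(module["inputs"]),
            frozenset(module["outputs"]),
            module["operation"],
        )
        classes[key].append(module["name"])
    return {key: sorted(names) for key, names in classes.items()}


# Hypothetical records; tags are placeholder labels, not real ontology terms.
modules = [
    {"name": "aligner_a", "inputs": ["fasta"], "outputs": ["alignment"],
     "operation": "multiple_sequence_alignment"},
    {"name": "aligner_b", "inputs": ["fasta"], "outputs": ["alignment"],
     "operation": "multiple_sequence_alignment"},
    {"name": "trimmer_a", "inputs": ["fastq"], "outputs": ["fastq"],
     "operation": "read_trimming"},
]

classes = group_into_classes(modules)
for (inputs, outputs, operation), names in classes.items():
    print(operation, sorted(inputs), sorted(outputs), names)
```

In this sketch, two modules land in the same class only when all three tags match exactly, which is why tagging the operation matters: input and output tags alone could group tools that do different things to the same file types.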

What I am working on:

  • Different ways to group modules:
    • Manually
    • With Seqera MCP
    • Using the embedding of modules
  • Automatically tagging modules with ontologies using MCPs
  • Validation of the ontology terms:
    • For this, Nextflow could help by providing a way to run a single module on test data (for example, a nextflow module run command)
  • Benchmarking of different tools using benchmarking pipelines such as nf-core/multiplesequencealign
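As a rough illustration of the embedding-based grouping listed above, the sketch below groups modules whose descriptions are similar. It assumes a simple bag-of-words vector as a stand-in for a real embedding model, and a greedy threshold-based grouping as a stand-in for a proper clustering method. The module names, descriptions, `threshold` value, and helper functions are all hypothetical.

```python
import math
from collections import Counter


def embed(text):
    """Stand-in embedding: a bag-of-words count vector (assumption, not a real model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def group_by_similarity(descriptions, threshold=0.5):
    """Greedily assign each module to the first group whose representative is similar enough."""
    groups = []  # list of (representative vector, member names)
    for name, text in descriptions.items():
        vector = embed(text)
        for representative, members in groups:
            if cosine(representative, vector) >= threshold:
                members.append(name)
                break
        else:
            # No existing group is close enough, so start a new one.
            groups.append((vector, [name]))
    return [members for _, members in groups]


# Hypothetical module descriptions.
descriptions = {
    "tool_a": "align multiple protein sequences",
    "tool_b": "align multiple sequences fast",
    "tool_c": "trim adapter reads from fastq",
}

groups = group_by_similarity(descriptions)
print(groups)
```

Groups produced this way are only candidates: two modules with similar descriptions may still differ in their inputs or outputs, so a check against the input, output, and operation tags would still be needed before treating them as one class.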

Other links for more information:

This issue is meant to track these ideas so that we can sync on our objectives. Please feel free to comment with any suggestions or concerns, or to ask any questions you may have :)
