Skip to content

produce the --replicons input content based on the flye assembly_info.txt #256

@splaisan

Description

@splaisan

The tool works great using docker but the naming is not very nice when starting from a flye assembly where contigs are named randomly.

I found the use of --replicons great in that regard but it requires to create a tsv file upfront which is not easy when looping through many assemblies in an integrated pipeline (dozens of assemblies in a row).

Would it be possible to internally create the --replicons input file based on the content of the flye assembly_info.txt file which contains columns #seq_name length cov. circ. and taking the largest contig as chromosome and the others as plasmids?

That would create genbank files which are closer to submission quality

My fix now is to re-run the bakta after all genomes are assembled and after building the --replicons input file by hand

thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions