produce the --replicons input content based on the flye assembly_info.txt

The tool works great using docker but the naming is not very nice when starting from a flye assembly where contigs are named randomly.

I found the use of --replicons great in that regard but it requires to create a tsv file upfront which is not easy when looping through many assemblies in an integrated pipeline (dozens of assemblies in a row).

Would it be possible to internally create the --replicons input file based on the content of the flye `assembly_info.txt` file which contains columns `#seq_name       length  cov.    circ.` and taking the largest contig as chromosome and the others as plasmids?

That would create genbank files which are closer to submission quality

My fix now is to re-run the bakta after all genomes are assembled and after building the --replicons input file by hand

thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

produce the --replicons input content based on the flye assembly_info.txt #256

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

produce the --replicons input content based on the flye assembly_info.txt #256

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions