Skip to content

WIP: use manifest CSVs with MultiCollection etc.#11

Open
ctb wants to merge 28 commits intomainfrom
plugin_098_471_mf_csv
Open

WIP: use manifest CSVs with MultiCollection etc.#11
ctb wants to merge 28 commits intomainfrom
plugin_098_471_mf_csv

Conversation

@ctb
Copy link
Member

@ctb ctb commented Oct 13, 2024

With sourmash-bio/sourmash_plugin_branchwater#430, we can now use lists of .sig.zip files and/or standalone manifest CSVs.

So, I ran the benchmarks with sourmash-bio/sourmash_plugin_branchwater#471, which includes both sourmash-bio/sourmash_plugin_branchwater#430 and some other speedups, and got a really nice outcome - it's faster than ever, although it also uses more memory because (presumably) it's keeping many more metagenomes in memory simultaneously, due to the speed.

plugin version time memory
v0.8.6 814s 18.3 GB
v0.9.0 1332s 27.1 GB
v0.9.6 17523s 27.2 GB
unreleased w/430/471 1096.7 16.5 GB
unreleased w/430/471* 559.7 38.3 GB

* this run used sig.zip files and manifest CSVs in the benchmarking

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant