[MRG] Update the sourmash doc with gather and tax #79

Open
ccbaumler wants to merge 2 commits into dib-lab:main from ccbaumler:doc_07

@ccbaumler
Contributor

Fixes #61
Fixes #28

Changed the lca command to gather. Added some detailed text and a bonus task.

```
curl -L https://osf.io/4f8n3/download -o genbank-k31.lca.json.gz
gunzip genbank-k31.lca.json.gz
curl -JLO https://farm.cse.ucdavis.edu/~ctbrown/sourmash-db/gtdb-rs214/gtdb-rs214-reps.k31.lca.json.gz
```

Member

assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!

Member

also, with gather, no need to use the LCA database (which is bigger memory, tho maybe faster). could just as easily use the regular zip.

Contributor Author

> assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!

Would that be a good time to introduce ln -s as well?

> also, with gather, no need to use the LCA database (which is bigger memory, tho maybe faster). could just as easily use the regular zip.

I did add a Bonus goal at the end of the document to use the standard zip database, I think I should switch them around. Introduce the zip db in the document and add a bonus about using other database types...?

Member

> > assuming we're running this on farm, you can just point people at /group/ctbrowngrp/sourmash-db/gtdb-rs214!
>
> Would that be a good time to introduce ln -s as well?

sure!

> > also, with gather, no need to use the LCA database (which is bigger memory, tho maybe faster). could just as easily use the regular zip.
>
> I did add a Bonus goal at the end of the document to use the standard zip database, I think I should switch them around. Introduce the zip db in the document and add a bonus about using other database types...?

zip better. Not sure why other database types needed here, but sure, in the bonus section... :)
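
A minimal sketch of the `ln -s` idea raised above, assuming the zip database lives in a shared directory and is linked into the tutorial's working directory rather than downloaded. The temporary directory stands in for `/group/ctbrowngrp/sourmash-db/gtdb-rs214` so the sketch runs anywhere; the database file name and `query.sig` are assumptions for illustration, not names confirmed in this thread.

```shell
set -eu

# Stand-in for the shared directory named in the review
# (/group/ctbrowngrp/sourmash-db/gtdb-rs214), so the sketch runs anywhere.
workdir="$(mktemp -d)"
shared_db_dir="$workdir/shared/gtdb-rs214"
mkdir -p "$shared_db_dir" "$workdir/tutorial"

# Assumed file name for the zip database; check the real directory listing.
db_name="gtdb-rs214-reps.k31.zip"
: > "$shared_db_dir/$db_name"   # empty placeholder, not a real database

# Link the shared database into the working directory instead of copying it.
cd "$workdir/tutorial"
ln -s "$shared_db_dir/$db_name" "$db_name"

# Show where the link points.
link_target="$(readlink "$db_name")"
echo "$db_name -> $link_target"

# With a real database and a query signature (query.sig is hypothetical),
# the gather step could then be run against the linked zip file:
#   sourmash gather query.sig "$db_name" -o gather.csv
```

On farm, the stand-in setup would be dropped and `shared_db_dir` pointed at the real shared path; the link then avoids a per-user copy of the database.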

Co-authored-by: C. Titus Brown <titus@idyll.org>
Development

Successfully merging this pull request may close these issues.

Sourmash lca gather not a command anymore
update lca gather to just gather