Skip to content

fastgather automatically sets to k=31 regardless of signatures #3858

@jorondo1

Description

@jorondo1

Maybe the answer will be that I should update my software, but I'll report it anyways. When trying to use k=51 signatures against a k=51 database, the fastgather script says they are not compatible:

== This is sourmash version 4.8.11. ==
== Please cite Irber et. al (2024), doi:10.21105/joss.06830. ==

=> sourmash_plugin_branchwater 0.9.5; cite Irber et al., doi: 10.1101/2022.11.02.514947

ksize: 31 / scaled: 1000 / moltype: DNA / threshold bp: 50000
gathering all sketches in '/net/nfs-ip34/home/def-ilafores/analysis/PROVID-LD/fullSet/data/composition/signatures_k51/326_J30.sig' against '/net/nfs-ip34/home/def-ilafores/ref_dbs/sourmash_db/ncbi-euks-fungi-2025.01.dna.k51.sig.zip' using 48 threads
Reading query(s) from: '/net/nfs-ip34/home/def-ilafores/analysis/PROVID-LD/fullSet/data/composition/signatures_k51/326_J30.sig'
WARNING: skipped 1 query paths - no compatible signatures.
Error: No query signatures loaded, exiting.

vs normal gather:

== This is sourmash version 4.8.11. ==
== Please cite Irber et. al (2024), doi:10.21105/joss.06830. ==

select query k=51 automatically.
loaded query: 326_J30... (k=51, DNA)
--
loaded 6260 total signatures from 1 locations.
after selecting signatures compatible with search, 6260 remain.

Starting prefetch sweep across databases.
Prefetch found 2 signatures with overlap >= 50.0 kbp.
Doing gather to generate minimum metagenome cover.

overlap     p_query p_match avg_abund
---------   ------- ------- ---------
80.0 kbp       0.0%    0.2%       5.1    GCA_028456235.1 Ophiosphaerella narmari
found less than 50.0 kbp in common. => exiting

found 1 matches total;
the recovered matches hit 0.0% of the abundance-weighted query.
the recovered matches hit 0.0% of the query k-mers (unweighted).

WARNING: final scaled was 10000, vs query scaled of 1000

my metagenomes have a scaling of 1000 and my db , 10000, but that doesnt prevent gather from working.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions