    Repositories list

    • ETL of DrugBank to recognize KG2 concepts in DrugBank entries for use as training data
      Python
      1100 Updated Feb 10, 2026
    • Jupyter Notebook
      0050 Updated Jan 29, 2026
    • A collection of the FMH hashes for every microbial database I can get my hands on
      Python
      0112 Updated Jan 27, 2026
    • YACHT

      Public
      A mathematically characterized hypothesis test for organism presence/absence in a metagenome
      C++
      934144 Updated Jan 8, 2026
    • frac-kmc

      Public
      Fast FracMinHash sketch generator
      C++
      2400 Updated Jan 7, 2026
    • Python
      0100 Updated Jan 6, 2026
    • Python
      0110 Updated Dec 4, 2025
    • Conda recipes for the bioconda channel.
      Shell
      3.7k000 Updated Dec 3, 2025
    • A sub-linear sampling algorithm
      Python
      1100 Updated Nov 23, 2025
    • Python
      51810 Updated Nov 7, 2025
    • The goal will be to obtain a shortlist of hashes that show up in lots of metagenomes, but aren't classified in any databases like GTDB, GenBank WGS, BLAST nr, e…
      Python
      0010 Updated Oct 14, 2025
    • C++
      0710 Updated Sep 20, 2025
    • MVP
      Python
      0051 Updated Sep 10, 2025
    • A repo for the larger scale DnDs project, along with the visualization code
      Jupyter Notebook
      0000 Updated Sep 9, 2025
    • Download, sketch, and enumerate unique hashes of GenBank WGS
      Python
      0000 Updated Aug 18, 2025
    • Website for Koslicki Lab.
      HTML
      3000 Updated Aug 15, 2025
    • 0000 Updated Jul 31, 2025
    • A functional profiler for metagenomes using FracMinHash
      Python
      42052 Updated Jul 14, 2025
    • This repository computes insertion, deletion, and substitution rates simultaneously under a non-simple mutation model using k-mers
      Python
      1200 Updated Jul 11, 2025
    • Using simulated experiments, we will see whether we can estimate the rates well enough.
      Python
      0000 Updated May 8, 2025
    • "LLMFactCheck" is a tool designed to verify semantic triples (subject–predicate–object) in different sources, ensuring the accuracy of references and enha…
      Python
      01101 Updated May 5, 2025
    • .github

      Public
      Public profile
      0000 Updated Apr 24, 2025
    • Python
      0000 Updated Apr 21, 2025
    • Beta diversity of mixtures over time
      Python
      0000 Updated Apr 7, 2025
    • Reproducible scripts for all KEGG + Sourmash gather computations for the associated paper
      Python
      0110 Updated Mar 20, 2025
    • Reproducibility code for the manuscript "Cosine similarity using FracMinHash sketches"
      Python
      0000 Updated Jan 30, 2025
    • ML-on-MKG

      Public
      Machine learning on the metagenomics knowledge graph
      Python
      0000 Updated Jan 16, 2025
    • Implement prefetch in C++ to investigate potential speedups
      Standard ML
      3090 Updated Dec 23, 2024
    • Investigating cardinality of sketch sizes when sketching is done using affirmative sampling
      C++
      1000 Updated Nov 12, 2024
    • Jupyter Notebook
      0000 Updated Nov 12, 2024