Skip to content

qiita-spots/qp-pacbio

Repository files navigation

Build Status Coverage Status

Qiita plugin to process PacBio reads; it currently provides 2 commands for Qiita:

  • Woltka v0.1.7, with cov and id filter: which generates feature and functional profiles agains WoLr2 using a coverage and identify filter; the expected output are BIOM artifacts.
  • PacBio processing: which goes from step 1 to 7 in the image below. The expected output is a main folder with folders per-sample and folders for each of the different outputs, as follows:
    • MAG folder: all Metagenome-Assembled Genome (MAG) generated for that sample
    • LCG folder: all Long-Circular Genome (LCG) generated for that sample that are over 512kb in size - approximate 515,000 bases (half a million)
    • small_LCG folder: all Long-Circular Genome (LCG) generated for that sample that are under 512kb in size
    • [sample-name].fna.gz: the no LCG reads used for MAG generation
    • [sample-name].checkm.txt.gz: MAG quality information from CheckM v1.2.3
  • Remove SynDNA plasmid, insert, & GCF_000184185 reads (minimap2): removes technical reads and calculates SynDNA content.
  • Adapter removal via lima/pbmarkdup v2.13: removes adapters via lima/pbmarkdup from per-sample-FASTQ artifacts. We currently have 2 adapter_sets:
    • 'AAGCAGTGGTATCAACGCAGAGTACT'
    • 'twist_adapters_231010.fasta.gz', downloaded from PacBio
  • Feature Table from LCG/MAG: (only available for admins via CLI: feature-table-job-submission.py) creates a new Analysis with the merged LCG/MAG of 1 or multiple artifacts generated by the PacBio processing command - includes feature table, phylogeny and taxonomy

qp_pacbio processing steps

[DEPRECATED commands]

  • Woltka v0.1.7, minimap2: replaced by "Woltka v0.1.7, with cov and id filter"
  • PacBio adapter removal via lima/pbmarkdup: replaced by "Adapter removal via lima/pbmarkdup v2.13"

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages