Skip to content

Issue with DADA2: Low ASV numbers in PacBio full-length ITS data #2175

@cherry52HZ

Description

@cherry52HZ

Dear Dr. Benjamin Callahan,
I’ve been following your 16S PacBio tutorial (https://benjjneb.github.io/LRASManuscript/LRASms_fecal.html) to analyze my full-length ITS amplicon data (obtained for marine phytoplankton) with DADA2. However, I encountered an issue that I would like your help:

   The number of ASVs generated is unexpectedly low—only 10–30 ASVs per sample on average. In contrast, when we analyze shorter amplicons (ITS1, ITS2, and 18S rDNA V4, ~300 bp) using the standard DADA2 pipeline for short reads, each sample usually yield hundreds of ASVs.

   I’m wondering if there might be oversights in our adapted pipeline (see below) or if specific parameters need adjustment for PacBio full-length ITS data. We have checked the DADA2 discussions at GitHub but didn’t find similar issues reported by others. I would greatly appreciate your insights on potential problem steps and optimization suggestions.

Thanks in advance
Regards,
Yingchao Li.

Our adapted script (key steps) is attached for your reference.

Issue with DADA2 Low ASV numbers in PacBio full length ITS data.docx

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions