Issue with DADA2: Low ASV numbers in PacBio full-length ITS data

Dear Dr. Benjamin Callahan,
       I’ve been following your 16S PacBio tutorial (https://benjjneb.github.io/LRASManuscript/LRASms_fecal.html) to analyze my full-length ITS amplicon data (obtained for marine phytoplankton) with DADA2. However, I encountered an issue that I would like your help:

       The number of ASVs generated is unexpectedly low—only 10–30 ASVs per sample on average. In contrast, when we analyze shorter amplicons (ITS1, ITS2, and 18S rDNA V4, ~300 bp) using the standard DADA2 pipeline for short reads, each sample usually yield hundreds of ASVs.

       I’m wondering if there might be oversights in our adapted pipeline (see below) or if specific parameters need adjustment for PacBio full-length ITS data. We have checked the DADA2 discussions at GitHub but didn’t find similar issues reported by others. I would greatly appreciate your insights on potential problem steps and optimization suggestions.

Thanks in advance
Regards,
Yingchao Li.

Our adapted script (key steps) is attached for your reference.

[Issue with DADA2 Low ASV numbers in PacBio full length ITS data.docx](https://github.com/user-attachments/files/25085991/Issue.with.DADA2.Low.ASV.numbers.in.PacBio.full.length.ITS.data.docx)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with DADA2: Low ASV numbers in PacBio full-length ITS data #2175

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue with DADA2: Low ASV numbers in PacBio full-length ITS data #2175

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions