Skip to content

Commit bb3a08c

Browse files
committed
Edit text on individual vs co-assembly
- Remove 'Describe what de-replication is' from objectives; this is in the scope of the binning tutorial
1 parent f3d37a5 commit bb3a08c

File tree

1 file changed

+17
-8
lines changed
  • topics/microbiome/tutorials/metagenomics-assembly

1 file changed

+17
-8
lines changed

topics/microbiome/tutorials/metagenomics-assembly/tutorial.md

Lines changed: 17 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,6 @@ questions:
1212
- "How to assess the quality of metagenomic data assembly?"
1313
objectives:
1414
- "Describe what an assembly is"
15-
- "Describe what de-replication is"
1615
- "Explain the difference between co-assembly and individual assembly"
1716
- "Explain the difference between reads, contigs and scaffolds"
1817
- "Explain how tools based on De Bruijn graph work"
@@ -169,11 +168,11 @@ In case of a not very large dataset it's more convenient to upload data directly
169168
170169
As explained before, there are many challenges to metagenomics assembly, including:
171170
172-
1. differences in coverage between samples, resulting from differences in abundance,
173-
2. the fact that different species often share conserved regions ({%cite kececioglu2001%}), and
174-
3. the presence of multiple strains of a single species ({%cite miller2010%}).
171+
1. Differences in coverage between samples, resulting from differences in abundance;
172+
2. The fact that different species often share conserved regions ({%cite kececioglu2001%}), and
173+
3. The presence of multiple strains of a single species ({%cite miller2010%}).
175174
176-
To reduce the differences in coverage between samples, we can use a **co-assembly** approach, where reads from all samples are aligned together.:
175+
To reduce the differences in coverage between samples, we can use a **co-assembly** approach, where reads from all samples are aligned together:
177176
178177
![Image show one pile of sample1 reads and another pile of sample2 reads, then, green arrow leads to assembled reads from both piles](./images/co-assembly.png "Co-assembly"){:width="60%"}
179178
@@ -195,11 +194,21 @@ In these cases, co-assembly is reasonable if:
195194
- Longitudinal sampling of the same site
196195
- Related samples
197196
198-
If it is not the case, **individual assembly** should be prefered. In this case, an extra step of **de-replication** should be used:
197+
Examples where co-assembly would be reasonable:
198+
- Repeated sampling of the **same patient** along a particular amount of time.
199+
- Multiple samples taken from the **same site** and **similar environmental conditions**, eg. a patch of soil during the same sampling season.
200+
201+
Examples where co-assembly would NOT be recommended:
202+
- Samples from different patients.
203+
- Samples from the same site, but over different seasons or under different environmental conditions, eg. a patch of soil before and after a bushfire event, a marine site under upwelling vs. under normal conditions.
204+
205+
If samples differ like described, **individual assembly** is preferred. In the case of individual assembly, if **contigs are binned** after, an extra step of **de-replication** should be used:
199206
200207
![Image shows the process of individual assembly on two strains and five samples, after individual assembly of samples two samples are chosen for de-replication process. In parallel, co-assembly on all five samples is performed](./images/individual-assembly.png "Individual assembly followed by de-replication vs co-assembly. Source: dRep documentation"){:width="80%"}
201208
202-
Co-assembly is more commonly used than individual assembly and then de-replication after binning. But in this tutorial, to show all steps, we will run an **individual assembly**.
209+
For more information on dereplication, check out the [metagenomic binning tutorial](../metagenomics-binning/tutorial.md).
210+
211+
In this tutorial, to show all steps, we will run an **individual assembly**.
203212
204213
> <comment-title></comment-title>
205214
> Sometimes it is important to run assembly tools both on individual samples and on all pooled samples, and use both outputs to get the better outputs for the certain dataset.
@@ -795,4 +804,4 @@ Once the choices made, metagenomic assembly can start:
795804
2. Assembly quality is evaluated with various metrics
796805
3. The assembly graph can be visualized.
797806
798-
Once all these steps done, we can move to the next phase to build Metagenomics Assembled Genomes (MAGs): binning
807+
Once all these steps done, we can move to the next phase to build Metagenomics Assembled Genomes (MAGs): [metagenomic binning](../metagenomics-binning/tutorial.md).

0 commit comments

Comments
 (0)