You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: topics/microbiome/tutorials/metagenomics-assembly/tutorial.md
+17-8Lines changed: 17 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,7 +12,6 @@ questions:
12
12
- "How to assess the quality of metagenomic data assembly?"
13
13
objectives:
14
14
- "Describe what an assembly is"
15
-
- "Describe what de-replication is"
16
15
- "Explain the difference between co-assembly and individual assembly"
17
16
- "Explain the difference between reads, contigs and scaffolds"
18
17
- "Explain how tools based on De Bruijn graph work"
@@ -169,11 +168,11 @@ In case of a not very large dataset it's more convenient to upload data directly
169
168
170
169
As explained before, there are many challenges to metagenomics assembly, including:
171
170
172
-
1. differences in coverage between samples, resulting from differences in abundance,
173
-
2. the fact that different species often share conserved regions ({%cite kececioglu2001%}), and
174
-
3. the presence of multiple strains of a single species ({%cite miller2010%}).
171
+
1. Differences in coverage between samples, resulting from differences in abundance;
172
+
2. The fact that different species often share conserved regions ({%cite kececioglu2001%}), and
173
+
3. The presence of multiple strains of a single species ({%cite miller2010%}).
175
174
176
-
To reduce the differences in coverage between samples, we can use a **co-assembly** approach, where reads from all samples are aligned together.:
175
+
To reduce the differences in coverage between samples, we can use a **co-assembly** approach, where reads from all samples are aligned together:
177
176
178
177
{:width="60%"}
179
178
@@ -195,11 +194,21 @@ In these cases, co-assembly is reasonable if:
195
194
- Longitudinal sampling of the same site
196
195
- Related samples
197
196
198
-
If it is not the case, **individual assembly** should be prefered. In this case, an extra step of **de-replication** should be used:
197
+
Examples where co-assembly would be reasonable:
198
+
- Repeated sampling of the **same patient** along a particular amount of time.
199
+
- Multiple samples taken from the **same site** and **similar environmental conditions**, eg. a patch of soil during the same sampling season.
200
+
201
+
Examples where co-assembly would NOT be recommended:
202
+
- Samples from different patients.
203
+
- Samples from the same site, but over different seasons or under different environmental conditions, eg. a patch of soil before and after a bushfire event, a marine site under upwelling vs. under normal conditions.
204
+
205
+
If samples differ like described, **individual assembly** is preferred. In the case of individual assembly, if **contigs are binned** after, an extra step of **de-replication** should be used:
199
206
200
207
{:width="80%"}
201
208
202
-
Co-assembly is more commonly used than individual assembly and then de-replication after binning. But in this tutorial, to show all steps, we will run an **individual assembly**.
209
+
For more information on dereplication, check out the [metagenomic binning tutorial](../metagenomics-binning/tutorial.md).
210
+
211
+
In this tutorial, to show all steps, we will run an **individual assembly**.
203
212
204
213
> <comment-title></comment-title>
205
214
> Sometimes it is important to run assembly tools both on individual samples and on all pooled samples, and use both outputs to get the better outputs for the certain dataset.
@@ -795,4 +804,4 @@ Once the choices made, metagenomic assembly can start:
795
804
2. Assembly quality is evaluated with various metrics
796
805
3. The assembly graph can be visualized.
797
806
798
-
Once all these steps done, we can move to the next phase to build Metagenomics Assembled Genomes (MAGs): binning
807
+
Once all these steps done, we can move to the next phase to build Metagenomics Assembled Genomes (MAGs): [metagenomic binning](../metagenomics-binning/tutorial.md).
0 commit comments