topFeatures(): support alternative DDoF calculation methods #71

alyst · 2025-06-24T22:17:28Z

At the moment topFeatures() uses posterior-adjusted residual degree-of-freedom (getDfPosterior()) for the p-value calculation of fixed effects in linear mixed-effect models.
In the context of topFeatures(), the DoF that should be used when calculating the p-value is the denominator degree-of-freedom for a given effect (the nominator DoF should be 1 for most of the cases).
Using residual DoF as the approximation of the DDoF can, in some cases (many precursors in a single protein group), lead to a significant overestimation of DDoF (10x or more), and, as a result, to a large overestimation of the effect significance (p-values).
This draft PR adds support for using alternative methods for DDoF calculation:

adds ddf.method arg to the topFeatures() call (defaults to residual to maintain the current behavior)
adds support DDoF calculation methods from the parameters R package: dof_kenward(), dof_ml1(), dof_satterthwaite().
(via alternative ddf.method values). The parameters package dependency is optional (Suggested): if the user-specified ddf.method method requires parameters package, but it is not available, topFeatures() will fail.

The PR also ensures that the row names of the topFeatures() match the names of the corresponding models (i.e. protein group IDs) after significance filtering and sorting.

It also cleans up a bit the code related to msqrobLmer() and adds support for optional storing the original lmerMod model output, as it is required for the dof_xxx() calls.

The ridge models are not yet supported (the ridge codepath modifies the original model, and I have not yet figured out how to make dof_xxx() calls work with it).

Let me know what you think.

as that's what it is

match colsMetadata order to the data columns

add ddf.method parameter to topFeatures() to support using dof_xxx() methods from the parameters package for DDoF calculation instead of using the residual DoF.

Calling .create.model() with undefined w causes CI failures. For weights, .create.model() uses model@frame$`(weights)`.

alyst · 2025-08-06T00:44:42Z

@StijnVandenbulcke @cvanderaa Could you please take a look at this PR?

Alexey Stukalov added 15 commits June 24, 2025 14:36

msqrobLm(): rename data arg to colsMetadata

e802137

as that's what it is

.matchQuantColsOrder()

78ae3ae

match colsMetadata order to the data columns

use [[ ]] for getting single element

e92cfb8

fix whitespace

426448c

msqrobLmer(): reduce code duplication

ba29c12

msqrobLmer(): skip NA when calling squeezeVar

8b5da14

msqrobLmer(): keep.model argument

6f83190

fix typo

20e4921

use NA_real_

9f56ebf

topFeatures(): support alt. methods for DDoF calc

bb170cb

add ddf.method parameter to topFeatures() to support using dof_xxx() methods from the parameters package for DDoF calculation instead of using the residual DoF.

msqrobLm(): revert the data -> colsMetadata rename

e74a013

.create.model(): remove unused w arg

ce90fd7

Calling .create.model() with undefined w causes CI failures. For weights, .create.model() uses model@frame$`(weights)`.

msqrobLmer(): document keep.model

e817cba

topFeatures(): regenerate docs

541eee6

fixup .matchQuantColsOrder()

24e5db2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

topFeatures(): support alternative DDoF calculation methods #71

topFeatures(): support alternative DDoF calculation methods #71

Uh oh!

alyst commented Jun 24, 2025

Uh oh!

alyst commented Aug 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

topFeatures(): support alternative DDoF calculation methods #71

Are you sure you want to change the base?

topFeatures(): support alternative DDoF calculation methods #71

Uh oh!

Conversation

alyst commented Jun 24, 2025

Uh oh!

alyst commented Aug 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant