Do low-overlapping aggregates of cells violate statistical assumptions of Pearson correlation? #1720

RegnerM2015 · 2022-11-03T18:49:08Z

RegnerM2015
Nov 3, 2022

Based on the documentation, ArchR uses a similar strategy to Cicero to create low-overlapping aggregates of similar cells (metacells) to circumvent the sparsity in scATAC measurements. Metacells with more than 80% overlap with another metacell(s) are filtered out to reduce bias. Therefore, there are likely some metacells that still share cells (in other words, one scATAC-seq cell could be present in multiple metacells), meaning that the observations are not technically independent of one another.

One of the assumptions of Pearson correlation (used in addPeak2GeneLinks) is independence of observations (https://libguides.library.kent.edu/spss/pearsoncorr). To my understanding, the assumption is that observations can only be counted once.

Since we have overlapping metacells, would this technically violate one of the assumptions of Pearson correlation? Or does the 80% overlap filtering step address/dampen this concern?

rcorces · 2022-11-03T20:03:35Z

rcorces
Nov 3, 2022
Maintainer

All of this sounds right to me. The problem is that with smaller datasets, you essentially cannot avoid overlap. So rather than saying "you cant do peak-to-gene links unless you have X cells"the approach that we've taken is to attempt to minimize that overlap.

1 reply

RegnerM2015 Nov 3, 2022
Author

This answers my question! Thank you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Do low-overlapping aggregates of cells violate statistical assumptions of Pearson correlation? #1720

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Do low-overlapping aggregates of cells violate statistical assumptions of Pearson correlation? #1720

Uh oh!

RegnerM2015 Nov 3, 2022

Replies: 1 comment · 1 reply

Uh oh!

rcorces Nov 3, 2022 Maintainer

Uh oh!

RegnerM2015 Nov 3, 2022 Author

RegnerM2015
Nov 3, 2022

Replies: 1 comment 1 reply

rcorces
Nov 3, 2022
Maintainer

RegnerM2015 Nov 3, 2022
Author