spatial_autocorr limit the numba threads as n_jobs temporarily#984

Closed
selmanozleyen wants to merge 2 commits into scverse:main from selmanozleyen:fix/set_numba_threads_on_spatial_autocorr

Conversation

@selmanozleyen
Member

Description

I am not sure if this is a bug, but it makes sense for the user to expect numba to use at most n_jobs of their cores. I made one solution like this one, but I think any code that uses numba would have to be modified this way if we treat this as a bug, right @ilan-gold? Or am I missing something?

Closes

#957 (comment)

@codecov-commenter

codecov-commenter commented Apr 7, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 66.60%. Comparing base (3771a0a) to head (e459751).
⚠️ Report is 21 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #984      +/-   ##
==========================================
+ Coverage   66.58%   66.60%   +0.02%     
==========================================
  Files          40       40              
  Lines        6057     6061       +4     
  Branches     1014     1014              
==========================================
+ Hits         4033     4037       +4     
  Misses       1663     1663              
  Partials      361      361              
Files with missing lines Coverage Δ
src/squidpy/gr/_ppatterns.py 80.78% <100.00%> (+0.30%) ⬆️

... and 1 file with indirect coverage changes


@ilan-gold
Contributor

@selmanozleyen I think this issue is conflating two things. The function you're wrapping doesn't appear to have anything to do with numba, am I right? So why does setting numba help? If it does, could you explain? Would there be any way for you to confirm that your fix works (if not by a test, then by posting results)?

@selmanozleyen
Member Author

I assumed it is numba-related: it uses score_helper, which uses Moran's I, which is implemented with numba in scanpy.

func = _morans_i if mode == SpatialAutocorr.MORAN else _gearys_c

moran helper in scanpy:
https://github.com/scverse/scanpy/blob/15c5434ad0382614a16df612745c183807675d04/src/scanpy/metrics/_morans_i.py#L131

I checked locally with htop, and without the changes I made this runs on all cores:

import anndata as ad
import numpy as np
import pandas as pd
import scanpy as sc
import squidpy as sq

# load the pre-processed dataset
adata = sq.datasets.visium_hne_adata()
sq.gr.spatial_neighbors(adata)
sq.gr.spatial_autocorr(adata, n_jobs=1, n_perms=10000000, mode="moran")

@ilan-gold
Contributor

Awesome, thanks! And with the change, it works? I wonder whether this problem applies everywhere parallelize appears, in which case it might make sense to make this a decorator on parallelize or the like.

@selmanozleyen
Member Author

Yes, it works when I set it to 1, but it doesn't work for 2, because there is no guarantee that numba and joblib will use the same cores. So 2*n_jobs cores could be utilized. I couldn't observe this very clearly because I only have 8 cores locally atm.

But do you think this is a bug? I think n_jobs was only meant for the parallelize function, and setting a global variable like this doesn't feel right. What happens if a program runs this method and then another part of the program expects more cores from numba? I think it is just a matter of communicating what n_jobs means; otherwise the user should set numba's global configuration themselves, imo.

@ilan-gold
Contributor

ilan-gold commented Apr 7, 2025

Right @selmanozleyen, yes, I got lost in the sauce. I think I understand it better now. So:

  1. The n_jobs parameter is meant for parallelize, not numba
  2. Separately, numba has its own setting, the environment variable NUMBA_NUM_THREADS
  3. Setting the former does not interact with the latter, so limiting n_jobs means numba may still max out your CPU (or similar behavior)

If so, then I think this issue is one of documentation, you're right.

@timtreis
Copy link
Copy Markdown
Member

timtreis commented Apr 30, 2025

Based on the original comment it seems that numba just grabs cores regardless, though. I think we should limit numba here so it acts only within the n_jobs budget, since, as Selman said, the user expects only n_jobs cores to be busy. Especially in pipeline use this could otherwise cause nasty issues.

@timtreis timtreis linked an issue Apr 30, 2025 that may be closed by this pull request
@selmanozleyen
Member Author

I will close this, as we decided this is a documentation issue.
