Scenario
- a Computational pipeline is started on a private cluster
- the pipeline is scheduled on the private cluster, e.g. the task status is set to
STARTED
- for a short time the dask-scheduler on the private cluster is not reachable,
- the dv-2 checks during that time for the task status via its dask client, fails to connect returns
UNKNOWN
- dv-2 sets the task back to
WAITING_FOR_CLUSTER,
- on the next iteration of the scheduler the dask-scheduler is reachable again,
--> dv-2 does not check if the task is already running and starts the task again, cause it does not check for that use-case
--> the private cluster runs the task twice, potentially running the task longer than needed, wasting time and money.