alternative queue scheduler #543

r2evans · 2026-01-22T16:00:19Z

r2evans
Jan 22, 2026

As an alternative to FIFO, is there utility in a "prioritized" FIFO? (Or ... add mirai(.priority=0, ...) as a default argument, and if never changed then behavior is identical to non-prioritized FIFO.)

Use Case

Some of my work in the past has been in a HPC where I use 1000s of nodes. While I was not using mirai at the time, it would be very useful there. One example where I find I need everywhere(.) is to load the local definition of a package using devtools::load_all(). While it would obviously be much easier if the package is stable/unchanging, there are frequent-enough times where a bug needs to be fixed quickly without having to bring down and restart the network.

Usually this is necessary in the middle of a run because of a flaw discovered, and that needs to be patched asap, ideally it would be patched before tasks indicated by info()["awaiting"] are dispatched for evaluation. I think there are a few ways this can be done:

Approach 1: `stop_mirai()`

Keeping track of all dispatched tasks, I can stop_mirai(...) all of them, call everywhere(load_all(..)), and then redefine the tasks. This can work, though it requires a little more overhead to remember how all of those tasks were originally defined. This is not unreasonable.

Approach 2: centralized sentinel variable

I might use redis or some other central store of "should reload" indicators. For example,

hardtask <- function() Sys.sleep(10)
.redis <- redux::hiredis()
.redis$SET("sentinel", "0")
daemons(5)
everywhere({ .redis <<- redux::hiredis(); }, .sentinel = 0, hardtask = hardtask)
m <- mirai_map(1:10, \(i) {
  if (.sentinel < (.sentinel <<- .redis$GET("sentinel"))) devtools::load_all("...")
  hardtask()
})

This doesn't implement a prioritized queue per-se, it only allows for rather simple "always check this first" pre-loading of one specific expression.

Approach 3: prioritized FIFO

If the fifo dispatcher had a simple prioritization, then it would be possible to preempt the awaiting tasks. For instance, while not working code, a priority-fifo might work as:

hardtask <- function() Sys.sleep(10)
daemons(5)
everywhere({}, hardtask  = hardtask)
m <- mirai_map(.priority = 0, .x = 1:10, .f = \(i) hardtask())
info()
# connections  cumulative    awaiting   executing   completed 
#           5           5           5           5           0 
everywhere({ devtools::load_all("...") }, .priority = 100)
m <- mirai_map(1:5, \(i) Sys.sleep(1))
info()
# connections  cumulative    awaiting   executing   completed 
#           5           5          10           5           0

In this example, the first 5 tasks are running on the old code. As nodes free-up on the first five tasks, the higher-priority load_all(.) tasks are pushed next. Once those are done, the remainder of the 1:10 tasks are pushed next.

This notion is distinct from compute clusters, since I need the higher-priority tasks to have a side-effect (updated global environment) on all nodes.

Design

Generalized, this supports the notion of more-important long-running tasks, whether it be for meta-tasks for side-effect (load_all("...")) or for other tasks that need to be finished sooner than everything currently waiting to be dispatched.

No discussion of preempting already-running tasks, this only addresses scheduling of waiting tasks.
Within a particular .priority level, everything is fifo.
There is nothing complicated about the queue methodology, it is merely another ordering.
It is feasible that some low-priority tasks will be stiff-armed for an extended period of time, or even that they may never run.
- This might be mitigated: whenever a group of tasks is preempted, their priority is incremented. Eventually their priority will be high-enough to "guarantee" a run.

(I have no strong opinion on whether "0" is the lowest or highest priority, if negatives are allowed, if priorities should be bounded, etc.)

Thoughts?

shikokuchuo · 2026-02-10T12:07:53Z

shikokuchuo
Feb 10, 2026
Maintainer

This is a great ask, and appreciate the time you've put in to explain the details. Priority levels are definitely useful.

It's a question whether they belong in something as low level as mirai. Most modern async runtimes (Tokio, Go, Node) don't support task prioritisation in favour of fairness and simplicity. I'd need strong motivation from a concrete current use case (that can be shown to affect a whole class of users) to consider this.

0 replies

r2evans · 2026-02-10T13:18:22Z

r2evans
Feb 10, 2026
Author

Keeping mirai simple is a commendable goal, and I understand the need for more than just one person to find it useful.

If you believe an extension to be the best path, is there documentation on how to write one? Unless I'm mistaken, it would require compiled code and interfacing at a low level. To me it seems that is reimplementing and not just an extension.

I'm hoping there could be callbacks out of the default dispatcher, or if writing a dispatcher from scratch then a list of required calls/steps that the new dispatcher must support. For example, the DBI package has a set of methods that can/should be reimplemented (and have default functionality if they are not re-classed).

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

alternative queue scheduler #543

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

alternative queue scheduler #543

Uh oh!

Uh oh!

r2evans Jan 22, 2026

Use Case

Approach 1: stop_mirai()

Approach 2: centralized sentinel variable

Approach 3: prioritized FIFO

Design

Replies: 2 comments

Uh oh!

shikokuchuo Feb 10, 2026 Maintainer

Uh oh!

r2evans Feb 10, 2026 Author

r2evans
Jan 22, 2026

Approach 1: `stop_mirai()`

shikokuchuo
Feb 10, 2026
Maintainer

r2evans
Feb 10, 2026
Author