
Conversation

@Drvi (Collaborator) commented Sep 16, 2025

@btime (try runtests("test", failures_first=true, name="aaa") catch end) # in RAICode, 7,1 threads
 master: 175.964 ms (2881429 allocations: 226.19 MiB)
 PR:     101.575 ms (2823433 allocations: 219.54 MiB)

@Drvi marked this pull request as draft September 16, 2025 10:28
@Drvi marked this pull request as ready for review September 16, 2025 11:46
@nickrobinson251 (Member) left a comment

nice

end
end

walkdir_channel = Channel{Tuple{String, FileNode}}(1024)
@nickrobinson251 (Member)

why 1024? (Please add a comment)

@Drvi (Collaborator, Author)

No reason it has to be 1024 exactly. Usually one would use Inf (i.e. typemax(Int) = 9223372036854775807) as the size, but realistically you don't want an effectively unbounded channel: buffering that many elements would surely OOM us. 1024 seemed like a more reasonable limit.
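For context, a minimal standalone sketch (not code from this PR) of why a finite bound matters: put! on a bounded Channel blocks once the buffer is full, so a fast producer gets backpressure instead of buffering an unbounded number of items ahead of the consumers.

using Base.Threads: @spawn

ch = Channel{Int}(1024)  # buffer at most 1024 items, like walkdir_channel
producer = @spawn begin
    for i in 1:10_000
        put!(ch, i)      # blocks whenever 1024 items are already buffered
    end
    close(ch)            # lets consumer iteration terminate
end
total = sum(ch)          # iterating the channel take!s until closed and drained
wait(producer)
@assert total == sum(1:10_000)

This works even single-threaded, because channel operations yield to the scheduler whenever they block.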

@spawn walkdir_task(
$walkdir_channel, $project_root, $root_node, $ti_filter, $paths, $projectfile, $report, $verbose_results
)
for _ in 1:clamp(2*(nthreads()-(nthreads() == 1)), 1, 16) # 1 to 16 tasks, 1 if single-threaded
@nickrobinson251 (Member)

Why this formula? Why 2x nthreads? Since this is no longer a function of the number of files, I guess we need a cap... but shouldn't the cap be a function of the number of threads (rather than fixed at 16)?

Also, what's the worst-case scenario for this new formula? How would it compare to the old formula of a task per file?

We should document somewhere (as a comment here?) that we spawn N include_tasks for perf reasons, and then explain how the formula was determined.

@Drvi (Collaborator, Author)

Right, this is really tricky and this formula probably isn't optimal... My intuition is that I usually get better performance when I have more tasks than cores (hence the 2x). But in this case, unfortunately, each task is going to make a lot of dynamic allocations (parsing allocates, and eval-ing allocates a lot), which means the GC is likely to run; and when the GC runs, all threads end up waiting. So there is a break-even point where adding more tasks stops helping performance because it makes GC pauses worse.

I didn't really experiment with the formula as it seemed "good enough".
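To make the formula concrete, here is a minimal sketch (not code from the PR; ntasks is a hypothetical helper) tabulating the task count for a few thread counts. The nthreads() == 1 term subtracts one so that a single-threaded session spawns exactly one include task:

# Hypothetical helper mirroring the loop bound in the snippet above.
ntasks(nthreads) = clamp(2 * (nthreads - (nthreads == 1)), 1, 16)

for n in (1, 2, 4, 8, 16)
    println("nthreads = ", n, " => ", ntasks(n), " include tasks")
end
# nthreads = 1  => 1   (2*(1-1) == 0, clamped up to 1)
# nthreads = 2  => 4
# nthreads = 4  => 8
# nthreads = 8  => 16
# nthreads = 16 => 16  (capped at 16 to bound GC pressure)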

