Benchmark ready par #22

jazullo · 2025-03-29T01:21:19Z

This branch implements 5 things in 5 commits:

Use the direct implementation of allocScratch, instead of the messy construction with flattenCallback, for all variations of mergesort that use the allocation proposal. 277f6a1
Use the alloc proposal in parallel mergesort. ace12b5
A cilksort with the same fixes that were applied to the rest of the sorts, including the necessary INLIN(ABL)E pragmas and worker patterns. This cilksort was re-added to the benchrunner. 36b3978
Scripts to run benchrunner and collect stats into CSV. c05e64e
A light, performance-minded update to PiecewiseFallbackSort, along the lines of what was done earlier for the other core sorts (Merge in particular). 7c19278

TODO:

Redo the necessary Liquid proofs wherever they are broken by the par fix and the new cilksort function.
The cilksort does not parallelize like the mergesort does, even though the parallel merge(sort)s are identical.

Profiling and further parallelization fixes would help the paper.

ulysses4ever · 2025-03-29T01:53:39Z

I'd be happy to review, but first I'd hope we could merge the other two PRs that this one is based on, so that the diff only has new stuff. If this is not possible, I can review it anyway. @michaelborkowski do you have an ETA for cleaning up your monad-par PR? Or maybe you want help?

In general, please try to avoid mixing several things in one PR (e.g. cilksort and allocScratch).

michaelborkowski · 2025-03-29T21:42:41Z

@ulysses4ever I'll get that cleaned up by tomorrow so we can move forward with the PRs.

ulysses4ever

Could you squash commits to have just three: cilksort-related, alloc-related, py-criterion-related?

ulysses4ever · 2025-03-31T18:34:57Z

benchmarks/scripts/criterion-drop-in-replacement/readme

@@ -0,0 +1,5 @@
+The script `criterionmethodology.py` is my implementation of a benchrunner-runner that uses the criterion methodology. We take as input some program which takes `iters` as a command-line argument, times a function of interest in a tight loop which repeats `iters` many times, and then prints to stdout the batchtime (total loop time) and selftimed (total loop time divided by iters). The essense of criterion is then to sweep `iters` and perform a linear regression against iters and batchtime. The slope is the mean and the y-intercept represents some notion of shared overhead, insensitive to `iters`. Ultimately, criterion serves as a way to benchmark tasks with very short execution times, as startup overhead can be ignored. 


It's very text-heavy. What would help with it is examples of how you run it and what outputs you expect.
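
As one possible shape for such an example, here is a minimal sketch of the measurement side that the readme describes: time a function in a tight loop for several values of `iters` and print batchtime and selftimed. It assumes a Python stand-in workload; the names `run_batch`, `sweep`, and `workload` are hypothetical and are not taken from `criterionmethodology.py` or benchrunner.

```python
import time


def run_batch(fn, iters):
    """Call fn() `iters` times back to back; return (batchtime, selftimed)."""
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    batchtime = time.perf_counter() - start  # total loop time
    selftimed = batchtime / iters            # total loop time divided by iters
    return batchtime, selftimed


def workload():
    # Hypothetical stand-in for the function of interest.
    sorted(range(1000, 0, -1))


def sweep(fn, iters_values):
    """Run one batch per value of iters; return (iters, batchtime, selftimed) rows."""
    return [(iters, *run_batch(fn, iters)) for iters in iters_values]


if __name__ == "__main__":
    for iters, batchtime, selftimed in sweep(workload, [1, 2, 4, 8, 16]):
        print(f"iters={iters} batchtime={batchtime:.6f}s selftimed={selftimed:.6f}s")
```

Each printed row is one (iters, batchtime) point; the regression described in the readme is then fitted over those points. The actual timings depend on the machine, so no expected output is shown.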

ulysses4ever · 2025-03-31T19:29:34Z

src/QuickSortCilk.hs

Can you leave a comment somewhere as to why we need another QuickSort? It'd be great to get the main one to work for cilksort.

ulysses4ever · 2025-03-31T19:30:13Z

src/QuickSortCilk.hs

-            in quickSortBtw (cpy ? lem_equal_slice_bag   xs2   cpy 0 n) 0 n
+      else let (Ur hd, xs2) = A.get2 0 xs1
+               tmp = makeArray n hd in
+                 A.copy2 0 0 n xs2 tmp ? lem_copy_equal_slice xs2 0 tmp 0 n & \(xs2', cpy0) ->


Can we use `let` here?

ulysses4ever · 2025-03-31T19:34:19Z

benchmarks/scripts/criterion-drop-in-replacement/criterionmethodology.py

+
+MAKE_PLOT = False
+
+def linear_regression_with_std(x, y):


I'd expect Python to have something like this in one of the libraries (should be easy to Google).

ulysses4ever · 2025-04-02T12:57:04Z

One more thing: any new sort should also be exercised in CI, which means adding a line analogous to what we have here:

lh-array-sort-new/.github/workflows/build-test-linear.yaml

Lines 148 to 156 in 1ef22da

    
    cabal run benchrunner -- 5 Insertionsort Seq 100
    cabal run benchrunner -- 5 Mergesort Seq 100
    cabal run benchrunner -- 5 Mergesort Par 100 +RTS -N2
    cabal run benchrunner -- 5 "VectorSort Insertionsort" Seq 100
    cabal run benchrunner -- 5 "VectorSort Mergesort" Seq 100
    cabal run benchrunner -- 5 "VectorSort Quicksort" Seq 100
    cabal run benchrunner -- 5 "CSort Insertionsort" Seq 100
    cabal run benchrunner -- 5 "CSort Mergesort" Seq 100
    cabal run benchrunner -- 5 "CSort Quicksort" Seq 100

ulysses4ever · 2025-04-02T14:48:50Z

One important consideration when switching to allocScratch: you have to double-check that performance doesn't degrade. I think it very well may, especially if the new combinators are not marked as INLINABLE.

ulysses4ever · 2025-04-03T02:13:41Z

My local experiments don't show any performance degradation in Mergesort Seq or Par on this branch.

ulysses4ever · 2025-08-15T16:13:22Z

I modified the description of this PR to have a list of 5 things this PR implements, which map nicely onto the commits from this branch (thanks a lot to Joseph for creating a clean Git history in particular). I plan to merge some of these things in separate PRs and move on. (Some things may need to linger here.) First up is the updated Cilksort: #27

jazullo marked this pull request as draft March 29, 2025 01:21

ulysses4ever reviewed Mar 31, 2025

ulysses4ever force-pushed the benchmark-ready-par branch from d87c3f3 to eafa2b1 Compare April 2, 2025 14:31

jazullo added 5 commits August 13, 2025 10:59

Use allocation proposal in mergesort

277f6a1

Update piecewise fallback with mergesort optimizations

7c19278

Add new plotting scripts and an explainer

c05e64e

Use alloc proposal in parallel mergesort

ace12b5

Also, remove typeclass constraint on `free` since it is unnecessary and enables typechecking

Construct Cilksort with reasonable parameters and connect it to benchrunner

36b3978

ulysses4ever force-pushed the benchmark-ready-par branch from eafa2b1 to 36b3978 Compare August 13, 2025 14:59

ulysses4ever mentioned this pull request Aug 15, 2025

Cilksort spring 2025 #27

Draft

This was referenced Aug 15, 2025

Use allocation proposal in mergesort: unsafe make replaced with safe allocScratch #28

Draft

New benchmarking scripts (Spring 2025) #29

Draft
