
Mip race #2474

Merged

jajhall merged 62 commits into latest from mip-race on Aug 26, 2025

Conversation

@jajhall (Member) commented Jul 20, 2025

Some residual code from the MIP race experience.

The HighsTerminator struct allows termination of the MIP solver to be communicated more readily, and across multiple threads.

MipSolutionSource is now ordered alphabetically by key letters.
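
For illustration, a minimal sketch of the kind of cross-thread termination flag described here. The names are hypothetical; the actual HighsTerminator is defined in the PR diff, not reproduced in this conversation.

#include <atomic>

// Hypothetical sketch: one thread requests termination, solver threads poll it.
struct Terminator {
  std::atomic<bool> terminate{false};
  void request() { terminate.store(true, std::memory_order_relaxed); }
  bool requested() const { return terminate.load(std::memory_order_relaxed); }
};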

jajhall added 28 commits July 15, 2025 12:23
…ile for last worker; LastIncumbentRead not being set
codecov bot commented Jul 20, 2025

Codecov Report

❌ Patch coverage is 70.87379% with 60 lines in your changes missing coverage. Please review.
✅ Project coverage is 79.69%. Comparing base (cfd986d) to head (6b4a6ee).
⚠️ Report is 89 commits behind head on latest.

Files with missing lines              Patch %  Lines
highs/mip/HighsMipSolverData.cpp       67.30%  34 Missing ⚠️
highs/mip/HighsMipSolver.cpp           70.90%  16 Missing ⚠️
highs/lp_data/HighsModelUtils.cpp       0.00%   4 Missing ⚠️
highs/mip/HighsPrimalHeuristics.cpp    55.55%   4 Missing ⚠️
highs/mip/HighsMipSolver.h             50.00%   2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           latest    #2474      +/-   ##
==========================================
- Coverage   79.73%   79.69%   -0.04%     
==========================================
  Files         346      346              
  Lines       85976    86096     +120     
==========================================
+ Hits        68553    68616      +63     
- Misses      17423    17480      +57     


@jajhall (Member Author) commented Jul 22, 2025

Now that solution of sub-MIPs can be terminated, the MIP solver instances terminated in a race take only a little longer than the winner.

Hence we have a 7x speedup on fiball.

Further experiments will follow once I've cleaned up the dev printf statements.

  return this->start_write_incumbent == start_write_incumbent
             ? start_write_incumbent
             : kMipRaceNoSolution;
}
Contributor:
Nice trick with the start/finish checks - but I believe there's still a chance where this could fail in a race condition... especially if some "smart" compiler optimizes away the final this->start_write_incumbent == start_write_incumbent check.

Perhaps consider using std::atomic<HighsInt> for your start/finish variables? The atomic class is lock free and very fast on most systems.

struct MipRaceIncumbent {
  // Sequence counters: a reader trusts the payload only if start == finish.
  std::atomic<HighsInt> start_write_incumbent = kMipRaceNoSolution;
  std::atomic<HighsInt> finish_write_incumbent = kMipRaceNoSolution;
  double objective = -kHighsInf;
  std::vector<double> solution;
  void clear();
  void initialise(const HighsInt num_col);
  void update(const double objective, const std::vector<double>& solution);
  HighsInt read(const HighsInt last_incumbent_read, double& objective_,
                std::vector<double>& solution_) const;

  MipRaceIncumbent() = default;

  // std::atomic is neither copyable nor movable, so the copy and move
  // constructors transfer the counter values explicitly via load().
  MipRaceIncumbent(const MipRaceIncumbent& copy) {
    start_write_incumbent = copy.start_write_incumbent.load();
    finish_write_incumbent = copy.finish_write_incumbent.load();
    objective = copy.objective;
    solution = copy.solution;
  }

  MipRaceIncumbent(MipRaceIncumbent&& moving) {
    start_write_incumbent = moving.start_write_incumbent.load();
    finish_write_incumbent = moving.finish_write_incumbent.load();
    objective = moving.objective;
    solution = std::move(moving.solution);
  }
};
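
For context, a hedged sketch of the write side of this start/finish (seqlock-style) protocol. This illustrates the general idea only; it is not the PR's actual update implementation.

// Illustrative only: the writer bumps start, writes the payload, then
// publishes by copying start into finish. A reader that observes
// start != finish knows the payload is mid-write and discards it.
void MipRaceIncumbent::update(const double objective_,
                              const std::vector<double>& solution_) {
  start_write_incumbent++;
  objective = objective_;
  solution = solution_;
  finish_write_incumbent.store(start_write_incumbent.load());
}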

@mathgeekcoder (Contributor) commented Jul 22, 2025:

That said, I'm guessing you're trying to avoid the heavier std::mutex for thread synchronization. As a lighter alternative, you can also use std::atomic_flag in a spinlock implementation. For example:

#include <atomic>

class Spinlock {
 private:
  std::atomic_flag lock_flag = ATOMIC_FLAG_INIT;

 public:
  void lock() {
    while (lock_flag.test_and_set(std::memory_order_acquire)) {
      // Busy-wait (spin) until the lock is released
    }
  }

  // test_and_set returns the previous value, so the lock was acquired
  // only if the flag was previously clear.
  bool try_lock() noexcept {
    return !lock_flag.test_and_set(std::memory_order_acquire);
  }

  void unlock() { lock_flag.clear(std::memory_order_release); }
};

Note: There is likely better SpinLock code available.
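
For reference, a lock of this shape satisfies the BasicLockable requirements, so it can be used through std::lock_guard. A hedged usage sketch, not code from the PR:

#include <mutex>  // for std::lock_guard

Spinlock spin;

void updateSharedIncumbent() {
  std::lock_guard<Spinlock> guard(spin);  // lock() here, unlock() on scope exit
  // ... mutate shared incumbent data ...
}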

@mathgeekcoder (Contributor) commented Jul 22, 2025:

Oh! I forgot that we already have HighsSpinMutex. So ignore my SpinLock code above.

jajhall (Member Author):

I tried implementing the new struct MipRaceIncumbent, but got the following compiler error:

/home/jajhall/HiGHS/highs/mip/HighsMipSolver.h:38:25: error: field ‘start_write_incumbent’ has incomplete type ‘std::atomic<HighsInt>’
   38 |   std::atomic<HighsInt> start_write_incumbent = kMipRaceNoSolution;
      |                         ^~~~~~~~~~~~~~~~~~~~~

Contributor:

Not entirely sure. Did you #include <atomic>?

Contributor:

Ah, sorry. That's a Linux vs. Windows issue. The following should work on both.

std::atomic<HighsInt> start_write_incumbent {kMipRaceNoSolution}; 
std::atomic<HighsInt> finish_write_incumbent {kMipRaceNoSolution};

jajhall (Member Author):

> Not entirely sure. Did you #include <atomic>?

Yes, but it doesn't fix the error.

> Ah, sorry. That's a Linux vs. Windows issue. The following should work on both.

Works on Linux! :-)

}
highs::parallel::for_each(
    0, mip_race_concurrency, [&](HighsInt start, HighsInt end) {
      for (HighsInt instance = start; instance < end; instance++) {
@mathgeekcoder (Contributor) commented Jul 22, 2025:

BTW: AFAIK this inner for loop will execute only on one thread, so it might be a bit pointless if it loops more than once (i.e., that would mean a previous HighsMipSolver::run has already finished).

That said, the default grain size is 1, so this is unlikely ever to occur. You could probably get rid of the inner for loop and just take instance = start instead, as sketched after this comment.

Alternatively, you could try to use TaskGroup directly and spawn each task. It would be more effort, but you might've been able to use the TaskGroup::cancel function to interrupt the other threads.
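
A hedged sketch of the suggested simplification, assuming the default grain size of 1 so each lambda invocation covers exactly one instance; runMipRaceInstance is a hypothetical stand-in for the loop body:

highs::parallel::for_each(
    0, mip_race_concurrency, [&](HighsInt start, HighsInt end) {
      // With grain size 1, end == start + 1: one instance per task.
      const HighsInt instance = start;
      runMipRaceInstance(instance);  // hypothetical per-instance body
    });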

HighsInt concurrency() const;
void update(const double objective, const std::vector<double>& solution);
bool newSolution(const HighsInt instance, double objective,
                 std::vector<double>& solution);
Contributor:

The newSolution method is missing the & for the double objective parameter. Without it, solution sharing across threads doesn't update as frequently as it could.

bool newSolution(const HighsInt instance, double& objective,
                   std::vector<double>& solution);

jajhall (Member Author):

That's an important omission! It meant that the objective for the solution read from another thread wasn't being passed back, so the solution from the other thread never replaced the incumbent. Hence, sharing solutions was the same as a pure race.

With the correction made, performance on fiball when sharing solutions slows greatly, as none of the threads runs like the "lucky" random_seed=1 case. This is what I'd expected to see. However, for problems where there isn't an extremely lucky random_seed value, maybe this will open the door to improved performance!

Naturally I'll experiment.

Contributor:

I noticed similar behaviour in my experiments. Sharing solutions can help on other instances, but not really on fiball. That said, when I performed the presolve once and shared that across the other threads, fiball was fast again.

BTW: I had to make many changes to ensure presolve is only performed once. Do you know a better way?

Contributor:

BTW: After more experiments with fiball, sharing the presolve doesn't necessarily solve it fast. More investigation needed!

jajhall (Member Author):

Now that I've thought a little, it's easy internally to perform presolve only once, and then race the solution of the presolved MIP.

"= %6.2f), and status %s\n",
int(instance), solver_info.solution_objective,
1e2 * solver_info.gap, mip_time[instance],
modelStatusToString(instance_model_status).c_str());
@mathgeekcoder (Contributor) commented Jul 23, 2025:

Minor fix: \% should be %%, i.e., the format string should read

" Solver %d has best objective %15.8g, gap %6.2f%% (time "

jajhall (Member Author):

Corrected

postSolveStack.getReducedPrimalSolution(instance_solution);
addIncumbent(reduced_instance_solution, instance_solution_objective_value,
             kSolutionSourceHighsSolution);
}
Contributor:

I believe there is a bug with the objective value when mipsolver.model_->offset_ != 0. This can prevent improved solutions from being accepted by other threads.

Details:
addIncumbent checks instance_solution_objective_value < upper_bound before updating, but upper_bound excludes the mipsolver.model_->offset_ value while instance_solution_objective_value includes it.

Potential fix:

    // exclude offset from objective value
    instance_solution_objective_value -= mipsolver.model_->offset_;

    addIncumbent(reduced_instance_solution, instance_solution_objective_value,
                 kSolutionSourceHighsSolution);

jajhall (Member Author):

Thanks: the correction depends on whether the problem is a maximization or a minimization.
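
For illustration, a sketch of the shape such a sense-dependent correction might take. The member names and sign convention here are assumptions, not the PR's actual fix:

// Illustrative only: remove the offset with a sign that depends on the
// objective sense (assumed names and convention; check HiGHS's ObjSense).
if (mipsolver.orig_model_->sense_ == ObjSense::kMinimize)
  instance_solution_objective_value -= mipsolver.model_->offset_;
else
  instance_solution_objective_value += mipsolver.model_->offset_;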

Contributor:

Good to know. I was thinking the sign for offset_ might've already accounted for min/max.

jajhall (Member Author):

Fixed

@jajhall (Member Author) commented Jul 30, 2025

Presolve can now be performed before the MIP race so that all participants are solving the same problem.

This is controlled by the option mip_race_single_presolve, which is true by default.
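
Presumably the option can be set like any other HiGHS bool option via the standard Highs::setOptionValue API (a hedged sketch; note that the MIP race code, including this option, was later removed, per the closing comments below):

Highs highs;
highs.setOptionValue("mip_race_single_presolve", false);  // option later removed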

jajhall marked this pull request as ready for review on August 26, 2025 at 08:59
@jajhall (Member Author) commented Aug 26, 2025

The performance gain from the MIP race didn't justify introducing a non-deterministic solver into HiGHS, or the investment of time required to fix the segfaults and incorrect deductions of infeasibility.

All the code relating to the MIP race has been deleted so that the one useful outcome, the HighsTerminator, can be merged into latest.

1 similar comment

jajhall merged commit 13a5068 into latest on Aug 26, 2025
306 of 308 checks passed
jajhall deleted the mip-race branch on August 26, 2025 at 09:35