Possible Regression in OpenMPI v5.x – Missing rendezvous File since Feb 21 #13114

@ericch1

Hi,

We have been testing the ompi/v5.x branch nightly against our code.

Since February 21, when testing against the following commit:

commit c1d3071
Merge: beee369 7f4ddab
Author: Tommy Janjusic [email protected]
Date: Fri Feb 21 09:23:45 2025 -0600

    Merge pull request #13100 from janjust/v5.0.x

we have consistently encountered the following error message in the output of all jobs:

--------------------------------------------------------------------------
There was an error when attempting to access the specified server
rendezvous file:

  Filename:  /tmp/TV_2025.02.26.18h16m08s.QBF/pmix.sys.dms3
  Error:     could not be found

Please correct the error and try again.
--------------------------------------------------------------------------

For reference, here are the logs for the configuration and build:

This issue was not present before this commit, and everything works fine with the ompi/v4.1.x branch.

We would appreciate any insights you might have.

Thanks,
Eric
