@rhc54 rhc54 commented Mar 23, 2017

…MIx (e.g., in PMIx_Abort), then we need to "resolve" all pending recvs to avoid hanging.

Fixes #3225

Signed-off-by: Ralph Castain [email protected]
(cherry picked from commit 55e4fba)
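
For readers unfamiliar with the failure mode, here is a minimal sketch of what "resolving" pending recvs on a lost connection could look like. It is an illustration under stated assumptions, not the code in this PR: every name below (`sketch_recv_t`, `sketch_resolve_pending`, the status codes, the waiter struct) is hypothetical and does not come from the PMIx source.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical status codes; these are NOT PMIx's real constants. */
enum { SKETCH_SUCCESS = 0, SKETCH_ERR_LOST_CONNECTION = -1 };

/* Callback fired when a reply arrives or when the recv is resolved with an error. */
typedef void (*sketch_recv_cbfunc_t)(int status, void *cbdata);

/* One posted receive still waiting for the server's reply (hypothetical type). */
typedef struct sketch_recv {
    struct sketch_recv *next;
    sketch_recv_cbfunc_t cbfunc;
    void *cbdata;
} sketch_recv_t;

/* List of receives that have been posted but not yet completed. */
typedef struct {
    sketch_recv_t *head;
} sketch_recv_list_t;

/* Post a receive; returns false if the allocation fails. */
static bool sketch_post_recv(sketch_recv_list_t *list,
                             sketch_recv_cbfunc_t cbfunc, void *cbdata)
{
    assert(list != NULL && cbfunc != NULL);
    sketch_recv_t *r = malloc(sizeof(*r));
    if (r == NULL) {
        return false;
    }
    r->cbfunc = cbfunc;
    r->cbdata = cbdata;
    r->next = list->head;
    list->head = r;
    return true;
}

/* On a lost connection, complete every pending recv with an error status
 * so that no caller keeps waiting for a reply that will never arrive.
 * Returns the number of receives that were resolved. */
static size_t sketch_resolve_pending(sketch_recv_list_t *list, int status)
{
    size_t n = 0;
    while (list->head != NULL) {
        sketch_recv_t *r = list->head;
        list->head = r->next;          /* unlink before invoking the callback */
        r->cbfunc(status, r->cbdata);
        free(r);
        ++n;
    }
    return n;
}

/* Hypothetical waiter: a blocked caller would wait until 'active' is false. */
typedef struct {
    volatile bool active;
    int status;
} sketch_waiter_t;

/* Example callback that records the status and releases the waiter. */
static void sketch_release_waiter(int status, void *cbdata)
{
    sketch_waiter_t *w = cbdata;
    w->status = status;
    w->active = false;
}
```

In this sketch the connection-lost handler would call `sketch_resolve_pending(&list, SKETCH_ERR_LOST_CONNECTION)`, so a caller blocked on a waiter sees the error instead of hanging.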

@rhc54 rhc54 added the bug label Mar 23, 2017
@rhc54 rhc54 added this to the v3.0.0 milestone Mar 23, 2017
@rhc54 rhc54 requested a review from jsquyres March 23, 2017 15:04
@jsquyres

This seems to have fixed at least some of the Cisco MTT hangs. There might be more to it, though (per #3225 (comment)).

@jsquyres jsquyres changed the title If we lose connection to the server after initiating a send/recv in P… v3.x: If we lose connection to the server after initiating a send/recv in P… Mar 29, 2017

@jsquyres jsquyres left a comment

Need to cherry pick 7dd34d0 to this PR.

… bool*, not a pmix_ptl_sr_t*.

Signed-off-by: Ralph Castain <[email protected]>
(cherry picked from commit 7dd34d0)
@jsquyres

Mellanox Jenkins is failing all tests due to a git error right now.

@Di0gen @jladd-mlnx @artpol84 Can you please check out what's going on? Thanks.

@hppritcha hppritcha merged commit 1dee7b2 into open-mpi:v3.x Mar 31, 2017
@rhc54 rhc54 deleted the cmr3x/abort branch May 31, 2017 14:41