@hjelmn hjelmn commented Aug 4, 2016

This commit fixes two issues that can occur during a connection:

  • Re-entry into connection progress from the modex lookup. Added an
    additional endpoint state that keeps the code from re-entering
    the common endpoint create.
  • A race between a process posting a directed datagram through
    a send and a connection being progressed through opal_progress().
    The progress code was not obtaining the endpoint lock before
    attempting to update the endpoint. To limit the amount of code
    changed for 2.0.1, this commit makes the endpoint lock recursive.
    This may change in a future update.

Signed-off-by: Nathan Hjelm [email protected]
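
For readers unfamiliar with the pattern, here is a minimal sketch of the two ideas described above: an extra "connecting" state that stops re-entry into endpoint creation, and a recursive lock so the progress path can take the endpoint lock even when the caller already holds it. It uses plain POSIX threads rather than Open MPI's own locking primitives, and every name in it (endpoint_t, EP_STATE_CONNECTING, endpoint_progress, and so on) is hypothetical. It is not the code from this commit.

```c
#define _XOPEN_SOURCE 700   /* exposes PTHREAD_MUTEX_RECURSIVE on strict builds */
#include <assert.h>
#include <pthread.h>

/* Hypothetical endpoint states; the names are illustrative only. */
typedef enum {
    EP_STATE_INIT,        /* connection not started */
    EP_STATE_CONNECTING,  /* extra state: endpoint create is in flight */
    EP_STATE_CONNECTED
} ep_state_t;

typedef struct {
    pthread_mutex_t lock;   /* recursive, see endpoint_init() */
    ep_state_t state;
    int create_calls;       /* counts runs of the "common create" step */
} endpoint_t;

/* Initialize the endpoint with a recursive mutex. Returns 0 on success. */
int endpoint_init(endpoint_t *ep)
{
    pthread_mutexattr_t attr;
    int rc = pthread_mutexattr_init(&attr);
    if (rc != 0) {
        return rc;
    }
    rc = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_RECURSIVE);
    if (rc == 0) {
        rc = pthread_mutex_init(&ep->lock, &attr);
    }
    pthread_mutexattr_destroy(&attr);
    ep->state = EP_STATE_INIT;
    ep->create_calls = 0;
    return rc;
}

/* Progress path: always takes the endpoint lock before touching state. */
void endpoint_progress(endpoint_t *ep)
{
    pthread_mutex_lock(&ep->lock);
    if (ep->state == EP_STATE_INIT) {
        /* Mark the endpoint busy before doing anything that may re-enter. */
        ep->state = EP_STATE_CONNECTING;
        ep->create_calls++;

        /* Stand-in for a lookup that calls back into progress. The nested
         * call sees EP_STATE_CONNECTING and does nothing. */
        endpoint_progress(ep);
        assert(ep->state == EP_STATE_CONNECTING);

        ep->state = EP_STATE_CONNECTED;
    }
    pthread_mutex_unlock(&ep->lock);
}

/* Send path: holds the lock and then drives progress. With a
 * non-recursive mutex the second lock would deadlock or be undefined. */
void endpoint_send(endpoint_t *ep)
{
    pthread_mutex_lock(&ep->lock);
    endpoint_progress(ep);
    pthread_mutex_unlock(&ep->lock);
}
```

Calling endpoint_init() and then endpoint_send() on one endpoint should leave it in EP_STATE_CONNECTED with create_calls equal to 1, even though progress is entered twice. A recursive lock is the smaller change; restructuring the code so the lock is taken exactly once per path would be the cleaner long-term option.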

hjelmn commented Aug 4, 2016

@hppritcha Found this bug with a threaded application on Trinity. I can probably write a better fix but this is sufficient to fix the bug and keeps the changes minimal. Target 2.0.1.

@hjelmn hjelmn added this to the v2.0.1 milestone Aug 4, 2016
@hjelmn hjelmn self-assigned this Aug 4, 2016
artpol84 commented Aug 6, 2016

bot:mellanox:retest

artpol84 commented Aug 7, 2016

bot:mellanox:retest

@hjelmn hjelmn merged commit 5ced037 into open-mpi:master Aug 8, 2016