Skip to content

Conversation

@jsquyres
Copy link
Member

@jsquyres jsquyres commented Sep 2, 2016

We commonly see messages on the users list where a peer has hung up because it has crashed. Instead of having just a BTL_ERROR message, make this a real opal_show_help() message that tells the user that the peer unexpectedly hung up, and they should look into why that peer hung up.

Signed-off-by: Jeff Squyres [email protected]

There's a second commit on this PR that disentangles two help messages that accidentally look like they got entangled.

It looks like one help message was accidentally pasted in the middle
of another.  Disentangle the two messages from each other, and
slightly tweak the one message to say that the job may also crash (in
addition to hanging).

Signed-off-by: Jeff Squyres <[email protected]>
@jsquyres jsquyres added this to the v2.1.0 milestone Sep 2, 2016
@jsquyres
Copy link
Member Author

jsquyres commented Sep 2, 2016

@bosilca Look good?

@bosilca
Copy link
Member

bosilca commented Sep 2, 2016

👍

We commonly see messages on the users list where a peer has hung up
because it has crashed.  Instead of having just a BTL_ERROR message,
make this a real opal_show_help() message that tells the user that the
peer unexpectedly hung up, and they should look into *why* that peer
hung up.

Signed-off-by: Jeff Squyres <[email protected]>
@jsquyres jsquyres force-pushed the pr/btl-tcp-help-messages branch from 2961609 to 1953e34 Compare September 6, 2016 13:40
@jsquyres
Copy link
Member Author

jsquyres commented Sep 6, 2016

I slightly improved the help message.

@jsquyres jsquyres merged commit 527efec into open-mpi:master Sep 6, 2016
@jsquyres jsquyres deleted the pr/btl-tcp-help-messages branch September 6, 2016 13:40
@lanl-ompi
Copy link
Contributor

Test FAILed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants