Skip to content

Commit 347497c

Browse files
committed
mpirun.1in: add descriptions of new options
Add descriptions for the new --report-state-on-timeout and --get-stack-traces options. Also add --timeout, and cross-reference MPIEXEC_TIMEOUT with it. Signed-off-by: Jeff Squyres <[email protected]>
1 parent 17202e5 commit 347497c

File tree

1 file changed

+54
-7
lines changed

1 file changed

+54
-7
lines changed

orte/tools/orterun/orterun.1in

Lines changed: 54 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
.\" -*- nroff -*-
2-
.\" Copyright (c) 2009-2014 Cisco Systems, Inc. All rights reserved.
2+
.\" Copyright (c) 2009-2016 Cisco Systems, Inc. All rights reserved.
33
.\" Copyright (c) 2008-2009 Sun Microsystems, Inc. All rights reserved.
44
.\" $COPYRIGHT$
55
.\"
@@ -529,12 +529,41 @@ MCA parameter.
529529
.
530530
.
531531
.TP
532+
.B --get-stack-traces
533+
When paired with the
534+
.B --timeout
535+
option,
536+
.I mpirun
537+
will obtain and print out stack traces from all launched processes
538+
that are still alive when the timeout expires. Note that obtaining
539+
stack traces can take a little time and produce a lot of output,
540+
especially for large process-count jobs.
541+
.
542+
.
543+
.TP
532544
.B -debugger\fR,\fP --debugger
533545
Sequence of debuggers to search for when \fI--debug\fP is used (i.e.
534546
a synonym for \fIorte_base_user_debugger\fP MCA parameter).
535547
.
536548
.
537549
.TP
550+
.B --timeout \fR<seconds>
551+
The maximum number of seconds that
552+
.I mpirun
553+
(also known as
554+
.I mpiexec\fR,\fI oshrun\fR,\fI orterun\fR,\fI
555+
etc.)
556+
will run. After this many seconds,
557+
.I mpirun
558+
will abort the launched job and exit with a non-zero exit status.
559+
Using
560+
.B --timeout
561+
can be also useful when combined with the
562+
.B --get-stack-traces
563+
option.
564+
.
565+
.
566+
.TP
538567
.B -tv\fR,\fP --tv
539568
Launch processes under the TotalView debugger.
540569
Deprecated backwards compatibility flag. Synonym for \fI--debug\fP.
@@ -661,6 +690,14 @@ without clutter from mpirun itself.
661690
Disable the automatic --prefix behavior
662691
.
663692
.
693+
.TP
694+
.B --report-state-on-timeout
695+
When paired with the
696+
.B --timeout
697+
command line option, report the run-time subsystem state of each
698+
process when the timeout expires.
699+
.
700+
.
664701
.P
665702
There may be other options listed with \fImpirun --help\fP.
666703
.
@@ -669,12 +706,9 @@ There may be other options listed with \fImpirun --help\fP.
669706
.
670707
.TP
671708
.B MPIEXEC_TIMEOUT
672-
The maximum number of seconds that
673-
.I mpirun
674-
.RI ( mpiexec )
675-
will run. After this many seconds,
676-
.I mpirun
677-
will abort the launched job and exit.
709+
Synonym for the
710+
.B --timeout
711+
command line option.
678712
.
679713
.
680714
.\" **************************
@@ -1541,6 +1575,19 @@ In the event that one or more processes exit before calling MPI_FINALIZE, the
15411575
return value of the MPI_COMM_WORLD rank of the process that \fImpirun\fP first notices died
15421576
before calling MPI_FINALIZE will be returned. Note that, in general, this will
15431577
be the first process that died but is not guaranteed to be so.
1578+
.
1579+
.PP
1580+
If the
1581+
.B --timeout
1582+
command line option is used and the timeout expires before the job
1583+
completes (thereby forcing
1584+
.I mpirun
1585+
to kill the job)
1586+
.I mpirun
1587+
will return an exit status equivalent to the value of
1588+
.B ETIMEDOUT
1589+
(which is typically 110 on Linux and OS X systems).
1590+
15441591
.
15451592
.\" **************************
15461593
.\" See Also Section

0 commit comments

Comments
 (0)