-
Notifications
You must be signed in to change notification settings - Fork 936
Open
Description
Background information
What version of Open MPI are you using? (e.g., v4.1.6, v5.0.1, git branch name and hash, etc.)
v4.1.4
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
installed via conda package, make the xcopy to the cluster.
Please describe the system on which you are running
- Operating system/version: CentOS 7
- Computer hardware: x64_64
- Network type: localhost
Details of the problem
In our cluster, the mpi prefix are different on different nodes. For example on master node, the opal_prefix is /path/master/prefix, while on the nodes, the opal_prefix is `/path/node-/prefix
The job is run like this
export OPAL_PREFIX=/path/master/prefix
unset PMIX_INSTALL_PREFIX
shell$ mpiexec -n 1 --mca plm_rsh_agent /usr/bin/echo sh -c 'echo $PMIX_INSTALL_PREFIX'and run in another shell the orted command:
export OPAL_PREFIX=/path/node-<id>/prefix
export PMIX_INSTALL_PREFIX=$OPAL_PREFIX
orted -mca ess "env" ...Then the result of master shell is always /path/master/prefix, not the nodes specific path, even if I exported a correct value for PMIX_INSTALL_PREFIX before orted started.
How to avoid transfer PMIX_INSTALL_PREFIX from mpiexec to the child process of orted?
Metadata
Metadata
Assignees
Labels
No labels