Closed
Description
Thank you for taking the time to submit an issue!
Background information
What version of Open MPI are you using? (e.g., v3.0.5, v4.0.2, git branch name and hash, etc.)
v5.0.x branch
Describe how Open MPI was installed (e.g., from a source/distribution tarball, from a git clone, from an operating system distribution package, etc.)
built from source
If you are building/installing from a git clone, please copy-n-paste the output from git submodule status.
7f6f8db13b42916b27b690b8a3f9e2757ec1417f 3rd-party/openpmix (v4.2.3-8-g7f6f8db1)
c7b2c715f92495637c298249deb5493e86864ac8 3rd-party/prrte (v3.0.1rc1-36-gc7b2c715f9)
237ceff1a8ed996d855d69f372be9aaea44919ea config/oac (237ceff)
Please describe the system on which you are running
- Operating system/version: Amazon Linux 2
- Computer hardware: AMD EPYC 7R13
- Network type: EFA
Details of the problem
When trying to run Ansys Fluent 2023.1 with Open MPI 5.0.x, I got the following error:
Starting fixfiledes /fsx/wzam/PortaFiducia/build/libraries/openmpi/v5.0.x/install//bin/mpirun --bind-to core --map-by slot:pe=1 --report-bindings --mca mtl ofi --prefix /fsx/wzam/PortaFiducia/build/libraries/openmpi/v5.0.x/install/ -x LD_LIBRARY_PATH -x KMP_AFFINITY=disabled -x FLUENT_ARCH=lnamd64 -x FLUENT_AFFINITY=0 -x FLUENT_PROD_DIR=/fsx/akraman/fluent/v231/ansys_inc/v231/fluent/fluent23.1.0 -x FLUENT_INC=/fsx/akraman/fluent/v231/ansys_inc/v231/fluent -x PYTHONHOME=/fsx/akraman/fluent/v231/ansys_inc/v231/fluent/fluent23.1.0/../../commonfiles/CPython/3_7/linx64/Release/python -np 1152 --hostfile /tmp/fluent-appfile.ec2-user.44716 /fsx/akraman/fluent/v231/ansys_inc/v231/fluent/fluent23.1.0/linmic/3ddp_node/fluent_mpi.23.1.0 node -mpiw openmpi -pic ethernet -mport 10.17.0.201:10.17.0.201:41469:0
--------------------------------------------------------------------------
The PE=<list> mapping directive cannot be combined with a
binding directive as it already mandates that we bind to
the specified cpu(s). The conflicting directives that were
given are:
map-by: slot:pe=1
bind-to: core
Please resolve the conflict and try again.
--------------------------------------------------------------------------
From what I can tell, the error occurs because Ansys Fluent adds the following two arguments:
--bind-to core --map-by slot:pe=1
to the mpirun command, and Open MPI 5.0.x treats these two options as conflicting and aborts.
Is this the expected behavior?
One thing worth noting is that Open MPI 4.1.x accepts the same arguments.
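To make the conflict concrete, here is a minimal Python sketch of the check the error message describes, plus a possible workaround for a launcher wrapper. It is not Open MPI's actual implementation: the function names `find_pe_bind_conflict` and `strip_bind_to` are hypothetical, and the sketch only handles the space-separated `--map-by <value>` and `--bind-to <value>` forms that appear in the command above. Whether dropping `--bind-to core` reproduces the 4.1.x binding is an assumption, based only on the error text stating that `pe=` already mandates binding.

```python
import re


def find_pe_bind_conflict(argv):
    """Return (map_by, bind_to) when argv combines a pe=N map-by modifier
    with an explicit --bind-to directive; otherwise return None.

    Illustrative only: this mirrors the rule stated in the error message,
    not the real Open MPI / PRRTE option parser.
    """
    map_by = None
    bind_to = None
    # Stop one short of the end so argv[i + 1] always exists.
    for i, arg in enumerate(argv[:-1]):
        if arg == "--map-by":
            map_by = argv[i + 1]
        elif arg == "--bind-to":
            bind_to = argv[i + 1]
    # A pe=N modifier appears at the start or after a ':' or ',' separator.
    has_pe = bool(map_by) and re.search(r"(^|[:,])pe=\d+", map_by, re.IGNORECASE)
    if has_pe and bind_to:
        return map_by, bind_to
    return None


def strip_bind_to(argv):
    """Return a copy of argv with every '--bind-to <value>' pair removed.

    Assumption: the pe=N modifier alone yields an acceptable binding.
    """
    cleaned = []
    skip_next = False
    for arg in argv:
        if skip_next:
            skip_next = False
            continue
        if arg == "--bind-to":
            skip_next = True
            continue
        cleaned.append(arg)
    return cleaned


if __name__ == "__main__":
    cmd = ["mpirun", "--bind-to", "core", "--map-by", "slot:pe=1",
           "-np", "4", "./a.out"]
    print(find_pe_bind_conflict(cmd))
    print(strip_bind_to(cmd))
```

A wrapper script placed ahead of the real mpirun in PATH could apply `strip_bind_to` to the arguments Fluent generates before handing them to the real mpirun, which would avoid editing Fluent's own launch scripts.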