-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
Background: DRL using 4CFDs-Environments
-
Using a computation node with 4GPUs and 64CPUS
-
Using the
smartsimconfiguration:smartsim: n_dbs: 1 network_interface: "lo" run_command: "mpirun" launcher: "local"
Encountered issue:
-
All of the CFD are run at GPU devices
device:0, which leads to low-efficiency usage of the computation resources. -
The
rank_file(i.e.,.env000.txt) for launchingmpirunis:rank 0=alvis4-05 slot=1
Clearly, there is no binding of GPU devices.
Suggestions:
- Modify the usage of
local/slurmconfiguration of SmartFlow to adopt this usage. We may consider any of the paths:
- Make
srunable to use for single cluster node - Incorporate the GPU-related arguments in
rank_file.
I am currently working on the option#2, and I shall keep you posted by this issue. @soaringxmc
Metadata
Metadata
Assignees
Labels
No labels