Commit 8cd9ca3

--ft-use-infra-group-rank=False
Signed-off-by: oliver könig <[email protected]>
1 parent 9fc020c commit 8cd9ca3

1 file changed: +3 -0 lines changed

nemo_run/run/torchx_backend/components/ft_launcher.py

Lines changed: 3 additions & 0 deletions
@@ -117,6 +117,9 @@ def ft_launcher(
         if max_restarts:
             ft_args += ["--max-restarts", str(max_restarts)]
 
+        if dgxc is True:
+            ft_args += ["--ft-use-infra-group-rank", "False"]
+
     else:
         ft_args = ["--ignore-missing-fault-tol-cfg"]

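The flag assembly touched by this hunk can be sketched in isolation as follows. This is a minimal illustration, not the actual body of ft_launcher: the function name build_ft_args and the fault_tol_enabled parameter are hypothetical stand-ins, since the outer condition is not visible in the hunk. Only the three flag strings are taken from the diff.

```python
from typing import List, Optional


def build_ft_args(
    fault_tol_enabled: bool,  # hypothetical stand-in for the outer condition
    max_restarts: Optional[int] = None,
    dgxc: bool = False,
) -> List[str]:
    """Assemble launcher flags the way the hunk above appears to (assumed shape)."""
    if fault_tol_enabled:
        ft_args: List[str] = []
        if max_restarts:
            ft_args += ["--max-restarts", str(max_restarts)]
        if dgxc is True:
            # Extend the list in place with `+=`. Writing `= +[...]` would
            # apply unary plus to a list, which raises TypeError.
            ft_args += ["--ft-use-infra-group-rank", "False"]
    else:
        ft_args = ["--ignore-missing-fault-tol-cfg"]
    return ft_args


if __name__ == "__main__":
    print(build_ft_args(True, max_restarts=3, dgxc=True))
    # prints ['--max-restarts', '3', '--ft-use-infra-group-rank', 'False']
```

In this sketch the dgxc flag only takes effect inside the fault-tolerance branch, matching the indentation implied by the diff; when that branch is skipped, the only flag emitted is --ignore-missing-fault-tol-cfg.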