Skip to content

Commit 67cd851

Browse files
update comments
Signed-off-by: Youngeun Kwon <youngeunk@nvidia.com>
1 parent a2940a2 commit 67cd851

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

nemo_rl/models/policy/megatron_policy_worker.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -827,7 +827,8 @@ def init_collective(
827827
from vllm.distributed.device_communicators.pynccl import PyNcclCommunicator
828828
from vllm.distributed.utils import StatelessProcessGroup
829829

830-
# All training ranks [0..train_world_size-1] join the communicator with their rank
830+
# world_size = train_world_size + inference_world_size
831+
# variable train_world_size is used in inference cluster
831832
pg = StatelessProcessGroup.create(
832833
host=ip, port=port, rank=self.rank, world_size=world_size
833834
)

0 commit comments

Comments
 (0)