Skip to content

Commit 51df3e7

Browse files
chtruong814guyueh1
andauthored
cp: perf: [Perf recipe] Change TP 16->32 for deepseek GB200 sync benchmark (1715) into r0.5.0 (#1716)
Signed-off-by: Guyue Huang <guyueh@nvidia.com> Signed-off-by: NeMo Bot <nemo-bot@nvidia.com> Co-authored-by: Guyue Huang <140554423+guyueh1@users.noreply.github.com>
1 parent 6526fe9 commit 51df3e7

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/configs/recipes/llm/performance/grpo-deepseek-v3-32n4g.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ policy:
1111
num_layers_in_last_pipeline_stage: 6
1212
generation:
1313
vllm_cfg:
14-
tensor_parallel_size: 16
14+
tensor_parallel_size: 32
1515
logger:
1616
log_dir: logs/grpo-deepseek-v3-32n4g
1717
wandb:

0 commit comments

Comments
 (0)