Commit c329f5f

[https://nvbugs/5569754][chore] Adjust max batch size to prevent OOM (#8876)
Signed-off-by: Junyi Xu <[email protected]>
1 parent cacb8a8 commit c329f5f

1 file changed: +2 -1 lines changed

examples/llm-api/llm_mgmn_llm_distributed.sh

Lines changed: 2 additions & 1 deletion
@@ -51,5 +51,6 @@ srun -l \
     trtllm-llmapi-launch python3 $script \
         --model_dir $LOCAL_MODEL \
         --prompt 'Hello, how are you?' \
-        --tp_size 2
+        --tp_size 2 \
+        --max_batch_size 256
 "
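
The change above hardcodes the batch-size cap in the launch command. As a minimal sketch only, the same flags could be assembled from an environment variable so the cap can be lowered further on smaller GPUs without editing the script. `MAX_BATCH_SIZE` and `build_args` are hypothetical names used for illustration and are not part of this commit; the default of 256 is taken from the diff.

```shell
# Hypothetical sketch: MAX_BATCH_SIZE and build_args are illustrative names,
# not part of the original script.

# Default to the value chosen in this commit; allow an environment override.
MAX_BATCH_SIZE="${MAX_BATCH_SIZE:-256}"

build_args() {
    # Reject empty, non-numeric, or zero-prefixed values before launching.
    case "$MAX_BATCH_SIZE" in
        ''|*[!0-9]*|0*)
            echo "MAX_BATCH_SIZE must be a positive integer" >&2
            return 1
            ;;
    esac
    # Emit the trailing flags exactly as they appear in the diff.
    printf '%s' "--tp_size 2 --max_batch_size $MAX_BATCH_SIZE"
}

ARGS="$(build_args)"
echo "$ARGS"
```

With no override set, this prints the same trailing flags the commit adds. Validating the value before the `srun` call means a bad override fails fast instead of after nodes are allocated.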
