1 parent 70ffb6d commit db79574
docs/llm/mbridge/optimized/vllm.md
@@ -31,7 +31,6 @@ This section shows how to use scripts and APIs to export a Megatron-Bridge LLM t
 ```shell
 python /opt/Export-Deploy/scripts/deploy/nlp/deploy_vllm_triton.py \
     --model_path_id /opt/checkpoints/hf_llama31_8B_mbridge/iter_0000000/ \
-    --model_format megatron_bridge \
     --triton_model_name llama \
     --tensor_parallelism_size 1
 ```