Commit 70ffb6d

Update in-framework-ray.md
1 parent d1e3883 commit 70ffb6d

1 file changed: +1 -3 lines changed

docs/llm/mbridge/in-framework-ray.md

Lines changed: 1 addition & 3 deletions
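
For orientation, the net effect of this commit on the documented command can be sketched as a dry run. This is only an illustration assembled from the lines visible in the first hunk: the variable names `SCRIPT`, `CHECKPOINT`, and `CMD` are hypothetical, the snippet prints the command rather than executing it, and the hunk ends on a trailing `\`, so the real command continues with flags that are not shown here.

```shell
#!/bin/sh
# Dry run: build the post-commit command as a string and print it.
# Nothing is executed. Only flags visible in the diff context are included;
# the documented command continues past the end of the hunk.
SCRIPT=/opt/Export-Deploy/scripts/deploy/nlp/deploy_ray_inframework.py
CHECKPOINT=/opt/checkpoints/hf_llama31_8B_mbridge

# --model_format is intentionally absent: this commit removes it from the docs.
CMD="python $SCRIPT --megatron_checkpoint $CHECKPOINT --model_id llama --num_replicas 1 --num_gpus 1"

echo "$CMD"
```

The only difference from the pre-commit version is the missing `--model_format megatron` line; every other visible flag is unchanged.
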
@@ -27,7 +27,6 @@ This section demonstrates how to deploy [Megatron-Bridge](https://github.com/NVI
  ```shell
  python /opt/Export-Deploy/scripts/deploy/nlp/deploy_ray_inframework.py \
  --megatron_checkpoint /opt/checkpoints/hf_llama31_8B_mbridge \
- --model_format megatron \
  --model_id llama \
  --num_replicas 1 \
  --num_gpus 1 \
@@ -56,6 +55,5 @@ This section demonstrates how to deploy [Megatron-Bridge](https://github.com/NVI
  Deploying Megatron-Bridge models with Ray Serve closely follows the same process as deploying NeMo 2.0 models. The primary differences are:

  - Use the `--megatron_checkpoint` argument to specify your Megatron-Bridge checkpoint file.
- - Set `--model_format megatron` to indicate the model type.

- All other deployment steps, parameters, and Ray Serve features remain the same as for NeMo 2.0 LLMs. For a comprehensive walkthrough of advanced options, scaling, and troubleshooting, refer to the [Deploy NeMo 2.0 LLMs with Ray Serve](../nemo_2/in-framework-ray.md) documentation.
+ All other deployment steps, parameters, and Ray Serve features remain the same as for NeMo 2.0 LLMs. For a comprehensive walkthrough of advanced options, scaling, and troubleshooting, refer to the [Deploy NeMo 2.0 LLMs with Ray Serve](../nemo_2/in-framework-ray.md) documentation.
