1 parent e87abb9 commit ab8a350
docs/source/guides/vllm_plugin.mdx
@@ -70,7 +70,7 @@ When deploying a standard hub model, you can customize the way it will be export
```console
optimum-cli neuron serve \
- --model="unsloth/Llama-3.1-1B-Intruct" \
+ --model="unsloth/Llama-3.1-1B-Instruct" \
--batch_size=4 \
--sequence_length=4096 \
--tensor_parallel_size=2 \