Commit 23472ff

Roger Wang and sfeng33 authored
[Doc] Add usage of implicit text-only mode (#22561)
Signed-off-by: Roger Wang <[email protected]>
Co-authored-by: Flora Feng <[email protected]>
1 parent 08b751b commit 23472ff

1 file changed: +3 -0 lines changed

docs/models/supported_models.md

Lines changed: 3 additions & 0 deletions
@@ -583,6 +583,9 @@ See [this page](../features/multimodal_inputs.md) on how to pass multi-modal inp
 
 **This is no longer required if you are using vLLM V1.**
 
+!!! tip
+    For hybrid-only models such as Llama-4, Step3 and Mistral-3, a text-only mode can be enabled by setting all supported multimodal modalities to 0 (e.g, `--limit-mm-per-prompt '{"image":0}`) so that their multimodal modules will not be loaded to free up more GPU memory for KV cache.
+
 !!! note
     vLLM currently only supports adding LoRA to the language backbone of multimodal models.
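
The added tip says that text-only mode is enabled by setting every supported modality to 0. As a minimal sketch of how such a flag value could be assembled, the helper below builds the `--limit-mm-per-prompt` argument from a list of modality names. The function name `text_only_limit_flag` is hypothetical and not part of vLLM, and which modalities a given model supports is an assumption the caller must supply.

```python
import json
import shlex


def text_only_limit_flag(modalities):
    """Build a --limit-mm-per-prompt argument that sets every modality to 0.

    `modalities` is the caller's list of modality names the model supports
    (an assumption here; check the model's documentation for the real list).
    """
    # Map each modality to a limit of 0 so no multimodal inputs are allowed.
    limits = {name: 0 for name in modalities}
    # Compact JSON, shell-quoted so it survives as a single CLI argument.
    value = shlex.quote(json.dumps(limits, separators=(",", ":")))
    return f"--limit-mm-per-prompt {value}"


# Reproduces the flag shown in the tip for an image-only model.
print(text_only_limit_flag(["image"]))
```

For a model that also accepts another modality, pass every modality name so that all limits are 0, since the tip requires all supported modalities to be disabled.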
