1 parent 5ef6587 commit 1944fb1
docs/source/features/multi-modality.md
@@ -35,7 +35,7 @@ The following examples demonstrate how to use TensorRT LLM's multimodal support
Quickly try out TensorRT LLM's multimodal support using our `LLM-API` and a ready-to-run [example](source:examples/llm-api/quickstart_multimodal.py):

```bash
-python3 quickstart_multimodal.py --model_dir Efficient-Large-Model/NVILA-8B --modality image
+python3 quickstart_multimodal.py --model_dir Efficient-Large-Model/NVILA-8B --modality image --disable_kv_cache_reuse
```

### OpenAI-Compatible Server via [`trtllm-serve`](../../source/commands/trtllm-serve/trtllm-serve.rst)