Multimodal Feature Support Matrix (PyTorch Backend)

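| Model             | CUDA Graph | Encoder IFB | KV Cache Reuse | Chunked Prefill |
| ----------------- | ---------- | ----------- | -------------- | --------------- |
| Gemma 3           | Yes        | Yes         | N/A            | N/A             |
| HyperCLOVA        | Yes        | Yes         | No             | No              |
| VILA              | Yes        | No          | No             | No              |
| LLaVA-NeXT        | Yes        | Yes         | Yes            | Yes             |
| Llama 4           | Yes        | Yes         | No             | No              |
| Mistral-Small-3.1 | Yes        | Yes         | Yes            | Yes             |
| Phi-4-multimodal  | Yes        | Yes         | Yes            | Yes             |
| Qwen2-VL          | Yes        | Yes         | Yes            | Yes             |
| Qwen2.5-VL        | Yes        | Yes         | Yes            | Yes             |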
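
The matrix above can also be consulted programmatically, for example to guard a configuration before enabling a feature. The following is a minimal sketch, not part of any library: the names `FEATURES`, `SUPPORT_MATRIX`, and `supports` are hypothetical, and the values are copied by hand from the table, with `None` standing in for "N/A".

```python
from typing import Optional

# Column order mirrors the table: CUDA Graph, Encoder IFB,
# KV Cache Reuse, Chunked Prefill. These keys are illustrative names only.
FEATURES = ("cuda_graph", "encoder_ifb", "kv_cache_reuse", "chunked_prefill")

# Hand-transcribed from the table above. True = "Yes", False = "No", None = "N/A".
SUPPORT_MATRIX = {
    "Gemma 3":           (True, True,  None,  None),
    "HyperCLOVA":        (True, True,  False, False),
    "VILA":              (True, False, False, False),
    "LLaVA-NeXT":        (True, True,  True,  True),
    "Llama 4":           (True, True,  False, False),
    "Mistral-Small-3.1": (True, True,  True,  True),
    "Phi-4-multimodal":  (True, True,  True,  True),
    "Qwen2-VL":          (True, True,  True,  True),
    "Qwen2.5-VL":        (True, True,  True,  True),
}


def supports(model: str, feature: str) -> Optional[bool]:
    """Return True/False for Yes/No, or None when the table lists N/A.

    Raises KeyError for an unknown model and ValueError for an unknown feature.
    """
    row = SUPPORT_MATRIX[model]
    return row[FEATURES.index(feature)]


def models_supporting(feature: str) -> list:
    """List the models whose entry for `feature` is "Yes"."""
    return [m for m in SUPPORT_MATRIX if supports(m, feature) is True]
```

A caller might check `supports("VILA", "encoder_ifb")` before enabling that feature, treating both `False` and `None` as "do not enable".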