You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 4, 2025. It is now read-only.
Copy file name to clipboardExpand all lines: docs/source/models/supported_models.rst
+7-3Lines changed: 7 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -252,6 +252,11 @@ Multimodal Language Models
252
252
- Image\ :sup:`E`
253
253
- :code:`Qwen/Qwen-VL`, :code:`Qwen/Qwen-VL-Chat`, etc.
254
254
-
255
+
* - :code:`Qwen2VLForConditionalGeneration`
256
+
- Qwen2-VL (see note)
257
+
- Image\ :sup:`+` / Video\ :sup:`+`
258
+
- :code:`Qwen/Qwen2-VL-2B-Instruct`, :code:`Qwen/Qwen2-VL-7B-Instruct`, :code:`Qwen/Qwen2-VL-72B-Instruct`, etc.
259
+
-
255
260
* - :code:`UltravoxModel`
256
261
- Ultravox
257
262
- Audio\ :sup:`E+`
@@ -265,15 +270,14 @@ Multimodal Language Models
265
270
For :code:`openbmb/MiniCPM-V-2`, the official repo doesn't work yet, so we need to use a fork (:code:`HwwwH/MiniCPM-V-2`) for now.
266
271
For more details, please see: https://github.com/vllm-project/vllm/pull/4087#issuecomment-2250397630
267
272
268
-
For :code:`LLaVA-NeXT-Video`, the latest release of :code:`huggingface/transformers` doesn't work yet, so we need to use a developer version (:code:`21fac7abba2a37fae86106f87fcf9974fd1e3830`) for now.
273
+
.. note::
274
+
For :code:`LLaVA-NeXT-Video` and :code:`Qwen2-VL`, the latest release of :code:`huggingface/transformers` doesn't work yet, so we need to use a developer version (:code:`21fac7abba2a37fae86106f87fcf9974fd1e3830`) for now.
269
275
This can be installed by running the following command:
0 commit comments