[Doc] Update user guide for suported models (#2263)

zhangxinyuehfad · web-flow · commit 92eebc0c9b65 · 2025-08-07T14:39:51.000+08:00
### What this PR does / why we need it? Update user guide for suported models - vLLM version: v0.10.0 - vLLM main: vllm-project/vllm@4be02a3 --------- Signed-off-by: hfadzxy <starmoon_zhang@163.com>
diff --git a/docs/source/user_guide/support_matrix/supported_models.md b/docs/source/user_guide/support_matrix/supported_models.md
@@ -5,49 +5,75 @@ Get the newest info here: https://github.com/vllm-project/vllm-ascend/issues/160
 ## Text-only Language Models
 
 ### Generative Models
-| Model | Supported | Note |
-|-------|-----------|------|
-| DeepSeek v3 | ✅ | |
-| DeepSeek R1 | ✅ | |
-| DeepSeek Distill (Qwen/LLama) | ✅ | |
-| Qwen3 | ✅ | |
-| Qwen3-Moe | ✅ | |
-| Qwen2.5 | ✅ | |
-| QwQ-32B | ✅ | |
-| LLama3.1/3.2 | ✅ | |
-| Internlm | ✅ | |
-| Baichuan | ✅ | |
-| Phi-4-mini | ✅ | |
-| MiniCPM | ✅ | |
-| MiniCPM3 | ✅ | |
-| LLama4 | ✅ | |
-| Mistral | | Need test |
-| DeepSeek v2.5 | |Need test |
-| Gemma-2 | | Need test |
-| Mllama |  |Need test|
-| Gemma-3 | ❌ | [#496](https://github.com/vllm-project/vllm-ascend/issues/496) |
-| ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
+
+| Model                         | Supported | Note                                                                 |
+|-------------------------------|-----------|----------------------------------------------------------------------|
+| DeepSeek v3                   | ✅        |                                                                      |
+| DeepSeek R1                   | ✅        |                                                                      |
+| DeepSeek Distill (Qwen/LLama) | ✅        |                                                                      |
+| Qwen3                         | ✅        |                                                                      |
+| Qwen3-based                   | ✅        |                                                                      |
+| Qwen3-Coder                   | ✅        |                                                                      |
+| Qwen3-Moe                     | ✅        |                                                                      |
+| Qwen2.5                       | ✅        |                                                                      |
+| Qwen2                         | ✅        |                                                                      |
+| Qwen2-based                   | ✅        |                                                                      |
+| QwQ-32B                       | ✅        |                                                                      |
+| LLama2/3/3.1                  | ✅        |                                                                      |
+| Internlm                      | ✅        | [#1962](https://github.com/vllm-project/vllm-ascend/issues/1962)     |
+| Baichuan                      | ✅        |                                                                      |
+| Baichuan2                     | ✅        |                                                                      |
+| Phi-4-mini                    | ✅        |                                                                      |
+| MiniCPM                       | ✅        |                                                                      |
+| MiniCPM3                      | ✅        |                                                                      |
+| Ernie4.5                      | ✅        |                                                                      |
+| Ernie4.5-Moe                  | ✅        |                                                                      |
+| Gemma-2                       | ✅        |                                                                      |
+| Gemma-3                       | ✅        |                                                                      |
+| Phi-3/4                       | ✅        |                                                                      |
+| Mistral/Mistral-Instruct      | ✅        |                                                                      |
+| GLM-4.5                       | ✅            |                                                                  |
+| GLM-4                         | ❌        | [#2255](https://github.com/vllm-project/vllm-ascend/issues/2255)     |
+| GLM-4-0414                    | ❌        | [#2258](https://github.com/vllm-project/vllm-ascend/issues/2258)     |
+| ChatGLM                       | ❌        | [#554](https://github.com/vllm-project/vllm-ascend/issues/554)       |
+| DeepSeek v2.5                 | 🟡        | Need test                                                            |
+| Mllama                        | 🟡        | Need test                                                            |
+| MiniMax-Text                  | 🟡        | Need test                                                            |
 
 ### Pooling Models
-| Model | Supported | Note |
-|-------|---------|------|
-| XLM-RoBERTa-based | ✅ |  |
-| Molmo | ✅ |  |
+
+| Model                         | Supported | Note                                                                 |
+|-------------------------------|-----------|----------------------------------------------------------------------|
+| Qwen3-Embedding               | ✅        |                                                                      |
+| Molmo                         | ✅        | [1942](https://github.com/vllm-project/vllm-ascend/issues/1942)      |
+| XLM-RoBERTa-based             | ❌        | [1960](https://github.com/vllm-project/vllm-ascend/issues/1960)      |
 
 ## Multimodal Language Models
 
 ### Generative Models
-| Model | Supported | Note |
-|-------|-----------|------|
-| Qwen2-VL | ✅ | |
-| Qwen2.5-VL | ✅ | |
-| LLaVA 1.5 | ✅ | |
-| LLaVA 1.6 | ✅ | [#553](https://github.com/vllm-project/vllm-ascend/issues/553) |
-| InternVL2 | ✅ | |
-| InternVL2.5 | ✅ | |
-| Qwen2-Audio | ✅ |  |
-| LLaVA-Next |  | Need test |
-| LLaVA-Next-Video | | Need test |
-| Phi-3-Vison/Phi-3.5-Vison | | Need test |
-| GLM-4v | | Need test |
-| Ultravox |  | Need test |
+
+| Model                          | Supported     | Note                                                                 |
+|--------------------------------|---------------|----------------------------------------------------------------------|
+| Qwen2-VL                       | ✅            |                                                                      |
+| Qwen2.5-VL                     | ✅            |                                                                      |
+| Qwen2.5-Omni                   | ✅            | [1760](https://github.com/vllm-project/vllm-ascend/issues/1760)      |
+| QVQ                            | ✅            |                                                                      |
+| LLaVA 1.5/1.6                  | ✅            | [1962](https://github.com/vllm-project/vllm-ascend/issues/1962)      |
+| InternVL2                      | ✅            |                                                                      |
+| InternVL2.5                    | ✅            |                                                                      |
+| Qwen2-Audio                    | ✅            |                                                                      |
+| Aria                           | ✅            |                                                                      |
+| LLaVA-Next                     | ✅            |                                                                      |
+| LLaVA-Next-Video               | ✅            |                                                                      |
+| MiniCPM-V                      | ✅            |                                                                      |
+| Mistral3                       | ✅            |                                                                      |
+| Phi-3-Vison/Phi-3.5-Vison      | ✅            |                                                                      |
+| Gemma3                         | ✅            |                                                                      |
+| LLama4                         | ❌            | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972)      |
+| LLama3.2                       | ❌            | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972)      |
+| Keye-VL-8B-Preview             | ❌            | [1963](https://github.com/vllm-project/vllm-ascend/issues/1963)      |
+| Florence-2                     | ❌            | [2259](https://github.com/vllm-project/vllm-ascend/issues/2259)      |
+| GLM-4V                         | ❌            | [2260](https://github.com/vllm-project/vllm-ascend/issues/2260)      |
+| InternVL2.0/2.5/3.0<br>InternVideo2.5/Mono-InternVL | ❌ | [2064](https://github.com/vllm-project/vllm-ascend/issues/2064) |
+| Whisper                        | ❌            | [2262](https://github.com/vllm-project/vllm-ascend/issues/2262)      |
+| Ultravox                       | 🟡 Need test  |                                                                      |