@@ -5,49 +5,75 @@ Get the newest info here: https://github.com/vllm-project/vllm-ascend/issues/160
## Text-only Language Models
### Generative Models
- | Model | Supported | Note |
- | -------| -----------| ------|
- | DeepSeek v3 | ✅ | |
- | DeepSeek R1 | ✅ | |
- | DeepSeek Distill (Qwen/LLama) | ✅ | |
- | Qwen3 | ✅ | |
- | Qwen3-Moe | ✅ | |
- | Qwen2.5 | ✅ | |
- | QwQ-32B | ✅ | |
- | LLama3.1/3.2 | ✅ | |
- | Internlm | ✅ | |
- | Baichuan | ✅ | |
- | Phi-4-mini | ✅ | |
- | MiniCPM | ✅ | |
- | MiniCPM3 | ✅ | |
- | LLama4 | ✅ | |
- | Mistral | | Need test |
- | DeepSeek v2.5 | | Need test |
- | Gemma-2 | | Need test |
- | Mllama | | Need test|
- | Gemma-3 | ❌ | [#496](https://github.com/vllm-project/vllm-ascend/issues/496) |
- | ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
+
+ | Model | Supported | Note |
+ | -------------------------------| -----------| ----------------------------------------------------------------------|
+ | DeepSeek v3 | ✅ | |
+ | DeepSeek R1 | ✅ | |
+ | DeepSeek Distill (Qwen/LLama) | ✅ | |
+ | Qwen3 | ✅ | |
+ | Qwen3-based | ✅ | |
+ | Qwen3-Coder | ✅ | |
+ | Qwen3-Moe | ✅ | |
+ | Qwen2.5 | ✅ | |
+ | Qwen2 | ✅ | |
+ | Qwen2-based | ✅ | |
+ | QwQ-32B | ✅ | |
+ | LLama2/3/3.1 | ✅ | |
+ | Internlm | ✅ | [#1962](https://github.com/vllm-project/vllm-ascend/issues/1962) |
+ | Baichuan | ✅ | |
+ | Baichuan2 | ✅ | |
+ | Phi-4-mini | ✅ | |
+ | MiniCPM | ✅ | |
+ | MiniCPM3 | ✅ | |
+ | Ernie4.5 | ✅ | |
+ | Ernie4.5-Moe | ✅ | |
+ | Gemma-2 | ✅ | |
+ | Gemma-3 | ✅ | |
+ | Phi-3/4 | ✅ | |
+ | Mistral/Mistral-Instruct | ✅ | |
+ | GLM-4.5 | ✅ | |
+ | GLM-4 | ❌ | [#2255](https://github.com/vllm-project/vllm-ascend/issues/2255) |
+ | GLM-4-0414 | ❌ | [#2258](https://github.com/vllm-project/vllm-ascend/issues/2258) |
+ | ChatGLM | ❌ | [#554](https://github.com/vllm-project/vllm-ascend/issues/554) |
+ | DeepSeek v2.5 | 🟡 | Need test |
+ | Mllama | 🟡 | Need test |
+ | MiniMax-Text | 🟡 | Need test |
### Pooling Models
- | Model | Supported | Note |
- | -------| ---------| ------|
- | XLM-RoBERTa-based | ✅ | |
- | Molmo | ✅ | |
+
+ | Model | Supported | Note |
+ | -------------------------------| -----------| ----------------------------------------------------------------------|
+ | Qwen3-Embedding | ✅ | |
+ | Molmo | ✅ | [1942](https://github.com/vllm-project/vllm-ascend/issues/1942) |
+ | XLM-RoBERTa-based | ❌ | [1960](https://github.com/vllm-project/vllm-ascend/issues/1960) |
## Multimodal Language Models
### Generative Models
- | Model | Supported | Note |
- | -------| -----------| ------|
- | Qwen2-VL | ✅ | |
- | Qwen2.5-VL | ✅ | |
- | LLaVA 1.5 | ✅ | |
- | LLaVA 1.6 | ✅ | [#553](https://github.com/vllm-project/vllm-ascend/issues/553) |
- | InternVL2 | ✅ | |
- | InternVL2.5 | ✅ | |
- | Qwen2-Audio | ✅ | |
- | LLaVA-Next | | Need test |
- | LLaVA-Next-Video | | Need test |
- | Phi-3-Vison/Phi-3.5-Vison | | Need test |
- | GLM-4v | | Need test |
- | Ultravox | | Need test |
+
+ | Model | Supported | Note |
+ | --------------------------------| ---------------| ----------------------------------------------------------------------|
+ | Qwen2-VL | ✅ | |
+ | Qwen2.5-VL | ✅ | |
+ | Qwen2.5-Omni | ✅ | [1760](https://github.com/vllm-project/vllm-ascend/issues/1760) |
+ | QVQ | ✅ | |
+ | LLaVA 1.5/1.6 | ✅ | [1962](https://github.com/vllm-project/vllm-ascend/issues/1962) |
+ | InternVL2 | ✅ | |
+ | InternVL2.5 | ✅ | |
+ | Qwen2-Audio | ✅ | |
+ | Aria | ✅ | |
+ | LLaVA-Next | ✅ | |
+ | LLaVA-Next-Video | ✅ | |
+ | MiniCPM-V | ✅ | |
+ | Mistral3 | ✅ | |
+ | Phi-3-Vison/Phi-3.5-Vison | ✅ | |
+ | Gemma3 | ✅ | |
+ | LLama4 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) |
+ | LLama3.2 | ❌ | [1972](https://github.com/vllm-project/vllm-ascend/issues/1972) |
+ | Keye-VL-8B-Preview | ❌ | [1963](https://github.com/vllm-project/vllm-ascend/issues/1963) |
+ | Florence-2 | ❌ | [2259](https://github.com/vllm-project/vllm-ascend/issues/2259) |
+ | GLM-4V | ❌ | [2260](https://github.com/vllm-project/vllm-ascend/issues/2260) |
+ | InternVL2.0/2.5/3.0<br>InternVideo2.5/Mono-InternVL | ❌ | [2064](https://github.com/vllm-project/vllm-ascend/issues/2064) |
+ | Whisper | ❌ | [2262](https://github.com/vllm-project/vllm-ascend/issues/2262) |
+ | Ultravox | 🟡 Need test | |