Skip to content

Commit eafa8dc

Browse files
authored
[Model] Add pp support for hunyuan (vllm-project#24212)
Signed-off-by: zjy0516 <[email protected]>
1 parent 6c7af81 commit eafa8dc

File tree

2 files changed

+4
-4
lines changed

2 files changed

+4
-4
lines changed

docs/models/supported_models.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -365,8 +365,8 @@ th {
365365
| `GraniteMoeSharedForCausalLM` | Granite MoE Shared | `ibm-research/moe-7b-1b-active-shared-experts` (test model) | ✅︎ | ✅︎ | ✅︎ |
366366
| `GritLM` | GritLM | `parasail-ai/GritLM-7B-vllm`. | ✅︎ | ✅︎ | ✅︎ |
367367
| `Grok1ModelForCausalLM` | Grok1 | `hpcai-tech/grok-1`. | ✅︎ | ✅︎ | ✅︎ |
368-
| `HunYuanDenseV1ForCausalLM` | Hunyuan-7B-Instruct-0124 | `tencent/Hunyuan-7B-Instruct-0124` | ✅︎ | | ✅︎ |
369-
| `HunYuanMoEV1ForCausalLM` | Hunyuan-80B-A13B | `tencent/Hunyuan-A13B-Instruct`, `tencent/Hunyuan-A13B-Pretrain`, `tencent/Hunyuan-A13B-Instruct-FP8`, etc. | ✅︎ | | ✅︎ |
368+
| `HunYuanDenseV1ForCausalLM` | Hunyuan-7B-Instruct-0124 | `tencent/Hunyuan-7B-Instruct-0124` | ✅︎ | ✅︎ | ✅︎ |
369+
| `HunYuanMoEV1ForCausalLM` | Hunyuan-80B-A13B | `tencent/Hunyuan-A13B-Instruct`, `tencent/Hunyuan-A13B-Pretrain`, `tencent/Hunyuan-A13B-Instruct-FP8`, etc. | ✅︎ | ✅︎ | ✅︎ |
370370
| `HCXVisionForCausalLM` | HyperCLOVAX-SEED-Vision-Instruct-3B | `naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B` | | | ✅︎ |
371371
| `InternLMForCausalLM` | InternLM | `internlm/internlm-7b`, `internlm/internlm-chat-7b`, etc. | ✅︎ | ✅︎ | ✅︎ |
372372
| `InternLM2ForCausalLM` | InternLM2 | `internlm/internlm2-7b`, `internlm/internlm2-chat-7b`, etc. | ✅︎ | ✅︎ | ✅︎ |

vllm/model_executor/models/hunyuan_v1.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -56,7 +56,7 @@
5656
from vllm.model_executor.sampling_metadata import SamplingMetadata
5757
from vllm.sequence import IntermediateTensors
5858

59-
from .interfaces import SupportsLoRA
59+
from .interfaces import SupportsLoRA, SupportsPP
6060
from .utils import (AutoWeightsLoader, PPMissingLayer, is_pp_missing_parameter,
6161
make_layers)
6262

@@ -841,7 +841,7 @@ def load_weights(self, weights: Iterable[tuple[str, torch.Tensor]]):
841841
return loaded_params
842842

843843

844-
class HunYuanV1Base(nn.Module, SupportsLoRA):
844+
class HunYuanV1Base(nn.Module, SupportsLoRA, SupportsPP):
845845
packed_modules_mapping = {
846846
"qkv_proj": [
847847
"q_proj",

0 commit comments

Comments
 (0)