Skip to content

Conversation

zeroRains
Copy link
Contributor

pcard-71500

修改DeepSeekV3 self.decoder_layers层的名称为self.layers,减少Loader V1对开源模型name的replace次数。
减少冗余条件:处理非MOE层参数时额外添加的冗余条件loaded_weight_name not in params_dict

Copy link

paddle-bot bot commented Aug 11, 2025

Thanks for your contribution!

@Jiang-Jia-Jun Jiang-Jia-Jun merged commit be94bdd into PaddlePaddle:develop Aug 13, 2025
15 of 22 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants