Commit 69af79c
Fix SmoothQuant regex to match q_a_proj in DeepSeek/GLM-5 (#2421)
Fixes #2420
The balance_layers pattern re:.*q_proj in
DEEPSEEK_V2_SMOOTHQUANT_MAPPINGS does not match q_a_proj (used by
DeepSeek V2/V3 and GLM-5). Changed to re:.*q(_a)?_proj$ as suggested by
@brian-dellabetta.
SUMMARY:
Fix regex pattern in DEEPSEEK_V2_SMOOTHQUANT_MAPPINGS to cover both
q_proj and q_a_proj.
TEST PLAN:
Verified with Python regex that the new pattern matches both layer
names:
  re.fullmatch(".*q(_a)?_proj$", "model.layers.0.self_attn.q_proj")   -> match
  re.fullmatch(".*q(_a)?_proj$", "model.layers.0.self_attn.q_a_proj") -> match
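
The check in the test plan can be extended to compare the old and new patterns side by side. This is a minimal sketch, not the project's code: the `matches` helper is hypothetical and assumes that a `re:`-prefixed mapping entry is applied by stripping the prefix and calling `re.match` on the layer name. The `q_b_proj` name is included only as an illustrative negative case.

```python
import re

# Patterns as they appear in the commit message (with the "re:" prefix).
OLD_PATTERN = "re:.*q_proj"
NEW_PATTERN = "re:.*q(_a)?_proj$"


def matches(pattern: str, layer_name: str) -> bool:
    """Hypothetical helper: strip an optional "re:" prefix, then re.match.

    Assumption: this mirrors how a "re:"-prefixed target is resolved;
    the real matching logic may differ.
    """
    regex = pattern[3:] if pattern.startswith("re:") else pattern
    return re.match(regex, layer_name) is not None


layer_names = [
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.self_attn.q_a_proj",
    "model.layers.0.self_attn.q_b_proj",  # illustrative negative case
]

# Map each layer name to (matched by old pattern, matched by new pattern).
results = {
    name: (matches(OLD_PATTERN, name), matches(NEW_PATTERN, name))
    for name in layer_names
}

for name, (old_hit, new_hit) in results.items():
    print(f"{name}: old={old_hit} new={new_hit}")
```

Under that assumption, the old pattern misses `q_a_proj` because the name contains no literal `q_proj` substring, while the new pattern accepts both `q_proj` and `q_a_proj` and, thanks to the trailing `$`, nothing that merely continues past `_proj`.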
Signed-off-by: gillesturpin <turpingilles@orange.fr>
Co-authored-by: Brian Dellabetta <brian-dellabetta@users.noreply.github.com>
Co-authored-by: HDCharles <39544797+HDCharles@users.noreply.github.com>
1 parent 9e9ae3d commit 69af79c
1 file changed, +1 -1 lines changed (line 64 replaced; diff content not captured)