Skip to content

Commit b4b0587

Browse files
authored
fix not overlap bug (#10864)
1 parent 75e51df commit b4b0587

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

paddlenlp/transformers/moe_layer.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -906,7 +906,7 @@ def forward(self, hidden_states, probs, routing_map):
906906

907907
@paddle.no_grad()
908908
def backward(self, output_grad, with_dw=True):
909-
output_combine_grad = self.combine_quant_node.backward(output_grad)
909+
output_combine_grad, _ = self.combine_quant_node.backward(output_grad)
910910
hidden_states_out_grad = self.combine_node.backward(output_combine_grad)
911911

912912
hs_dispatched_grad, dispatched_probs_grad = self.mlp_node.backward(hidden_states_out_grad, with_dw=with_dw)

0 commit comments

Comments
 (0)