Commit 232cc1d

ParallelDecoderBlock: Fix regression
1 parent ef8fd43 commit 232cc1d

1 file changed: +1 -0 lines changed

exllamav3/modules/transformer.py

Lines changed: 1 addition & 0 deletions
@@ -245,6 +245,7 @@ def allocate_q(self, quant_args: dict, surplus_bits: int):
     self.mlp.gates if any(isinstance(self.mlp, x) for x in [GatedMLP, BlockSparseMLP]) else None,
     self.mlp.ups if self.mlp else None,
     self.mlp.downs if self.mlp else None,
+    None
 )
 
 