Commit 5b434ed
committed
Fix: Correct device placement for QuantizedLinear across all quantizers
Extends the previous fix for AWQ to GPTQ and GGUF quantizers.
Addresses an AttributeError where QuantizedLinear (an nn.Module)
was incorrectly passed to `move_to_device`, a function expecting
a tensor. This change ensures QuantizedLinear modules are moved to
their target device using the correct `.to(device)` method in
AWQ, GPTQ, and GGUF quantizers.
This commit ensures consistent and correct device handling for
quantized layers created by these methods.1 parent bfb5167 commit 5b434ed
2 files changed
+2
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
232 | 232 | | |
233 | 233 | | |
234 | 234 | | |
235 | | - | |
| 235 | + | |
236 | 236 | | |
237 | 237 | | |
238 | 238 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
202 | 202 | | |
203 | 203 | | |
204 | 204 | | |
205 | | - | |
| 205 | + | |
206 | 206 | | |
207 | 207 | | |
208 | 208 | | |
| |||
0 commit comments