Commit 0873f28
[GPTQ] Add inversion fallback (#1283)
## Purpose ##
* Large language models keep growing (DeepSeek-R1, for example,
contains 45034 linear layers), so the likelihood that at least one
Hessian inversion fails during a GPTQ run is significant
* These changes cause the GPTQ algorithm to fall back to RTN
(round-to-nearest) for any layer whose Hessian inversion fails
## Changes ##
* Implement the fallback by setting the Hessian to the identity matrix
if inversion fails
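
The fallback described above can be sketched roughly as follows. This is an illustrative NumPy version, not the code from this commit: the function name `invert_hessian_or_identity` and the use of a Cholesky-based inverse are assumptions made for the example. The idea it shows is that when the Hessian cannot be factorized, substituting the identity leaves no cross-column terms for error compensation, so each weight is expected to be simply rounded, which is the RTN behavior.

```python
from __future__ import annotations

import numpy as np


def invert_hessian_or_identity(hessian: np.ndarray) -> tuple[np.ndarray, bool]:
    """Return (hessian_inverse, fell_back).

    Hypothetical helper for illustration only; the name and signature are
    not taken from llmcompressor.
    """
    size = hessian.shape[0]
    identity = np.eye(size, dtype=hessian.dtype)
    try:
        # Cholesky factorization succeeds only for positive-definite input.
        chol = np.linalg.cholesky(hessian)
    except np.linalg.LinAlgError:
        # Fallback: treat the Hessian as the identity. Its inverse is also
        # the identity, so no quantization error is propagated between
        # columns.
        return identity, True

    # H = L L^T, therefore H^-1 = (L^-1)^T (L^-1).
    chol_inv = np.linalg.solve(chol, identity)
    return chol_inv.T @ chol_inv, False


if __name__ == "__main__":
    well_conditioned = np.array([[4.0, 1.0], [1.0, 3.0]])
    not_positive_definite = np.array([[1.0, 2.0], [2.0, 1.0]])

    _, fell_back = invert_hessian_or_identity(well_conditioned)
    print("fallback used:", fell_back)

    _, fell_back = invert_hessian_or_identity(not_positive_definite)
    print("fallback used:", fell_back)
```

Returning a flag alongside the matrix lets the caller log which layers fell back, which is useful when only a handful of layers out of tens of thousands are affected.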
---------
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

1 parent 81271b5 commit 0873f28
1 file changed: 5 additions & 2 deletions
src/llmcompressor/modifiers/quantization/gptq
[Diff body not captured: additions at new lines 13, 165, 168, 169, and 171; deletions at original lines 164 and 167]