You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## Purpose ##
* Given the increasing size of large language models (such as
DeepSeek-R1 which contains 45034 linear layers), the likelihood that any
of the hessian inversions will spontaneously fail is significant
* These changes cause the GPTQ algorithm to fall back to RTN for any
layers which fail hessian inversion
## Changes ##
* Implement fallback by setting hessian value to identity matrix if
inversion fails
---------
Signed-off-by: Kyle Sayers <[email protected]>
0 commit comments