## Purpose ##
* Fix failing tests introduced by adding `dispatch_for_generation` to
the data-free pipeline
## Changes ##
* After `oneshot` completes, it should remove any dispatches from the model.
However, due to a bug (fixed
[here](neuralmagic/compressed-tensors#427)), models
which fit entirely on one GPU do not have their dispatches removed
(since they do not have hooks)
* As a result, we need to move weights to the same device before
comparing them for `test_quantization_reload`
* The `test_perplexity` test was implicitly relying on the model being
dispatched to GPUs. It now explicitly calls `dispatch_for_generation`,
similar to how we do in our examples
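
The device alignment done before comparing weights in `test_quantization_reload` can be sketched roughly as follows. This is a minimal illustration and not the code from the test itself: the helper name `mismatched_tensors` and the choice of CPU as the common device are assumptions, and only standard PyTorch calls are used.

```python
import torch


def mismatched_tensors(expected, actual, device="cpu"):
    """Return the names of tensors that differ between two state dicts.

    Hypothetical helper: every tensor is moved to `device` first, so a
    model left dispatched on a GPU can be compared with one on the CPU.
    """
    if expected.keys() != actual.keys():
        raise KeyError("state dicts do not contain the same tensor names")

    mismatched = []
    for name, tensor in expected.items():
        lhs = tensor.to(device)
        rhs = actual[name].to(device)
        # torch.equal requires both tensors on one device; it returns
        # False when shapes or values differ.
        if not torch.equal(lhs, rhs):
            mismatched.append(name)
    return mismatched


# Stand-ins for the original and reloaded models.
original = torch.nn.Linear(4, 2)
reloaded = torch.nn.Linear(4, 2)
reloaded.load_state_dict(original.state_dict())

print(mismatched_tensors(original.state_dict(), reloaded.state_dict()))
```

Moving both sides to one device before calling `torch.equal` avoids the device-mismatch error that would otherwise occur when one model is still on a GPU.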
## Testing ##
* Nightly and commit tests passed locally
---------
Signed-off-by: Kyle Sayers <[email protected]>
Co-authored-by: Kyle Sayers <[email protected]>