Commit 81ec39c
[Offload] Convert model back to CT offloading for testing (vllm-project#2403)
## Purpose ##
* Fix testing which requires access to the model after the model has
been saved
## Prerequisites ##
* vllm-project/compressed-tensors#601
## Changes ##
* Convert back to CT offloading after converting to accelerate
offloading for saving
* Previously we just "removed dispatch", but this is bad practice as it
won't work for disk offloading
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com>
1 parent c6e4d38
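The pattern described under Changes (switch to a save-compatible offloading scheme, save, then switch back so tests can still use the model) could be sketched roughly as below. This is a minimal illustration only: `ToyModel`, `to_save_format`, `from_save_format`, and `save_compatible` are hypothetical names invented for this sketch and are not the llm-compressor or compressed-tensors API, and the context manager is one possible way to structure the round trip, not necessarily what the commit does.

```python
from contextlib import contextmanager


class ToyModel:
    """Hypothetical stand-in for a model; `offload` records the active scheme."""

    def __init__(self):
        self.offload = "ct"


def to_save_format(model):
    # Hypothetical stand-in for converting CT offloading to accelerate offloading.
    model.offload = "accelerate"


def from_save_format(model):
    # Hypothetical stand-in for converting back to CT offloading.
    model.offload = "ct"


@contextmanager
def save_compatible(model):
    """Temporarily switch the offloading scheme, restoring it even on error."""
    to_save_format(model)
    try:
        yield model
    finally:
        from_save_format(model)


model = ToyModel()
seen = []
with save_compatible(model) as m:
    seen.append(m.offload)  # a real save call would go here
seen.append(model.offload)  # the model is usable again after saving
```

Restoring the original scheme, rather than simply stripping the dispatch, keeps the model in a state the rest of the code expects, which is the concern the commit message raises for disk offloading.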
1 file changed under src/llmcompressor/transformers/compression: 7 additions and 4 deletions
Diff (line content not captured; only hunk positions survive):

| Hunk | Original lines removed | New lines added |
|---|---|---|
| 1 | 12 | 12 |
| 2 | 92-94 | 92-93, 95 |
| 3 | none | 113-115 |