In Section 3.3 of your paper, you say the regularization term is calculated as follows:

<img width="531" height="97" alt="Image" src="https://github.com/user-attachments/assets/217a2c5a-192c-473f-b7f2-3e62fb74f00c" />

According to the paper, sparsity training uses the unnormalized importance, and the max-normalized importance is used only during pruning. However, in your code:

<img width="895" height="92" alt="Image" src="https://github.com/user-attachments/assets/b7893b3a-a13d-4bab-bf55-d8b34b13c461" />

Here the max-normalized importance appears to be used during both sparsity training and pruning. Is this intentional, or is something wrong?
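
To make the distinction concrete, here is a minimal sketch of what I mean by unnormalized versus max-normalized importance. It is not taken from the repository: the function names `group_importance` and `max_normalize`, the per-channel L2 norm used as the importance score, and the toy weights are all my own assumptions for illustration.

```python
import math

def group_importance(weights):
    """Per-channel L2 norm, used here as a stand-in importance score (assumption)."""
    return [math.sqrt(sum(w * w for w in channel)) for channel in weights]

def max_normalize(importance):
    """Divide every score by the largest one so that the maximum becomes 1.0."""
    peak = max(importance)
    if peak == 0:
        return list(importance)  # nothing to scale when all scores are zero
    return [score / peak for score in importance]

# Toy weights: three channels with two parameters each (hypothetical values).
weights = [[3.0, 4.0], [0.0, 1.0], [6.0, 8.0]]

raw = group_importance(weights)   # unnormalized importance
normalized = max_normalize(raw)   # max-normalized importance

def ranking(scores):
    """Channel indices sorted from least to most important."""
    return sorted(range(len(scores)), key=lambda i: scores[i])

print(raw, normalized, ranking(raw) == ranking(normalized))
```

In this sketch the ranking of channels is identical either way, so the choice would not change which channels are pruned. The magnitudes differ, though, so a regularization term built from these scores would be scaled differently, which is why I am asking which variant is intended during sparsity training.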