pruna is a library which contains a comprehensive suite of compression algorithms including caching, quantization, pruning, distillation and compilation techniques to make your models more efficient to use. I would be interested in contributing a recipe using pruna and the diffusers library.
Based on a discussion with @davidberenstein1957, the recipe would feature a diffusers model quantized using the smash config, then the smashed model could be used for synthetic dataset creation, or we could evaluate the model against its unquantized version as well. Could you please let me know your thoughts and suggestions if the scope of the recipe is sufficient or any changes to make it more applied as well.
cc: @stevhliu