Feature request
Add a section on Pruna AI to the documentation. We opened a similar PR for diffusers and think it would be useful to show how to optimize transformers models as well.
Motivation
A Pruna AI section in the documentation would show users how to optimize LLMs for inference.
Your contribution
We can do all of the work for the PR ourselves.