I managed to shave off inference timings for SD2.1 by a few seconds for 512x512 (50 steps) and 768x768 (50 Steps).
Using just a few additions:
import torch
from diffusers import StableDiffusionPipeline

# Enable cuDNN autotuning and TF32 matmuls for faster inference on Ampere+ GPUs
torch.backends.cudnn.benchmark = True
torch.backends.cuda.matmul.allow_tf32 = True

pipe = StableDiffusionPipeline.from_pretrained(
    MODEL_ID,
    cache_dir=MODEL_CACHE,
    local_files_only=True,
)
pipe = pipe.to("cuda")
pipe.enable_xformers_memory_efficient_attention()
pipe.enable_vae_slicing()

Overall output quality didn't suffer because of this; I'm still getting crisp images. I wanted to know how I can create a PR to add these, and whether there are any tests around this?
Here are the inference timings:
- OG stability-ai 512x512 50 Steps - 5 secs
- OG stability-ai 768x768 50 Steps - 14.3 secs
- pratos sd2.1 512x512 50 Steps - 3.3 secs
- pratos sd2.1 768x768 50 Steps - 10.6 secs
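For reference, a minimal sketch of how timings like these could be measured. This is an assumption about the methodology, not the exact benchmark script used above; the `time_call` helper and the commented-out pipeline call are hypothetical. Warm-up runs matter here because `torch.backends.cudnn.benchmark = True` autotunes kernels on the first call, which would otherwise inflate the measurement.

```python
import time

def time_call(fn, *args, warmup=1, runs=3, **kwargs):
    """Average wall-clock time of fn over `runs` calls, after
    `warmup` untimed calls (so cuDNN autotuning is excluded)."""
    for _ in range(warmup):
        fn(*args, **kwargs)
    # For GPU work, call torch.cuda.synchronize() before reading
    # the clock so queued kernels are included in the measurement.
    start = time.perf_counter()
    for _ in range(runs):
        fn(*args, **kwargs)
    return (time.perf_counter() - start) / runs

# Hypothetical usage with the pipeline above (not run here):
# avg = time_call(lambda: pipe(prompt, num_inference_steps=50,
#                              height=512, width=512))
# print(f"512x512, 50 steps: {avg:.1f} s")
```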
