
Conversation

@hlky (Contributor) commented on Feb 12, 2025

What does this PR do?

A full benchmarking suite is planned; this is a quick draft that prioritizes benchmarking the Autoencoder.

python benchmarks/benchmark_autoencoderkl.py --pretrained_model_name_or_path "stable-diffusion-v1-5/stable-diffusion-v1-5" --dtype float16 --subfolder vae
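
A minimal sketch of the kind of timing helper such a script relies on; the name benchmark_fn appears later in the review, but this event-based implementation is an assumption, not the PR's exact code:

import torch

def benchmark_fn(fn, *args, warmup=2, iters=5):
    # Warmup runs absorb one-time costs (cudnn autotuning, lazy init).
    for _ in range(warmup):
        fn(*args)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    torch.cuda.synchronize()
    start.record()
    for _ in range(iters):
        fn(*args)
    end.record()
    torch.cuda.synchronize()
    # elapsed_time returns milliseconds; report seconds per call.
    return start.elapsed_time(end) / (1000 * iters)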

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul (Member) left a comment


Thanks! Left some comments. LMK if they make sense.

        self.model_class_name = str(self.model.__class__.__name__)
        self.pretrained_model_name_or_path = pretrained_model_name_or_path

    @torch.no_grad

@sayakpaul (Member) commented:

Suggested change:
-    @torch.no_grad
+    @torch.no_grad()

As per the docs: https://pytorch.org/docs/stable/generated/torch.no_grad.html
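
For reference, a minimal example of the called decorator form the docs describe (the function and tensor here are illustrative only, not part of the PR):

import torch

@torch.no_grad()
def double(x):
    return x * 2

x = torch.ones(3, requires_grad=True)
y = double(x)
print(y.requires_grad)  # False: no autograd graph is recorded inside the call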

    return filepath


class AutoencoderKLBenchmark(BaseBenchmarkTestCase):

@sayakpaul (Member) commented on Feb 12, 2025:

Should we let users define dummy_inputs() per model class here? Then we could let them implement their own function to be benchmarked (a hypothetical subclass sketch follows the snippet below).

So, BaseBenchmarkTestCase could then have a method benchmark():

def benchmark(self, tensor, **kwargs):
    time = benchmark_fn(self.run_decode, self.model, tensor)
    memory = bytes_to_giga_bytes(torch.cuda.max_memory_allocated())  # should this be allocated?
    benchmark_info = BenchmarkInfo(time=time, memory=memory)

    csv_dict = generate_csv_dict_model(
        model_cls=self.model_class_name, ckpt=self.pretrained_model_name_or_path, benchmark_info=benchmark_info, **kwargs,
    )
    print(f"{self.model_class_name} decode - shape: {list(tensor.shape)}, time: {time}, memory: {memory}")
    return csv_dict
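
For illustration, a hypothetical per-model dummy_inputs() along these lines (the subclass body is an assumption; 4 latent channels with 8x downscaling is the SD1.5 VAE layout, so 64x64 latents decode to 512x512 images):

import torch

from diffusers import AutoencoderKL


class AutoencoderKLBenchmark(BaseBenchmarkTestCase):
    model_class = AutoencoderKL

    def dummy_inputs(self, batch_size=1, height=64, width=64):
        # Random latents shaped like the VAE's expected decode input.
        return torch.randn(batch_size, 4, height, width, dtype=self.dtype, device="cuda")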

def __init__(self, pretrained_model_name_or_path, dtype, **kwargs):
    super().__init__()
    self.dtype = getattr(torch, dtype)
    model = self.model_class.from_pretrained(pretrained_model_name_or_path, torch_dtype=self.dtype, **kwargs).eval()

@sayakpaul (Member) commented:

Likewise, we could move all of these reusable components to BaseBenchmarkTestCase (perhaps renamed to ModelBaseBenchmarkTestCase) and let users specify pretrained_model_name_or_path, torch_dtype, subfolder, etc.
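
A rough sketch of that refactor, assuming subclasses only set model_class (the name ModelBaseBenchmarkTestCase follows the rename suggestion; everything else here is hypothetical):

import torch


class ModelBaseBenchmarkTestCase:
    model_class = None  # set by each subclass, e.g. AutoencoderKL

    def __init__(self, pretrained_model_name_or_path, dtype="float16", **kwargs):
        self.dtype = getattr(torch, dtype)
        self.pretrained_model_name_or_path = pretrained_model_name_or_path
        self.model = self.model_class.from_pretrained(
            pretrained_model_name_or_path, torch_dtype=self.dtype, **kwargs
        ).eval().to("cuda")
        self.model_class_name = self.model.__class__.__name__

    def dummy_inputs(self):
        # Each model subclass defines the inputs it is benchmarked with.
        raise NotImplementedError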

        print(f"{self.model_class_name} decode - shape: {list(tensor.shape)}, time: {time}, memory: {memory}")
        return csv_dict

    def test_decode(self):

@sayakpaul (Member) commented:

Not needed for the first iteration, but I would also consider including model.compile().
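
A minimal sketch of how a compiled variant could slot in; benchmark_fn and run_decode are the draft's own helpers, while the warmup step and the use of torch.compile (nn.Module.compile() would compile in place instead) are assumptions:

import torch

def benchmark_compiled(self, tensor):
    compiled = torch.compile(self.model)
    with torch.no_grad():
        compiled.decode(tensor)  # warmup run absorbs one-time compilation cost
    return benchmark_fn(self.run_decode, compiled, tensor)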

@sayakpaul (Member) commented:

On a 4090 without tiling (time in seconds, memory in GB):

AutoencoderKL decode - shape: [1, 4, 32, 32], time: 0.007, memory: 0.461
AutoencoderKL decode - shape: [1, 4, 64, 64], time: 0.031, memory: 1.318
AutoencoderKL decode - shape: [1, 4, 128, 128], time: 0.149, memory: 4.707
AutoencoderKL decode - shape: [1, 4, 256, 256], time: 0.687, memory: 18.221

With tiling:

AutoencoderKL decode - shape: [1, 4, 32, 32], time: 0.007, memory: 0.461
AutoencoderKL decode - shape: [1, 4, 64, 64], time: 0.031, memory: 1.318
AutoencoderKL decode - shape: [1, 4, 128, 128], time: 0.218, memory: 1.322
AutoencoderKL decode - shape: [1, 4, 256, 256], time: 1.032, memory: 1.324
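
Tiling keeps peak decode memory essentially flat (about 1.3 GB) at the two larger shapes, at the cost of roughly 1.5x slower decodes.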

@hlky closed this on Apr 15, 2025.
@hlky deleted the benchmark-autoencoder branch on Apr 15, 2025 at 12:28.