
Conversation

@ajrasane
Contributor

What does this PR do?

Type of change:
Example update

Overview:

  • Optimize the benchmarking function in the diffusers example
python diffusion_trt.py --model flux-dev --benchmark --model-dtype BFloat16 --skip-image --torch

Testing

Backbone-only inference latency (BFloat16):
  Average: 139.48 ms
  P50: 139.36 ms
  P95: 141.13 ms
  P99: 141.35 ms
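Latency summaries like the ones above are typically collected by timing repeated forward passes after a warmup phase and then summarizing the distribution. A minimal, CPU-only sketch of that pattern (the actual script times the GPU backbone, which requires CUDA-event-based timing; `run_backbone` here is a hypothetical stand-in for the timed call):

```python
import time

def run_backbone():
    # Hypothetical stand-in for the timed forward pass; the real
    # benchmark runs the diffusion backbone on the GPU.
    sum(i * i for i in range(10_000))

def benchmark(fn, warmup=3, iters=20):
    """Time `fn` over `iters` runs (after `warmup` untimed runs); returns ms."""
    for _ in range(warmup):
        fn()
    times = []
    for _ in range(iters):
        start = time.perf_counter()
        fn()
        times.append((time.perf_counter() - start) * 1000.0)
    return times

times = benchmark(run_backbone)
print(f"Average: {sum(times) / len(times):.2f} ms")
```

Note that `time.perf_counter` only measures wall-clock time around the call; for GPU work, the real script must account for asynchronous kernel execution, which is what the CUDA-event discussion below the diff is about.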

Before your PR is "Ready for review"

  • Make sure you read and follow Contributor guidelines and your commits are signed.
  • Is this change backward compatible?: Yes
  • Did you write any new necessary tests?: No
  • Did you add or update any necessary documentation?: No
  • Did you update Changelog?: No

@ajrasane ajrasane requested a review from a team as a code owner October 31, 2025 01:18
@ajrasane ajrasane self-assigned this Oct 31, 2025
@ajrasane ajrasane requested a review from cjluo-nv October 31, 2025 01:18
@codecov

codecov bot commented Oct 31, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.39%. Comparing base (ca94c96) to head (646458a).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #487      +/-   ##
==========================================
+ Coverage   74.36%   74.39%   +0.02%     
==========================================
  Files         181      182       +1     
  Lines       18192    18209      +17     
==========================================
+ Hits        13529    13546      +17     
  Misses       4663     4663              

☔ View full report in Codecov by Sentry.
@kevalmorabia97
Collaborator

Please make sure to run the internal GitLab diffusers CI/CD tests to verify they don't break with this change.

@ajrasane ajrasane force-pushed the ajrasane/benchmark_diffusers branch from 89f6c25 to 1aafbbc Compare November 7, 2025 19:28
Signed-off-by: ajrasane <[email protected]>
@ajrasane ajrasane force-pushed the ajrasane/benchmark_diffusers branch from 094aa94 to 646458a Compare November 7, 2025 20:05
def forward_hook(_module, _input, _output):
_ = backbone(**dummy_inputs_dict)
end_event.record()
torch.cuda.synchronize()
Collaborator

I don't think you need to call sync here.


avg_latency = sum(times) / len(times)
times = sorted(times)
p50 = times[len(times) // 2]
Collaborator

I suggest you use numpy.percentile for these instead.
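Following that suggestion, the percentile computation could look like this sketch (`times` is filled with made-up sample latencies here, standing in for the collected per-iteration timings in milliseconds):

```python
import numpy as np

# Hypothetical per-iteration latencies in milliseconds.
times = [139.1, 139.3, 139.4, 139.5, 139.6, 140.2, 141.0, 141.3]

avg_latency = float(np.mean(times))
# np.percentile interpolates between samples, which is more robust than
# indexing into a sorted list, especially for small sample counts.
p50, p95, p99 = np.percentile(times, [50, 95, 99])

print(f"Average: {avg_latency:.2f} ms")
print(f"P50: {p50:.2f} ms  P95: {p95:.2f} ms  P99: {p99:.2f} ms")
```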



5 participants