Skip to content

Commit c410c51

Browse files
bremerm31facebook-github-bot
authored andcommitted
Use active driver benchmarker for do_bench measurements (#537)
Summary: Sometimes the triton backend may set a profiler to be different from the standard `do_bench` implmentation. For cuda this has no functional change, i.e. `triton.runtime.driver.active.get_benchmarker()` still points at `do_bench`. Reviewed By: xuzhao9 Differential Revision: D84215437
1 parent 905b152 commit c410c51

File tree

1 file changed

+1
-1
lines changed
  • tritonbench/components/do_bench

1 file changed

+1
-1
lines changed

tritonbench/components/do_bench/run.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -475,7 +475,7 @@ def do_bench_wrapper(
475475
if latency_measure_mode == "profiler"
476476
else _do_bench_inductor
477477
if latency_measure_mode == "inductor_benchmarker"
478-
else triton.testing.do_bench
478+
else triton.runtime.driver.active.get_benchmarker()
479479
)
480480

481481
return Latency(

0 commit comments

Comments
 (0)