Add Workload object and execution methods #18
base: main
Conversation
rolfmorel left a comment:
Here's a partial pass through it. Will try to complete the first pass through tomorrow!
```python
        pass

    @abstractmethod
    def get_input_arrays(self, execution_engine) -> list:
```
-> abc.Sequence[ir.Value]?
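Spelled out, the suggested signature would be something like this (a sketch, assuming `collections.abc` is the `abc` being referred to and `ir` is the upstream MLIR bindings module):

```python
from abc import ABC, abstractmethod
from collections.abc import Sequence

from mlir import ir


class Workload(ABC):
    @abstractmethod
    def get_input_arrays(self, execution_engine) -> Sequence[ir.Value]:
        """Return the inputs as a read-only sequence rather than a bare list."""
```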
If each Workload were explicitly associated with a Target (or more than one), i.e. had a Target member, and each Target kept track of its associated execution_engine, this signature would become simpler.
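A rough sketch of that shape (all names here are illustrative, not from the PR):

```python
class Target:
    """Illustrative: a Target that owns (and caches) its execution engine."""

    def __init__(self, name: str):
        self.name = name
        self._execution_engine = None

    @property
    def execution_engine(self):
        # However the engine gets created, the Target is the one
        # place that knows about it.
        return self._execution_engine


class Workload:
    def __init__(self, target: Target):
        self.target = target

    def get_input_arrays(self) -> list:
        # No execution_engine parameter needed any more.
        engine = self.target.execution_engine
        ...
```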
Creating the execution engine requires a fully lowered payload IR, so keeping the engine in the Workload object complicates re-use of this object, e.g. for different schedule parameters.
The flow is something like: ParametrizedWorkload(schedule with unknown params, initial payload IR, correctness test, target-specific info, ...) -> ConcreteWorkload(lowered payload IR, target-specific info, ...).
In this proposal the Workload object addresses the first part - it is a re-usable high-level description of the problem. Every time one asks for a payload or schedule module, a new module is generated (the payload module is lowered in place, so it cannot be re-used; the schedule may depend on the given parameters). The latter part - lowering to a final payload IR and executing it - is handled via the execution helper functions. We could design this differently, e.g. by adding another "ConcreteWorkload" object that is associated with an execution engine (either in this PR or later). All ideas are much appreciated!
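Sketched as code, that two-stage flow might look like this (the class shapes are illustrative; only the two stages themselves are from the comment above):

```python
class ParametrizedWorkload:
    """Re-usable high-level description: schedule with unknown params,
    initial payload IR, correctness test, target-specific info, ..."""

    def payload_module(self):
        # Generates a fresh module on every call, since lowering
        # happens in-place and would otherwise destroy it.
        ...

    def schedule_module(self, params):
        # May depend on the given schedule parameters.
        ...


class ConcreteWorkload:
    """Lowered payload IR plus target-specific info; since its IR is
    final, an execution engine could live here."""

    def __init__(self, lowered_payload_ir, target_info):
        self.lowered_payload_ir = lowered_payload_ir
        self.target_info = target_info
```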
```python
        pass

    @abstractmethod
    def get_complexity(self) -> list:
```
If the different "dimensions" of the returned thing are known, and have sensible names, I would go with a NamedTuple.
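Given that the implementation below returns (flop_count, memory_reads, memory_writes), that could look like:

```python
from typing import NamedTuple


class Complexity(NamedTuple):
    flop_count: int
    memory_reads: int
    memory_writes: int
```

Returning `Complexity(flop_count, memory_reads, memory_writes)` stays compatible with tuple unpacking, while callers gain `complexity.flop_count` over positional indexing.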
```python
    print(schedule_module)


def lower_payload(
```
Shouldn't this just be a non-abstractmethod method on the Workload class?
```python
    dump_schedule: bool = False,
):
    if not dump_kernel or dump_kernel != "initial":
        with context, location:
```
In my mind there are very few cases where you have a reference to a mlir.ir.Context and haven't entered its context manager yet. The story for mlir.ir.Location is similar (there you might actually want to enter and exit nested Locations based on the scope of the IR you're building).
Have a look at #20 (comment) and #20 (comment). My take is that the function which acts as the process's main() should set up and enter a mlir.ir.Context and set a mlir.ir.Location.unknown(). If we do that, the rest of the code should barely need to think about these anymore.
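Concretely, something like this at the entry point (a sketch; run_tuning is a stand-in for whatever the driver actually does):

```python
from mlir import ir


def main():
    # Enter the one Context and set a default Location up front; code
    # below this point can build IR without mentioning either again.
    with ir.Context(), ir.Location.unknown():
        run_tuning()  # stand-in for the actual driver logic


if __name__ == "__main__":
    main()
```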
```python
    if not dump_kernel or dump_kernel != "initial":
        with context, location:
            # invoke transform interpreter directly
            transform_interpreter.apply_named_sequence(
```
Now that we have llvm/llvm-project#168223, this can probably be simplified. (E.g. do we need apply_transform_schedule?)
```python
        return (flop_count, memory_reads, memory_writes)

    def payload_module(self):
        with self.context, self.location:
```
Suggested change (remove this line):
```diff
-        with self.context, self.location:
```
```python
        A = f.arguments[0]
        B = f.arguments[1]
        C = f.arguments[2]
        a_tensor = bufferization.ToTensorOp(tensor_t, A, restrict=True)
```
As a general principle we can set for ourselves right at the start of the project: in case the upstream bindings already have a workable snake_case version of an op, let's use that over the CamelCaseOp version. The crux of the argument is that this will make the Python code look closer to a terse version of the MLIR textual format.
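For the line above that would read (assuming the generated bindings expose a matching snake_case builder for this op):

```python
from mlir.dialects import bufferization

# CamelCase op class, as currently in the PR:
a_tensor = bufferization.ToTensorOp(tensor_t, A, restrict=True)

# snake_case builder, closer to the textual `bufferization.to_tensor`:
a_tensor = bufferization.to_tensor(tensor_t, A, restrict=True)
```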
```python
        c_tensor = bufferization.ToTensorOp(
            tensor_t, C, restrict=True, writable=True
        )
        add = linalg.add(a_tensor, b_tensor, outs=[c_tensor])
```
Is there a reason to prefer doing the linalg.add on tensors rather than on memrefs? If anything, the IR would be even simpler.
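A sketch of the memref variant (assuming A, B and C are the memref block arguments from the hunk above):

```python
from mlir.dialects import linalg

# linalg ops on memrefs produce no results, so the to_tensor /
# materialize round-trip disappears entirely:
linalg.add(A, B, outs=[C])
```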
```python
        schedule_module.operation.attributes[
            "transform.with_named_sequence"] = (ir.UnitAttr.get())
        with ir.InsertionPoint(schedule_module.body):
            named_sequence = transform.NamedSequenceOp(
```
Suggested change:
```diff
-            named_sequence = transform.NamedSequenceOp(
+            named_sequence = transform.named_sequence(
```
```python
            if dump_kernel == "bufferized":
                ...
            transform.YieldOp()
        return schedule_module
```
Do we want this in the committed version?
An alternative to consider is inserting transform.print %mod if some verbosity flag has been set.
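For instance (a sketch; whether the bindings expose a PrintOp builder under exactly this name and signature would need checking):

```python
if verbosity > 0:  # hypothetical verbosity flag
    # Emit `transform.print %mod` into the schedule so the module is
    # dumped when the interpreter reaches this point.
    transform.PrintOp(target=mod)
```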
Implements #16.
Adds a Workload object and execution engine utility functions that can execute it.