[D2M][ttnn-jit] Nightly perf collection for Superset #7495

Open

sgholamiTT wants to merge 12 commits into main from
Conversation
This reverts commit 81f47df.
Contributor
Pull request overview
Adds a nightly CI job to collect TTNN-JIT vs TTNN op-level performance data (via tracy CSV exports), then summarizes it into per-test JSON files intended for downstream Superset ingestion.
Changes:
- Introduces a reusable GitHub Actions workflow to run JIT perf collection on TT hardware and upload results as artifacts.
- Adds a new nightly job (`jit-perf-test`) to invoke the reusable workflow.
- Adds a small perf CI suite: parametrized pytest ops, a bash orchestrator, and a CSV→JSON summarizer.
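To make the CSV→JSON summarization step concrete, here is a minimal sketch of the aggregation idea. The CSV columns (`op`, `duration_ns`, `jit_enabled`) and helper names are invented for illustration; the real tracy export and the PR's `summarize_perf_results.py` use a different schema.

```python
import csv
import json
from io import StringIO

# Hypothetical profiler export with one row per test run; the real
# tracy CSV has different column names.
CSV_TEXT = """op,duration_ns,jit_enabled
add,1200,true
add,1000,false
"""

def summarize(csv_text):
    """Group rows per op, pair the JIT and non-JIT runs, and compute a ratio."""
    groups = {}
    for row in csv.DictReader(StringIO(csv_text)):
        g = groups.setdefault(row["op"], {})
        key = "jit_duration_ns" if row["jit_enabled"] == "true" else "ttnn_duration_ns"
        g[key] = int(row["duration_ns"])
    for g in groups.values():
        # >1 means the JIT path was slower than the baseline TTNN path.
        g["perf_ratio"] = g["jit_duration_ns"] / g["ttnn_duration_ns"]
    return groups

reports = summarize(CSV_TEXT)
print(json.dumps(reports["add"]))  # one JSON report per test case
```

The real summarizer emits one JSON file per (op, dtype, memory config) case rather than a single dict, but the pairing logic is the same shape.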
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 8 comments.
| File | Description |
|---|---|
| `.github/workflows/schedule-nightly.yml` | Adds a new nightly job to run JIT perf collection. |
| `.github/workflows/call-jit-perf-test.yml` | New reusable workflow to set up the env, run the perf collection, and upload reports. |
| `test/ttnn-jit/perf_ci/perf_tests.py` | New pytest suite defining op/dtype/memory/JIT parameterization for profiling. |
| `test/ttnn-jit/perf_ci/run_perf_collect.sh` | Orchestrates tracy-profiled per-test runs and invokes summarization. |
| `test/ttnn-jit/perf_ci/summarize_perf_results.py` | Aggregates profiler CSVs and emits per-case JSON reports for ingestion. |
```python
function_to_test = (
    ttnn_jit.jit(debug=True, enable_cache=True)(op) if jit_enabled else op
)
```
Comment on lines +99 to +101:

```python
print(f"output_tensor\n: {output_tensor}")
ttnn.close_device(device)
```
Comment on lines +288 to +289:

```python
filename = f"perf_{op}_{dtype}_{mem_cfg}{job_suffix}.json"
filepath = out_dir / filename
```
Comment on lines +243 to +247:

```python
    g["jit_duration_ns"] = r["duration_ns"]
    g["math_fidelity_jit"] = r["math_fidelity"]
else:
    g["ttnn_duration_ns"] = r["duration_ns"]
    g["math_fidelity_ttnn"] = r["math_fidelity"]
```
Comment on lines +52 to +58:

```yaml
jit-perf-test:
  needs: [ build-image, release-build ]
  uses: ./.github/workflows/call-jit-perf-test.yml
  secrets: inherit
  with:
    docker_image: ${{ needs.build-image.outputs.docker-image }}
```
```bash
pip show ttmlir &> /dev/null && pip uninstall -y ttmlir
pip show ttnn-jit &> /dev/null && pip uninstall -y ttnn-jit
pip install ttnn_jit*.whl --find-links . --upgrade
```
Comment on lines +9 to +10:

```python
import pytest
```
Comment on lines +69 to +73:

```python
def test_op_compare(
    h, w, op, dtype, ttnn_dtype, memory_config, memory_config_id, jit_enabled
):
    device = ttnn.open_device(device_id=0)
    torch_tensor_a = torch.rand((h, w), dtype=dtype) * 100
```
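The JIT toggle seen in these snippets reduces to conditionally wrapping the same callable. A minimal self-contained sketch, with plain Python functions standing in for `ttnn` ops and `fake_jit` standing in for a `ttnn_jit.jit`-style decorator factory (both stand-ins are assumptions for illustration):

```python
def fake_jit(**kwargs):
    """Stand-in for a jit(debug=..., enable_cache=...)-style decorator factory."""
    def decorate(fn):
        def wrapped(*args):
            # A real JIT would compile and dispatch here; we just tag the call.
            wrapped.compiled = True
            return fn(*args)
        wrapped.compiled = False
        return wrapped
    return decorate

def add(a, b):
    """Stand-in for a device op under test."""
    return a + b

# Same callable either way; only the dispatch path differs, so the
# numeric results of the JIT and non-JIT runs should match.
for jit_enabled in (True, False):
    function_to_test = fake_jit(debug=True)(add) if jit_enabled else add
    assert function_to_test(2, 3) == 5
```

This is why the suite can parametrize on `jit_enabled` alone and reuse one test body for both measurement paths.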
Codecov Report

✅ All modified and coverable lines are covered by tests.

```
@@ Coverage Diff @@
##             main    #7495      +/-   ##
==========================================
+ Coverage   69.25%   69.84%   +0.59%
==========================================
  Files         405      419      +14
  Lines       71883    74115    +2232
==========================================
+ Hits        49781    51765    +1984
- Misses      22102    22350     +248
```

☔ View full report in Codecov by Sentry.
Add JIT performance collection to nightly CI
Runs JIT vs TTNN op-level performance benchmarks in the nightly pipeline and exports structured results to Superset for dashboard visualization.
What
- `call-jit-perf-test.yml`: New reusable workflow that sets up the environment (installs artifacts, the tracy profiler, and ttnn-jit wheels), runs `run_perf_collect.sh`, and uploads results as artifacts.
- `schedule-nightly.yml`: Adds the `jit-perf-test` job after `release-build`.
- `perf_tests.py`: Parametrized pytest suite comparing JIT and non-JIT execution of `abs`, `exp`, `add`, `mul`, and `matmul` across dtypes (`bf16`, `bfp8`) and memory configs (`dram_interleaved`). More configs are yet to be decided.
- `run_perf_collect.sh`: Orchestrates tracy-profiled test runs and invokes the summarizer.
- `summarize_perf_results.py`: Parses per-test CSV profiler output and produces one JSON report per test case (op + dtype + memory config). Each report maps to a separate `benchmark_run` row in Superset with clean, filterable columns (`model` = op, `precision` = dtype, `config` = shape/memory/fidelity) and three measurements: `jit_kernel_duration_ns`, `ttnn_kernel_duration_ns`, `perf_ratio`.

Superset integration
Reports are picked up by the existing `collect_data` action, SFTP'd to the perf ingestion server, and loaded into the `sw_test.benchmark_run` / `sw_test.benchmark_measurement` tables. No changes are needed to the data pipeline.

Dashboard: https://superset.tenstorrent.com/superset/dashboard/p/WpVZ5d8pj90/
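A report of the shape described above might look roughly like the following. All field names and values here are hypothetical illustrations of the `model`/`precision`/`config` columns and three measurements; the actual schema lives in `summarize_perf_results.py`.

```python
import json

# Hypothetical kernel durations for one (op, dtype, memory config) case.
jit_ns, ttnn_ns = 5200, 4800

report = {
    "model": "add",                      # op name
    "precision": "bf16",                 # dtype
    "config": "32x32/dram_interleaved",  # shape + memory config
    "measurements": {
        "jit_kernel_duration_ns": jit_ns,
        "ttnn_kernel_duration_ns": ttnn_ns,
        "perf_ratio": jit_ns / ttnn_ns,  # >1 means JIT slower than TTNN
    },
}

# One such JSON file per test case is what the ingestion step picks up.
print(json.dumps(report, indent=2))
```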
Note
This workflow should not increase the nightly duration, as it runs in parallel with the other test matrix jobs.
Currently, the end-to-end process takes around 10 minutes (evidence).
Test plan
- `collect_data` finds and processes all `perf_*_<job_id>.json` reports
- Results land in the `benchmark_run` / `benchmark_measurement` tables