Add performance regression tests to CI #2548

Shigoto-dev19 · 2025-10-14T08:15:58Z

Closes #2545.
Depends on #2586.

How it works

The performance regression workflow in CI checks that compile, prove, and verify times stay consistent across commits. It is driven by two workflows that share a single source-of-truth baseline, stored in the repo at tests/perf-regression/perf-regression.json.

PR checks (checks.yml): the matrix includes “Performance Regression”. The shared script run-ci-tests.sh runs in check mode (defaults to PERF_MODE=--check) and reads the baseline file directly from the repository. It then runs the benchmarks and compares the current results to the stored values to detect regressions. No artifacts are downloaded, and no labels are consulted.
Baseline updates (dump-perf-baseline.yml): a separate, manual workflow (workflow_dispatch) that runs on the same branch. It executes the same performance suite in dump mode (PERF_MODE=--dump), regenerates perf-regression.json on GitHub runners, and **commits the updated file to the same branch**. This becomes the canonical baseline for subsequent PR checks.
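
As a rough illustration of what the check-mode comparison could look like, here is a minimal sketch. The JSON layout of perf-regression.json is not shown in this PR, so the `Timings` and `Baseline` shapes, the `findRegressions` name, and the single relative `tolerance` parameter are all assumptions made for the example, not the actual implementation.

```typescript
// Hypothetical shapes: the real perf-regression.json layout is not shown in this PR.
type Timings = { compile: number; prove: number; verify: number };
type Baseline = Record<string, Timings>;

type Regression = {
  label: string;
  phase: keyof Timings;
  baseline: number;
  current: number;
};

// Returns every phase whose current time exceeds baseline * (1 + tolerance).
// `tolerance` is a relative margin, e.g. 0.1 allows a 10% slowdown (assumed convention).
function findRegressions(
  baseline: Baseline,
  current: Baseline,
  tolerance: number
): Regression[] {
  const regressions: Regression[] = [];
  for (const [label, base] of Object.entries(baseline)) {
    const now = current[label];
    if (now === undefined) continue; // benchmark not measured this run; nothing to compare
    for (const phase of ['compile', 'prove', 'verify'] as const) {
      if (now[phase] > base[phase] * (1 + tolerance)) {
        regressions.push({
          label,
          phase,
          baseline: base[phase],
          current: now[phase],
        });
      }
    }
  }
  return regressions;
}
```

In this sketch a check run would fail the job whenever the returned list is non-empty, while a dump run would skip the comparison and write the fresh measurements out instead.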

How to use

1) Open a PR (check mode)

The Checks workflow runs the Performance Regression matrix entry.
run-ci-tests.sh executes in --check mode, reading tests/perf-regression/perf-regression.json directly from the repo.
Outcome: The PR is validated against the committed baseline; no file changes occur.

2) Update the baseline (dump mode)

Trigger the manual workflow “Dump Performance Regression Baseline” (or run the script npm run regression:ci:dump-perf).
CI runs the performance suite in --dump mode on stable GitHub runners and **commits the refreshed perf-regression.json to the same branch**.
Outcome: The baseline in the same branch is updated and used by all future PR checks.
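
The two modes above can be sketched as a small dispatch. This is only an illustration under stated assumptions: `parsePerfMode` and `nextBaseline` are hypothetical helpers, and the only details taken from this PR are the `--check` and `--dump` flag values and the fact that check is the default.

```typescript
type PerfMode = 'check' | 'dump';

// Maps the raw PERF_MODE value to a mode. An unset or empty value falls back
// to --check, mirroring the default described for run-ci-tests.sh.
function parsePerfMode(raw: string | undefined): PerfMode {
  const flag = raw === undefined || raw === '' ? '--check' : raw;
  if (flag === '--check') return 'check';
  if (flag === '--dump') return 'dump';
  throw new Error(`Unknown PERF_MODE: ${flag}`);
}

// Dump mode replaces the committed baseline with the fresh measurements;
// check mode leaves the committed baseline untouched.
function nextBaseline<T>(mode: PerfMode, committed: T, measured: T): T {
  return mode === 'dump' ? measured : committed;
}
```

Keeping the default on the read-only side means a PR run cannot change the baseline unless dump mode is requested explicitly.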

Notes

The committed tests/perf-regression/perf-regression.json is the canonical baseline; PR checks read it directly.
There are no labels and no artifacts in this flow.
The first baseline must be seeded once (either via the dedicated dump workflow after merge, or via a one-off seed commit); see Add the very first performance regression baseline #2586.
Baselines should not be committed manually in normal operation; always update via the dump workflow to ensure consistent results from GitHub runners.

bleepbloopsify

This actions workflow suffers from something I call "doing too much"

Actions philosophy usually goes something like this:
If you want to change how the action behaves, just use git to change what the underlying repository looks like. The action should follow "open-closed" principle, so most of the logic should live in your perf-regression.ts, rather than out here.

Adding a case to Build-and-Test-Server is a bit redundant if you introduce a separate workflow, but makes sense if you want to keep them under the same umbrella.

The pattern I expect to see here usually goes something like:

1. Add test case to `Build-and-Test-Server` and `run-ci-tests.sh`
2. `run-ci-tests.sh` calls your test or runs a bash script (check the `cache-regression` case for an example)

Your bash script should:

1. do the download, and / or be responsible for the artifact location (looks like it's `tests/perf-regression/perf-regression.json`).
2. run the specific tests.

This way we don't have an action responsible for "Dump or Check", we have two locations where we simply "call" your script to allow it to be dumped properly.

suggestions:

1. drop: this workflow. do not look for other JSON blobs that may or may not exist. only this copy of the repo exists as far as this script is concerned
2. drop: a workflow that runs the perf-regression dumping (we have it in npm already). You might add one that runs the `dump` without actually dumping anything, just to make sure the flow works, but not required.
3. procedure: if you want to update `perf-regression`, just run the script locally and check it into git
4. procedure: run-ci-tests should just run the perf regression test, without having to introduce another git action step

question: how large are the perf regression JSONs? we might be able to upload them to GCP if they're quite large?

Shigoto-dev19 · 2025-10-15T15:25:05Z

> question: how large are the perf regression JSONs? we might be able to upload them to GCP if they're quite large?

The final size is 1826 bytes.

bleepbloopsify · 2025-10-15T15:27:44Z

> > question: how large are the perf regression JSONs? we might be able to upload them to GCP if they're quite large?
>
> The final size is 1826 bytes.

well, never mind then, that is quite small

Trivo25 · 2025-10-16T10:35:51Z

> This actions workflow suffers from something I call "doing too much"
>
> Actions philosophy usually goes something like this: If you want to change how the action behaves, just use git to change what the underlying repository looks like. The action should follow "open-closed" principle, so most of the logic should live in your perf-regression.ts, rather than out here.
>
> Adding a case to Build-and-Test-Server is a bit redundant if you introduce a separate workflow, but makes sense if you want to keep them under the same umbrella.
>
> The pattern I expect to see here usually goes something like:
>
> 1. Add test case to `Build-and-Test-Server` and `run-ci-tests.sh`
> 2. `run-ci-tests.sh` calls your test or runs a bash script (check the `cache-regression` case for an example)
>
> Your bash script should:
>
> 1. do the download, and / or be responsible for the artifact location (looks like it's `tests/perf-regression/perf-regression.json`).
> 2. run the specific tests.
>
> This way we don't have an action responsible for "Dump or Check", we have two locations where we simply "call" your script to allow it to be dumped properly.
>
> suggestions:
>
> 1. drop: this workflow. do not look for other JSON blobs that may or may not exist. only this copy of the repo exists as far as this script is concerned
> 2. drop: a workflow that runs the perf-regression dumping (we have it in npm already). You might add one that runs the `dump` without actually dumping anything, just to make sure the flow works, but not required.
> 3. procedure: if you want to update `perf-regression`, just run the script locally and check it into git
> 4. procedure: run-ci-tests should just run the perf regression test, without having to introduce another git action step
>
> question: how large are the perf regression JSONs? we might be able to upload them to GCP if they're quite large?

agree with the exception of

> procedure: if you want to update perf-regression, just run the script locally and check it into git

because I think we will have to dump the data on the same runners where we also run the checks, otherwise we might get too big of a variance

Shigoto-dev19 · 2025-10-16T15:25:57Z

> agree with the exception of
>
> > procedure: if you want to update perf-regression, just run the script locally and check it into git
>
> because I think we will have to dump the data on the runners where we will also check the tests, otherwise we might get too big of a variance

I also agree with Florian; that's why I took the approach of doing both dump and check in CI, to eliminate the notable variance that comes from running the performance tests on different machines.

bleepbloopsify · 2025-10-16T15:51:46Z

> because I think we will have to dump the data on the runners where we will also check the tests, otherwise we might get too big of a variance

ah of course, this makes sense

bleepbloopsify

I like this much better! "It is CURRENT_REF" type 💩

I dislike gh as a dependency, but since it's already here I guess we can let it hang out a bit longer.

Small nit about a script that only calls one command: at that point, do we really need a script?

otherwise, 🚢 it

bleepbloopsify · 2025-10-20T21:16:30Z

tests/perf-regression/dump-perf-ci.sh

This file is interesting, but is there a reason we need it?

My understanding of GH actions is like this:

We trigger the action -> it calls the bash script

not:
we run the bash code -> it calls the gh workflow

CI is meant to make sure that the scripts we can run locally stay running, hence the "Continuous" of "Continuous Integration"

request: delete this file, since all it does is call gh workflow run (with preset flags, but man gh should take care of that if necessary).

if it's meant to be manual, just direct people to the web interface and have them call the action manually (since workflow_dispatch is available as a trigger)

an anecdotal aside just to make sure I'm thinking about it the right way:

given the question: would you write a script to encode how we call git on our repository so everyone can do it the same way?

my answer would be: no, unless you need to chain multiple brittle git invocations in a row.

so in this case, I would probably only write a script for it if you need to run multiple git commands in a row.

I think this PR is fine without this script at all 📦

Shigoto-dev19 added the dump-performance and no changelog labels Oct 14, 2025

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from 206f511 to c0aeb4a on October 14, 2025 09:05

Shigoto-dev19 removed the dump-performance label Oct 14, 2025

Add performance regression tests to CI

b0568fc

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from c0aeb4a to b0568fc on October 14, 2025 09:55

Shigoto-dev19 added the dump-performance label Oct 14, 2025

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from 955e6f5 to ee33eaa on October 14, 2025 16:50

Shigoto-dev19 removed the dump-performance label Oct 15, 2025

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch 3 times, most recently from 51b3fdb to f5f1909 on October 15, 2025 07:52

Shigoto-dev19 added the dump-performance label Oct 15, 2025

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch 2 times, most recently from 17774b6 to c83ae06 on October 15, 2025 08:15

Fix performance regression check in CI by downloading the baseline

e512d76

Shigoto-dev19 removed the dump-performance label Oct 15, 2025

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from c83ae06 to e512d76 on October 15, 2025 08:57

Shigoto-dev19 added 2 commits October 15, 2025 13:23

Prettify logs for zkprograms performance regression check

55e2072

Prettify logs and adjust tolerances for zkApp and CS performance regression checks

08b2fb1

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from b2ff575 to 437f588 on October 15, 2025 11:20

Refactor performance regression CI tests into unified composite action

4702ace

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from 437f588 to 4702ace on October 15, 2025 11:38

Shigoto-dev19 added the dump-performance label Oct 15, 2025

Shigoto-dev19 added 2 commits October 15, 2025 16:36

Refine naming in performance regression CI action

8ba94f2

Make performance regression CI checks use main as baseline source

d15de7f

Shigoto-dev19 marked this pull request as ready for review October 15, 2025 14:41

Shigoto-dev19 requested review from a team as code owners October 15, 2025 14:41

Shigoto-dev19 requested review from bleepbloopsify and ymekuria October 15, 2025 14:41

bleepbloopsify reviewed Oct 15, 2025

Shigoto-dev19 added 4 commits October 20, 2025 13:23

Replace perf-regression CI action with a separate workflow to dump performance regression baseline

ae06a44

Update checks.yml

6130696

Add a new script to trigger CI to dump performance regression baseline

de1d6ce

Merge branch 'main' into shigoto/performance-regression-ci-tests

ec2b69d

Shigoto-dev19 mentioned this pull request Oct 20, 2025

Add the very first performance regression baseline #2586

Merged

Shigoto-dev19 marked this pull request as draft October 20, 2025 10:39

Shigoto-dev19 removed the dump-performance label Oct 20, 2025

Shigoto-dev19 added 2 commits October 20, 2025 15:55

Merge branch 'main' into shigoto/performance-regression-ci-tests

13c4720

Amend tolerances for ZkProgram compile time regression testing

ba545cb

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from 8b45c6d to 666c0c8 on October 20, 2025 19:05

Target current branch instead of main in dump-perf-baseline CI workflow

cad4af9

Shigoto-dev19 force-pushed the shigoto/performance-regression-ci-tests branch from 666c0c8 to cad4af9 on October 20, 2025 19:50

Shigoto-dev19 marked this pull request as ready for review October 20, 2025 20:41

Shigoto-dev19 requested review from Trivo25 and bleepbloopsify October 20, 2025 20:41

bleepbloopsify approved these changes Oct 20, 2025
