Conversation
Anyway, it would be cool if somebody could tell me why the test fails, because looking at the details gives me a 404.
```python
    baseline[prop] = max(bench[prop] for _, bench in progress_reporter(
        benchmarks, tr, "{line} ({pos}/{total})", line=line)
        if bench.get("baseline", True))
except ValueError:
```
Can we avoid the try/except somehow? What can actually raise that error?
If there is no benchmark in the group marked as baseline, this will end up calling `max(())`, which raises `ValueError: max() arg is an empty sequence`.
Should I convert this to an `if`? It would require evaluating the list of values up-front and then checking its `len(...)`. Or would you prefer me to add a comment about this?
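For illustration, here is a hedged sketch of the alternatives being discussed. The function names and the benchmark-dict shape are hypothetical, not the actual pytest-benchmark internals; a third option using `max()`'s `default` keyword (available since Python 3.4) is included because it sidesteps both the try/except and the up-front list:

```python
# Illustrative sketch of the alternatives discussed; names and structure
# are hypothetical, not the actual pytest-benchmark internals.

def pick_baseline_try(benchmarks, prop):
    # Option 1 (current): rely on max(()) raising
    # "ValueError: max() arg is an empty sequence".
    try:
        return max(b[prop] for b in benchmarks if b.get("baseline", True))
    except ValueError:
        return None

def pick_baseline_if(benchmarks, prop):
    # Option 2: materialize the candidates up-front, then check the length.
    candidates = [b[prop] for b in benchmarks if b.get("baseline", True)]
    if not candidates:
        return None
    return max(candidates)

def pick_baseline_default(benchmarks, prop):
    # Option 3: max() accepts a `default` for empty iterables (Python 3.4+),
    # avoiding both the exception handler and the intermediate list.
    return max((b[prop] for b in benchmarks if b.get("baseline", True)),
               default=None)
```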
```diff
 if name not in (
     "max_time", "min_rounds", "min_time", "timer", "group", "disable_gc", "warmup",
-    "warmup_iterations", "calibration_precision", "cprofile"):
+    "warmup_iterations", "calibration_precision", "cprofile", "baseline"):
```
Not sure how, but we should have some way to validate that there is only one baseline per group. It doesn't make sense to have two baselines, right? Let's not let users wonder why stuff doesn't work as expected (the annoying silent failure).
The current implementation marks all results as possible baselines by default and only excludes the ones marked as `baseline=False`. If there is more than one `baseline=True` benchmark available, it will choose the one with the lowest value/highest score. This integrates perfectly with the existing behaviour and means that I don't have to pick one baseline value for all time (as performance may differ between systems, etc.). The included docs actually mention this. As you mentioned, there cannot be two baselines when the output is rendered, but there can be more than one potential baseline score.
Unless you have strong feelings about this, I'd like to keep it this way for extra flexibility. The wording could be improved, however: maybe something along the lines of `potential_baseline`, but shorter?
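To make the selection rule concrete, here is a small sketch of the behaviour described above: every result is a potential baseline unless it opts out with `baseline=False`, and among several candidates the best score wins. The function name, dict keys, and the use of mean time as the score are illustrative assumptions, not the actual implementation:

```python
# Illustrative sketch of the selection rule described above; not the
# actual pytest-benchmark code. Each benchmark dict carries a mean time
# and an optional "baseline" flag that defaults to True.

def select_baseline(benchmarks):
    # Everything is a potential baseline unless explicitly opted out.
    candidates = [b for b in benchmarks if b.get("baseline", True)]
    if not candidates:
        return None
    # Among several candidates, pick the best score; for timings this
    # means the lowest mean (the real code may use max() for other props).
    return min(candidates, key=lambda b: b["mean"])
```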
Glad to hear back from you! Did you take a look at the CI yet?

@ionelmc: Ping

This would be a nice feature.

Just had a need for this exact use case; I would really like to be able to pin a specific benchmark as 1.0x and make everything else relative to that.
Add `baseline` option that determines (according to the included docs):