
Conversation

@Kobzol Kobzol (Member) commented Nov 17, 2024

This PR adds a new comparison mode to the compile result compare page. When the same commit is compared against itself, this mode lets users compare the results of the LLVM and Cranelift backends.

[Screenshot: llvm-vs-cranelift comparison]

This was IMO the easiest way to introduce this comparison mode. Implementing it on the backend would require non-trivial changes, and while this is arguably a bit of a hack, I think it will work fine for the use-case we need it for. In the future this could easily be generalized to multiple backends, but for now it wasn't worth the hassle.

How to test:

  1. Get results:
$ rustup component add rustc-codegen-cranelift-preview --toolchain nightly
$ cargo run --bin collector -- bench_local `rustup +nightly which rustc` --include serde --profiles Debug --backends Llvm,Cranelift
  2. Start the website (if the local DB only contains the data from the run above, it should open the single commit compared against itself)
  3. Click on "Self-compare backend"

@Kobzol Kobzol requested a review from lqd November 17, 2024 09:26
@lqd lqd (Member) left a comment


Exciting!

Here are a few comments/questions

Review comment on this frontend snippet:

    });
    }
    const record = benchmarkMap.get(key);
    if (comparison.backend === "llvm") {
@lqd (Member) commented:

we normalize the case here somewhere I guess?

@Kobzol (Member, Author) replied:

We don't, the name comes from here.
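To illustrate why the case question matters, here is a minimal, hypothetical sketch of the kind of backend split the snippet performs. The names `ComparisonRow` and `splitByBackend` are invented for illustration and do not appear in rustc-perf; the point is that a strict `=== "llvm"` check is case-sensitive and relies on the backend name arriving already lowercased from the data source.

```typescript
// Hypothetical sketch: partition comparison rows into the two sides of a
// self-comparison by backend. Assumes the data source always serializes
// backend names in lowercase ("llvm", "cranelift").
interface ComparisonRow {
  benchmark: string;
  backend: string; // e.g. "llvm" or "cranelift"
  value: number;
}

function splitByBackend(rows: ComparisonRow[]): {
  llvm: ComparisonRow[];
  cranelift: ComparisonRow[];
} {
  const llvm: ComparisonRow[] = [];
  const cranelift: ComparisonRow[] = [];
  for (const row of rows) {
    // Strict, case-sensitive comparison: a row with backend "Llvm" would
    // NOT match here and would silently land in the other bucket.
    if (row.backend === "llvm") {
      llvm.push(row);
    } else {
      cranelift.push(row);
    }
  }
  return { llvm, cranelift };
}
```

Because the comparison is case-sensitive, the check only works if the serialized name never varies in case; this is why it matters that the name comes from a single, consistent source rather than being normalized in the frontend.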

@Kobzol Kobzol (Member, Author) commented Nov 18, 2024

Thanks for the comments; I went through them and resolved them.

@lqd lqd (Member) commented Nov 18, 2024

There are a few cases where we have before/after assumptions on the compare page, and I don't know if we need to do anything here now:

  • the links to the detailed query results; the three or so links we have there are about commits (it could be interesting to have the "before" link be for the LLVM backend and "after" for Cranelift), but I don't remember whether the page supports selecting the backend, whether we store and load measureme files per backend in the first place, and so on
  • the frontend/backend/linker sections graph

These are all useful for understanding where the gains/losses are, but it's likely a bit of work, and maybe it's fine not to do anything yet.

Probably not a big deal, but when using this we'll also need to remember to use the "after" commit in the compare UI when doing actual single-commit comparisons on rustc-perf. Otherwise there won't be results for both backends on the baseline commit until we profile Cranelift on each merge.

There's also an interesting case that may be more common for the foreseeable future: are broken benchmarks only shown when the two commits are different? I say "common" because cg_clif currently doesn't support LTO, so your recent try build shows broken benchmarks here but not on the single-commit comparison. (At some point some of the exa benchmarks were showing as -100% for me, but I can't reproduce this anymore: after some reloads the failed benchmark rows seem to be gone, so it was likely a transient caching issue.)

@Kobzol Kobzol (Member, Author) commented Nov 18, 2024

are broken benchmarks only shown when the 2 commits are different?

Yes, it seems so, because the page only shows "newly broken" benchmarks:

    newly_failed_benchmarks: errors_in_b.into_iter().collect(),
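The set difference behind that field can be sketched as follows. This is a hedged illustration in the frontend's language; `newlyFailedBenchmarks`, `errorsA`, and `errorsB` are hypothetical names, not identifiers from rustc-perf.

```typescript
// Hypothetical sketch: a benchmark counts as "newly broken" only if it
// errored in run B and did not error in run A — i.e. the errors present
// in B minus those already present in A.
function newlyFailedBenchmarks(
  errorsA: Map<string, string>, // benchmark name -> error message in run A
  errorsB: Map<string, string>  // benchmark name -> error message in run B
): string[] {
  const result: string[] = [];
  for (const benchmark of errorsB.keys()) {
    if (!errorsA.has(benchmark)) {
      result.push(benchmark);
    }
  }
  return result;
}
```

One consequence: when a run is compared against itself, both sides share the same error set, so the difference is empty and no benchmark ever shows as newly broken — which matches the observed behavior that broken benchmarks only appear when the two compared runs have different errors.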

To be honest, I consider this functionality a hack for the specific use-case of comparing the LLVM and Cranelift backends, rather than something meant to work universally in all situations. So things like detailed results and charts are out of scope, IMO.

@lqd lqd (Member) commented Nov 18, 2024

At some point some of the exa benchmarks were showing as -100%

These shouldn't have data, since they're broken benchmarks in this run IIUC (and incidentally, toggling "Display raw data" filters these rows out). I'm not sure exactly when it happens, but for historical purposes, this is how it looks when it does:

[Screenshot: failed exa benchmark rows showing -100%]

@lqd lqd (Member) left a comment


🚀

@Kobzol Kobzol merged commit 0695b84 into master Nov 18, 2024
11 checks passed
@Kobzol Kobzol deleted the compare-backends branch November 18, 2024 15:09