[EZ] Check for stable outputs #69

PaliC · 2025-08-12T18:19:41Z

Fixes: #71 and #70

For checking for ones and zeros and noting that in the dataset, I think there should be a followup PR to #57

msaroufim

stable/unstable to me means numerically stable or unstable. In this case I believe better terminology would be constant output functions such as full, zeros, ones

And the other problem is degenerate benchmarks where for instance torch.matmul(A,B) has A and B mostly 0 so the output is always 0

Note that it's fine if a single example in the benchmark is degenerate, the problem is if all of them are

PaliC · 2025-08-13T22:32:56Z

@msaroufim are you saying its fine if a single test in the op is degenerate, however, if all of the tests are degenerate then we should axe it. I think filtering at a test by test level would end up achieving the same thing, and remove the degenerate tests as well.

msaroufim · 2025-08-13T23:04:29Z

Yup maybe warning for degenerate input when benchmarking for now is sufficient. Input includes lots of 0, NaN, constant value for everything

PaliC added 2 commits August 12, 2025 11:18

[ez] Check for stable outputs

0609b8e

add test

2e7ad3b

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 12, 2025

PaliC requested review from msaroufim, bertmaher and jiannanWang August 12, 2025 18:20

fix test

fb4fba4

msaroufim reviewed Aug 12, 2025

View reviewed changes

add inputs

deeb867

PaliC requested a review from msaroufim August 14, 2025 18:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[EZ] Check for stable outputs #69

[EZ] Check for stable outputs #69

Uh oh!

PaliC commented Aug 12, 2025 •

edited

Loading

Uh oh!

msaroufim left a comment

Uh oh!

PaliC commented Aug 13, 2025

Uh oh!

msaroufim commented Aug 13, 2025

Uh oh!

Uh oh!

[EZ] Check for stable outputs #69

Are you sure you want to change the base?

[EZ] Check for stable outputs #69

Uh oh!

Conversation

PaliC commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

msaroufim left a comment

Choose a reason for hiding this comment

Uh oh!

PaliC commented Aug 13, 2025

Uh oh!

msaroufim commented Aug 13, 2025

Uh oh!

Uh oh!

PaliC commented Aug 12, 2025 •

edited

Loading