You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Empirical data has shown that the noise of our testing framework is
quite high. Criterions default threshold of 1% yields a lot of
false-positives (to the point where multiple tests fail spuriously even
if there is no code change). Therefore, we increase it to 10%. This is
still a significantly lower threshold than what the old baseline-based
approach uses, where the deltas ranges from 16% to 400% (yes,
four-hundred). Note that at 5%, significant rates of spurious failures
were still observed.
Signed-off-by: Patrick Roy <[email protected]>
0 commit comments