Make seedless test deterministic by oliviagyg · Pull Request #64 · getyourguide/basemath

oliviagyg · 2025-01-27T11:11:44Z

We hit a rare issue in the CI run here: one test, test_seed_not_provided, failed. This had nothing to do with the commit; it's an issue with the test itself. We call evaluate_experiment with a small number of samples and -1 overall successes, which leads to the 'coin flip' that checks whether this test experiment might have 'crossed the line'. In this case there's a small (~1%) chance of 'failing' the coin flip and getting a different result than the test normally expects, and that's what happened here.

We avoid this in other tests by setting the seed, but this is the one test where we deliberately don't do that so we can test the seedless behaviour. So, the fix is to make the test deterministic in other ways. This PR bypasses the coin flip by making sure its probability is zero in our first call (by logging no samples at all) and then logging all the required samples at once in the second call.
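For anyone who wants the idea in isolation, here is a minimal toy sketch of why a zero probability makes an unseeded coin flip deterministic. It is not basemath's implementation: `coin_flip_crossed`, `count_crossings`, and `p_cross` are hypothetical names made up for illustration, and the real crossing probability is computed by the library.

```python
import random


def coin_flip_crossed(p_cross, rng):
    """Toy stand-in for a randomized 'crossed the line' check.

    Returns True with probability p_cross. Hypothetical helper, not basemath code.
    """
    # rng.random() lies in [0.0, 1.0), so p_cross == 0.0 can never return True
    # and p_cross == 1.0 always returns True, whatever the seed.
    return rng.random() < p_cross


def count_crossings(p_cross, runs=10_000):
    """Count how often the flip comes up True over many unseeded runs."""
    rng = random.Random()  # deliberately unseeded, like the seedless test
    return sum(coin_flip_crossed(p_cross, rng) for _ in range(runs))


# With probability zero the outcome no longer depends on the random state.
assert count_crossings(0.0) == 0
```

With a small nonzero probability such as 0.01, `count_crossings` will usually return a nonzero count, which is the kind of rare flake a seedless test can hit.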

gygAlexWeiss

Took me a moment to understand what you're doing here. Understood. Looks good.

oliviagyg added 3 commits January 27, 2025 12:00

Update test without seed to avoid coin flip

49efe83

Fix comment

6ddbefa

Another comment tweak

8a09c96

oliviagyg requested review from a team and gygAlexWeiss as code owners January 27, 2025 11:11

gygAlexWeiss approved these changes Jan 27, 2025

gygAlexWeiss merged commit 88a4fe8 into main Jan 27, 2025
3 checks passed

gygAlexWeiss deleted the make-seedless-test-deterministic branch January 27, 2025 15:34
