Conversation

@llucax
Contributor

@llucax llucax commented Nov 29, 2024

Hopefully this finally fixes the flaky hypothesis tests.

  • Remove deprecated uses of pytest-asyncio
  • Use a float for the tolerance in the timer tests
  • Revert "Use less extreme values for min and max timedelta in tests"

The `event_loop` fixture is deprecated and `event_loop_policy` should be
used instead.  The option `asyncio_default_fixture_loop_scope
= "function"` is also added to `pyproject.toml`, since relying on the
default is deprecated as well.

Signed-off-by: Leandro Lucarella <[email protected]>
@llucax llucax requested a review from a team as a code owner November 29, 2024 12:11
@llucax llucax requested a review from shsms November 29, 2024 12:11
@github-actions github-actions bot added part:tests Affects the unit, integration and performance (benchmarks) tests part:tooling Affects the development tooling (CI, deployment, dependency management, etc.) labels Nov 29, 2024
@llucax llucax enabled auto-merge November 29, 2024 12:12
@llucax llucax added this to the v1.4.0 milestone Nov 29, 2024
@llucax llucax self-assigned this Nov 29, 2024
@llucax llucax added the type:bug Something isn't working label Nov 29, 2024
@llucax
Contributor Author

llucax commented Nov 29, 2024

Tests failed again when queuing a PR, so here is another attempt.

@daniel-zullo-frequenz
Copy link
Contributor

> Tests failed again when queuing a PR, so here is another attempt.

Hypothesis usually tells you how to reproduce the error by temporarily adding `@reproduce_failure({PARAMETERS, FOR, THE, TEST, HERE})` as a decorator on the test case. Have you tried that, to be sure the patch solves the issue?

@llucax
Contributor Author

llucax commented Nov 29, 2024

Good tip. I did validate it manually, but only for a previous attempt, and forgot to re-check with the new approach. I will check with the decorator 👍 💯

@llucax llucax disabled auto-merge November 29, 2024 12:29
@llucax
Contributor Author

llucax commented Nov 29, 2024

FYI, this was the failure:

```
______________________ test_policy_skip_missed_and_drift _______________________

    @hypothesis.given(
>       tolerance=st.integers(min_value=0, max_value=_max_timedelta_microseconds),
        **_calculate_next_tick_time_args,
    )

tests/test_timer.py:148:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

tolerance = 171726190479152817, now = 171726190479152817
scheduled_tick_time = -1, interval = 1

    @hypothesis.given(
        tolerance=st.integers(min_value=0, max_value=_max_timedelta_microseconds),
        **_calculate_next_tick_time_args,
    )
    def test_policy_skip_missed_and_drift(
        tolerance: int, now: int, scheduled_tick_time: int, interval: int
    ) -> None:
        """Test the SkipMissedAndDrift policy."""
        hypothesis.assume(now >= scheduled_tick_time)

        next_tick_time = SkipMissedAndDrift(
            delay_tolerance=timedelta(microseconds=tolerance)
        ).calculate_next_tick_time(
            now=now, interval=interval, scheduled_tick_time=scheduled_tick_time
        )
        if tolerance < interval:
            assert next_tick_time > now
        drift = now - scheduled_tick_time
        if drift > tolerance:
>           assert next_tick_time == now + interval
E           assert 0 == (171726190479152817 + 1)
E           Falsifying example: test_policy_skip_missed_and_drift(
E               tolerance=171_726_190_479_152_817,
E               now=171_726_190_479_152_817,
E               scheduled_tick_time=-1,
E               interval=1,  # or any other generated value
E           )

tests/test_timer.py:166: AssertionError
```

@daniel-zullo-frequenz
Contributor

> FYI, this was the failure

I see that in this case Hypothesis only printed the falsifying example, without a `@reproduce_failure` hint.

When using an `int`, the value goes through a double conversion, first
to `float` and then back to `int`, and due to rounding errors this
produces inconsistencies between the expected and actual values.

This is an example failure:

```
______________________ test_policy_skip_missed_and_drift _______________________

    @hypothesis.given(
>       tolerance=st.integers(min_value=0, max_value=_max_timedelta_microseconds),
        **_calculate_next_tick_time_args,
    )

tests/test_timer.py:148:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

tolerance = 171726190479152817, now = 171726190479152817
scheduled_tick_time = -1, interval = 1

    @hypothesis.given(
        tolerance=st.integers(min_value=0, max_value=_max_timedelta_microseconds),
        **_calculate_next_tick_time_args,
    )
    def test_policy_skip_missed_and_drift(
        tolerance: int, now: int, scheduled_tick_time: int, interval: int
    ) -> None:
        """Test the SkipMissedAndDrift policy."""
        hypothesis.assume(now >= scheduled_tick_time)

        next_tick_time = SkipMissedAndDrift(
            delay_tolerance=timedelta(microseconds=tolerance)
        ).calculate_next_tick_time(
            now=now, interval=interval, scheduled_tick_time=scheduled_tick_time
        )
        if tolerance < interval:
            assert next_tick_time > now
        drift = now - scheduled_tick_time
        if drift > tolerance:
>           assert next_tick_time == now + interval
E           assert 0 == (171726190479152817 + 1)
E           Falsifying example: test_policy_skip_missed_and_drift(
E               tolerance=171_726_190_479_152_817,
E               now=171_726_190_479_152_817,
E               scheduled_tick_time=-1,
E               interval=1,  # or any other generated value
E           )

tests/test_timer.py:166: AssertionError
```

Using `float` directly ensures we are comparing the same values in the
tests and in the code.

Some explicit examples are now included in the hypothesis tests to
ensure this issue is not reintroduced.

Signed-off-by: Leandro Lucarella <[email protected]>
Tests failed because of the double conversion fixes in the previous
commit, so we can remove this hack now.

This reverts commit 1084381.

Signed-off-by: Leandro Lucarella <[email protected]>
@llucax
Contributor Author

llucax commented Nov 29, 2024

Yeah, I don't know exactly how `@reproduce_failure` is used, but `example()` works pretty well. I'm adding some examples, although I can't test the falsifying example precisely, because we switched from `int` to `float` for the problematic value.

With this in mind, it seems like this fixes the issue. I pushed some updates adding the examples, so they are always tested, just in case.

@llucax llucax added this pull request to the merge queue Nov 29, 2024
Merged via the queue into frequenz-floss:v1.x.x with commit 10a29d7 Nov 29, 2024
14 checks passed
@llucax llucax deleted the fix-tests branch November 29, 2024 13:16