Change query fuzz test for bottomk, topk #6350
Conversation
(force-pushed: 27fd5ea → 84c1ffe)
integration/query_fuzz_test.go (outdated diff):
```go
nonDeterministicQuery := false
for i := 0; i < runForNonDeterCheck; i++ {
	r1, err1 := c2.Query(query, queryTime)
	r2, err2 := c2.Query(query, queryTime)
	// ... comparison of r1/r2 (and err1/err2) elided in this diff excerpt
}
```
If you run the same bottomk or topk query against Prometheus or Cortex twice within a short period of time, it is highly likely to return the same result, I think. I have tested this against Prometheus.
What we could do instead is use a different result comparer that only checks the result series length, rather than the content, whenever we see bottomk or topk in the query. Alternatively, we could have dedicated test cases for topk and bottomk and ignore them in the regular test cases.
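A minimal sketch of such a length-only comparer, assuming the Prometheus `model` result types used by the test client; `sameResultLength` is a hypothetical helper, not code from this PR:

```go
import (
	"reflect"

	"github.com/prometheus/common/model"
)

// sameResultLength (hypothetical helper): for topk/bottomk queries, only
// require that both results contain the same number of series.
func sameResultLength(r1, r2 model.Value) bool {
	switch v1 := r1.(type) {
	case model.Vector:
		v2, ok := r2.(model.Vector)
		return ok && len(v1) == len(v2)
	case model.Matrix:
		m2, ok := r2.(model.Matrix)
		return ok && len(v1) == len(m2)
	default:
		// Scalars and strings have no series; fall back to a full comparison.
		return reflect.DeepEqual(r1, r2)
	}
}
```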
Is it sufficient to compare the length of the series?
Why do bottomk or topk query results differ? I was thinking the reason is non-determinism, but I feel like my thinking might be wrong.
As you said, it is non-deterministic, but the rate of non-determinism is not that high, I think. If you run 100 queries, the issue probably occurs once or twice. Running the same query against c2 twice is also highly likely to produce the same result, so this can still be flaky.

> Is it sufficient to compare the length of the series?

We can add other verification if we find something doable that also reduces flakiness. At least comparing series length is better than not testing those functions at all.
OK, yes.. the same query against c2 is also flaky.
What about comparing the number of samples? Comparing the series length is also flaky.
I don't think the number of samples is guaranteed to be the same. We randomize the timestamp of the first sample of each series, so depending on your query you might be able to query some series but miss others. When the result is non-deterministic, you will get a different number of samples depending on which series are returned.
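For illustration, a hedged sketch of the kind of data generation described above; the helper name, parameters, and offset bound are assumptions, not the repo's actual code:

```go
import (
	"math/rand"
	"time"

	"github.com/prometheus/prometheus/model/labels"
	"github.com/prometheus/prometheus/storage"
)

// appendWithRandomStart (hypothetical): each series gets a random offset for
// its first sample, so a fixed query range may cover some series' first
// samples while missing others'.
func appendWithRandomStart(app storage.Appender, lbls labels.Labels, scrapeStart time.Time, value float64, rnd *rand.Rand) error {
	// Random offset; the 5-minute bound is chosen arbitrarily for illustration.
	startOffset := time.Duration(rnd.Intn(300)) * time.Second
	firstSampleTime := scrapeStart.Add(startOffset)
	_, err := app.Append(0, lbls, firstSampleTime.UnixMilli(), value)
	return err
}
```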
Can we change promqlsmith to not combine bottomk/topk with the "time based" functions (day_of_week, time_of_day, etc.)? I think those account for the bulk of the errors.
Maybe, tbh, we could even remove those functions from the fuzz test and create a new fuzz test just for them (see the sketch below).
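A hedged sketch of filtering the function set handed to promqlsmith, assuming its `WithEnabledFunctions` option and `New` constructor (verify against the library's current API before relying on this):

```go
import (
	"math/rand"

	"github.com/cortexproject/promqlsmith"
	"github.com/prometheus/prometheus/model/labels"
	"github.com/prometheus/prometheus/promql/parser"
)

// newSmithWithoutTimeFunctions (hypothetical): build a generator whose
// allow-list excludes the time-based PromQL functions.
func newSmithWithoutTimeFunctions(rnd *rand.Rand, seriesSet []labels.Labels) *promqlsmith.PromQLSmith {
	enabled := make([]*parser.Function, 0, len(parser.Functions))
	for _, f := range parser.Functions {
		switch f.Name {
		case "day_of_week", "day_of_month", "day_of_year", "days_in_month":
			continue // exclude time-based functions from generated queries
		}
		enabled = append(enabled, f)
	}
	return promqlsmith.New(rnd, seriesSet, promqlsmith.WithEnabledFunctions(enabled))
}
```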
@alanprot
I haven't found the root cause yet.
I suspect the identical values are the problem. I tested the query `bottomk by (job) (pi(), day_of_year(-{__name__="test_series_a"} offset 1m17s))` after changing the values so that day_of_year returns a random value (1~366). Then there is a high chance of success.
Btw, it sounds good to create a fuzz test dedicated to those functions.
FYI, I changed https://github.com/cortexproject/cortex/blob/master/integration/e2e/util.go#L290 to `ref, err = app.Append(ref, series[i], start, float64(time.Hour*4114*time.Duration(i+j+1)))`.
(force-pushed: 84c1ffe → 246d811)
(force-pushed: 87ab76d → 9b9e267)
alanprot left a comment:
Maybe we can change this a bit to still compare all the results and just ignore the series, as we know that even when the results are the same, the series returned can be random? (A sketch of this appears after the thread below.)
@alanprot

Hum.. interesting... I'm trying to make sense of it: do you know why one of the results does not have labels? But it seems the problem is that the order of the results is not guaranteed (as the result is probably ordered by series labels?).

Yeah, the sample is the same if the two series are the same.
Yes, there seems to be non-determinism in the query results.

I think I'm OK with merging as is, and we can maybe improve it after... let me try to rebase this.
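A hedged sketch of the label-agnostic comparison suggested above: collect the sample values from both results, sort them, and compare, ignoring which series each value belongs to. `sameValuesIgnoringSeries` is a hypothetical helper, not code from this PR:

```go
import (
	"slices"

	"github.com/prometheus/common/model"
)

// sameValuesIgnoringSeries (hypothetical): compare two instant-query results
// by their sorted sample values only, ignoring series labels.
func sameValuesIgnoringSeries(r1, r2 model.Vector) bool {
	if len(r1) != len(r2) {
		return false
	}
	v1 := make([]float64, 0, len(r1))
	v2 := make([]float64, 0, len(r2))
	for i := range r1 {
		v1 = append(v1, float64(r1[i].Value))
		v2 = append(v2, float64(r2[i].Value))
	}
	slices.Sort(v1)
	slices.Sort(v2)
	return slices.Equal(v1, v2)
}
```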
Signed-off-by: SungJin1212 <[email protected]>
(force-pushed: 9b9e267 → 9cee5a8)
This PR improves the query fuzz test to skip non-deterministic queries. It prevents test failures caused by non-deterministic queries like `bottomk by (job) (pi(), days_in_month(-{__name__="test_series_a"} offset 1m17s))`.

Change log: skip non-deterministic queries containing `bottomk` and `topk`.

Which issue(s) this PR fixes:
Fixes #6323
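As a hedged illustration of how such a skip could work (not necessarily the exact check in this PR), one can walk the parsed PromQL expression and look for topk/bottomk aggregations; `isNonDeterministicQuery` is a hypothetical name:

```go
import "github.com/prometheus/prometheus/promql/parser"

// isNonDeterministicQuery (hypothetical): report whether a generated query
// contains a topk or bottomk aggregation.
func isNonDeterministicQuery(query string) bool {
	expr, err := parser.ParseExpr(query)
	if err != nil {
		return false
	}
	found := false
	parser.Inspect(expr, func(node parser.Node, _ []parser.Node) error {
		if agg, ok := node.(*parser.AggregateExpr); ok &&
			(agg.Op == parser.TOPK || agg.Op == parser.BOTTOMK) {
			found = true
		}
		return nil
	})
	return found
}
```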
Checklist
- CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]