[TEST] wait for all active shards when indexing data #121442

andreidan · 2025-01-31T16:46:26Z

This attempts to fix a flay test where the term_freq returned by the multiple terms vectors API was null.
I was not able to reproduce this test but this proposes a fix based on the following running theory:

an Elasticsearch cluster comprised of at least 2 nodes
we create a couple of indices with 1 primary and 1 replica
we index a document that was acknowledged only by the primary (because wait_for_active_shards defaults to 1)
the test executes the multiple terms vectors API and it hits the node hosting the replica shard, which hasn't yet received the document we ingested in the primary shard.

This race condition between the document replication and the test running the terms vectors API on the replica shard could yield a null value for the the term's term_freq (as the replica shard contains 0 documents).

This PR proposes we change the wait_for_active_shards value to all so each write is acknowledged by all replicas before the client receives the response.

Fixes #113325

This attempts to fix a flay test where the term_freq returned by the multiple terms vectors API was `null`. I was not able to reproduce this test but this proposes a fix based on the following running theory: - an Elasticsearch cluster comprised of at least 2 nodes - we create a couple of indices with 1 primary and 1 replica - we index a document that was acknowledged only by the primary (because `wait_for_active_shards` defaults to `1`) - the test executes the multiple terms vectors API and it hits the node hosting the replica shard, which hasn't yet received the document we ingested in the primary shard. This race condition between the document replication and the test running the terms vectors API on the replica shard could yield a `null` value for the the term's `term_freq` (as the replica shard contains 0 documents). This PR proposes we change the `wait_for_active_shards` value to `all` so each write is acknowledged by all replicas before the client receives the response.

elasticsearchmachine · 2025-01-31T16:47:03Z

Pinging @elastic/es-search-foundations (Team:Search Foundations)

piergm

Theory make sense to me! More so if it's not always failing but a flaky test 😄 Nice one Andrei! Let's hope the theory is correct 🤞

andreidan · 2025-02-03T08:59:16Z

@elasticmachine update branch

andreidan · 2025-02-03T14:12:24Z

@elasticmachine update branch

andreidan added >test-failure Triaged test failures from CI :Search Foundations/Search Catch all for Search Foundations v9.0.0 v8.19.0 v9.1.0 labels Jan 31, 2025

elasticsearchmachine added needs:risk Requires assignment of a risk label (low, medium, blocker) Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch labels Jan 31, 2025

piergm approved these changes Jan 31, 2025

View reviewed changes

Merge branch 'main' into fix-test-mtermvectors

2343745

andreidan added auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Feb 3, 2025

elasticsearchmachine added the needs:risk Requires assignment of a risk label (low, medium, blocker) label Feb 3, 2025

andreidan added low-risk An open issue or test failure that is a low risk to future releases and removed needs:risk Requires assignment of a risk label (low, medium, blocker) labels Feb 3, 2025

Merge branch 'main' into fix-test-mtermvectors

47a31ce

elasticsearchmachine merged commit 44e5104 into elastic:main Feb 3, 2025
17 checks passed

andreidan deleted the fix-test-mtermvectors branch February 3, 2025 18:57

andreidan added the auto-backport Automatically create backport pull requests when merged label Feb 3, 2025

This was referenced Feb 10, 2025

[9.0][TEST] wait for all active shards when indexing data #122163

Merged

[8.x][TEST] wait for all active shards when indexing data #122164

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[TEST] wait for all active shards when indexing data #121442

[TEST] wait for all active shards when indexing data #121442

Uh oh!

andreidan commented Jan 31, 2025

Uh oh!

elasticsearchmachine commented Jan 31, 2025

Uh oh!

piergm left a comment

Uh oh!

andreidan commented Feb 3, 2025

Uh oh!

andreidan commented Feb 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[TEST] wait for all active shards when indexing data #121442

[TEST] wait for all active shards when indexing data #121442

Uh oh!

Conversation

andreidan commented Jan 31, 2025

Uh oh!

elasticsearchmachine commented Jan 31, 2025

Uh oh!

piergm left a comment

Choose a reason for hiding this comment

Uh oh!

andreidan commented Feb 3, 2025

Uh oh!

andreidan commented Feb 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants