Conversation

@jimczi (Contributor) commented Mar 19, 2025

Follow up of #125103 that leverages scorer supplier to create queries optimised to run on top docs only.

@elasticsearchmachine (Collaborator) commented:

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@elasticsearchmachine elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Mar 19, 2025
@elasticsearchmachine (Collaborator) commented:

Hi @jimczi, I've created a changelog YAML for you.

subScorers.add(new FeatureDisiWrapper(scorer, featureNames.get(i)));
var scorerSupplier = weight.scorerSupplier(segmentContext);
if (scorerSupplier != null) {
    var scorer = scorerSupplier.get(0L);
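For context, the hunk above follows Lucene's two-step pattern: `Weight#scorerSupplier` may return null when nothing in the segment can match, and `ScorerSupplier#get(leadCost)` then builds the actual `Scorer`. A minimal self-contained sketch of that null-check-then-get flow, using hypothetical stand-in types (not the real Lucene classes):

```java
// Hypothetical stand-ins mirroring the Lucene Weight/ScorerSupplier contract;
// this is a sketch of the pattern in the hunk above, not Lucene code.
interface Scorer {}

interface ScorerSupplier {
    // leadCost: upper bound on how many docs the lead iterator will visit.
    Scorer get(long leadCost);
}

interface Weight {
    // May return null when no documents in the segment can match.
    ScorerSupplier scorerSupplier();
}

class ScorerSupplierDemo {
    // Mirrors the reviewed code: guard against a null supplier, then ask for
    // a scorer optimised for a tiny lead set by passing leadCost = 0.
    static Scorer createScorer(Weight weight) {
        ScorerSupplier supplier = weight.scorerSupplier();
        if (supplier != null) {
            return supplier.get(0L);
        }
        return null;
    }

    public static void main(String[] args) {
        Weight matching = () -> leadCost -> new Scorer() {};
        Weight empty = () -> null;
        System.out.println(createScorer(matching) != null); // true
        System.out.println(createScorer(empty) == null);    // true
    }
}
```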
A Member commented:

I don't understand this "leadCost" thing.

Reading the docs:

 * @param leadCost Cost of the scorer that will be used in order to lead iteration. This can be
 *     interpreted as an upper bound of the number of times that {@link DocIdSetIterator#nextDoc},
 *     {@link DocIdSetIterator#advance} and {@link TwoPhaseIterator#matches} will be called. Under
 *     doubt, pass {@link Long#MAX_VALUE}, which will produce a {@link Scorer} that has good
 *     iteration capabilities.

So, shouldn't this be the number of docs that we will actually score, e.g. the rank_window? But that number seems really small anyway. Maybe 0 is just fine here.

@jimczi (Contributor, Author) replied:

This helps optimize scorers based on the estimated number of matches, which is derived from the cost of the lead iterator in the query. For example, IndexOrDocValuesQuery uses this to decide between a points/term query and a doc values query. Setting the leading cost to 0 forces all queries to use a scorer optimized for a small set of selected documents (the top N). In the case of IndexOrDocValuesQuery, this means selecting the doc values query.
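As an illustration of that decision, here is a toy model of how a query in the style of IndexOrDocValuesQuery might pick a scorer from the lead cost: when leadCost is small relative to the query's own cost, checking doc values per candidate doc is cheaper than building the full index-based iterator. The threshold and names below are illustrative assumptions, not the real Lucene heuristic:

```java
// Toy model of the IndexOrDocValuesQuery-style choice; the comparison used
// here is an illustrative assumption, not Lucene's actual cost heuristic.
class LeadCostDemo {
    // Picks the doc-values strategy when the lead iterator is expected to
    // visit fewer docs than this query would match on its own.
    static String chooseScorer(long leadCost, long queryCost) {
        return leadCost < queryCost ? "docValues" : "points";
    }

    public static void main(String[] args) {
        // leadCost = 0 (scoring only the top N): prefer doc values.
        System.out.println(chooseScorer(0L, 1_000_000L));            // docValues
        // leadCost = Long.MAX_VALUE (full iteration): prefer the index.
        System.out.println(chooseScorer(Long.MAX_VALUE, 1_000_000L)); // points
    }
}
```

This matches the reply above: forcing leadCost to 0 steers every such query toward the scorer that is cheap to evaluate on a small selected set of documents.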

@jimczi jimczi added the auto-backport Automatically create backport pull requests when merged label Mar 20, 2025
@jimczi jimczi merged commit 22be0d9 into elastic:main Mar 20, 2025
17 checks passed
@jimczi jimczi deleted the query_feature_scorer_supplier branch March 20, 2025 11:22
@elasticsearchmachine (Collaborator) commented:

💚 Backport successful

Branch 8.x: success

elasticsearchmachine pushed a commit that referenced this pull request Mar 20, 2025
afoucret pushed a commit to afoucret/elasticsearch that referenced this pull request Mar 21, 2025
smalyshev pushed a commit to smalyshev/elasticsearch that referenced this pull request Mar 21, 2025
omricohenn pushed a commit to omricohenn/elasticsearch that referenced this pull request Mar 28, 2025

Labels

auto-backport (Automatically create backport pull requests when merged)
>enhancement
:Search Relevance/Ranking (Scoring, rescoring, rank evaluation.)
Team:Search Relevance (Meta label for the Search Relevance team in Elasticsearch)
v8.19.0
v9.1.0
