
Conversation

@iverase
Contributor

@iverase iverase commented Aug 12, 2025

This PR proposes changing the way we budget how many vectors we are going to score during search.

We currently budget by the number of centroids to search, which can be defined by a static value, aka n_probe, or computed dynamically if the value is -1. In this PR we change it so that we define a percentage of vectors we want to visit, which can be set statically or computed dynamically if the value is 0.

The motivation for this change is that centroids are not balanced: some centroids can contain many more vectors than others, so the search budget is not equal between runs. Using a percentage of vector operations is much easier to reason about.

The only complexity here is that we might visit documents more than once due to spilled assignments, so we need to make clear that this is not a percentage of the documents but a percentage of the vector operations (2 * numVectors).
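As an illustration, here is a minimal sketch of how a visit percentage maps to a budget of vector operations under the 2 * numVectors accounting described above (the class and method names are hypothetical, not the actual Elasticsearch code):

```java
// Hypothetical sketch: turn a visit percentage into a budget of vector
// operations. With spilled assignments a vector can be scored under more
// than one centroid, so the total operation count is 2 * numVectors.
public class VisitBudget {
    static long budget(long numVectors, float visitRatio) {
        return Math.round(visitRatio * 2.0 * numVectors);
    }

    public static void main(String[] args) {
        // A visit_percentage of 0.25 over 1M vectors allows up to
        // 500,000 vector scorings, regardless of centroid balance.
        System.out.println(budget(1_000_000, 0.25f)); // prints 500000
    }
}
```

Unlike n_probe, this budget is independent of how evenly vectors are distributed across centroids, which is the point of the change.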

Note that the configuration file for our checkVec utility needs to change: the "n_probe" entry needs to be replaced with "visit_percentage", like:

  "visit_percentage" : [0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5]

@elasticsearchmachine elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Aug 12, 2025
@elasticsearchmachine
Collaborator

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@benwtrent
Member

I will benchmark, but this seems like the right way to me.

@iverase
Contributor Author

iverase commented Aug 12, 2025

It feels good to me too. It is not perfect, but it feels like it can work this way.

Comment on lines 92 to 100
  if (Math.min(knnCollector.k(), floatVectorValues.size()) == 0) {
  if (floatVectorValues.size() == 0) {
      return NO_RESULTS;
  }
  KnnSearchStrategy strategy = searchStrategy;
  if (searchStrategy.getVisitRatio() == 0.0f) {
      // dynamically set the percentage
      float expected = (float) Math.round(1.75f * Math.log10(numCands) * Math.log10(numCands) * (numCands));
      float ratio = expected / floatVectorValues.size();
      strategy = new IVFKnnSearchStrategy(ratio);
  }
  KnnCollector knnCollector = knnCollectorManager.newCollector(visitedLimit, strategy, context);
  if (knnCollector == null) {
Member

@iverase you rightly called out that maybe we shouldn't do this.

What we SHOULD do is calculate this percentage across the sum of all segments (potentially having to adjust the auto calculation) and then push that down to each leaf.

Contributor Author

Done in ab36139. We might wait for the work on segment affinity before adjusting the ratio per leaf.

Contributor

I have that done in #132396 already and took a similar approach to yours here; I think it looks good. I'll merge over your changes from today shortly.

Contributor Author

@john-wagster I looked into what you did and copied over here 😉

Contributor

@john-wagster john-wagster left a comment

lgtm

Comment on lines 138 to 140
  // dynamically set the percentage
  float expected = (float) Math.round(1.75f * Math.log10(numCands) * Math.log10(numCands) * (numCands));
  visitRatio = expected / totalVectors;
Member

Need to benchmark this to make sure it's still sane.

Contributor Author

My feeling is that expected should depend on the number of documents; it still only depends on numCands.

Member

@iverase I think so too. Otherwise, we end up with a static scale in the numerator, no matter the number of vectors.

Contributor Author

I moved expected to depend on the number of vectors again:

  float expected = (float) Math.round(Math.log10(totalVectors) * Math.log10(totalVectors) * (numCands));
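For comparison, a standalone sketch of the two versions of the formula discussed in this thread (the class and method names are illustrative only, not the actual code):

```java
// Sketch contrasting the two versions of `expected` discussed above.
public class DynamicRatio {
    // Earlier version: depends only on numCands, so the numerator is a
    // fixed scale no matter how many vectors the segment holds.
    static float expectedOld(int numCands) {
        return (float) Math.round(1.75f * Math.log10(numCands) * Math.log10(numCands) * numCands);
    }

    // Final version: grows with the total number of vectors as well.
    static float expectedNew(long totalVectors, int numCands) {
        return (float) Math.round(Math.log10(totalVectors) * Math.log10(totalVectors) * numCands);
    }

    public static void main(String[] args) {
        // For 1M vectors and 100 candidates, expected is 6 * 6 * 100 = 3600,
        // so the dynamic visit ratio is 3600 / 1_000_000 = 0.0036,
        // and it shrinks as segments grow.
        System.out.println(expectedNew(1_000_000L, 100) / 1_000_000L);
    }
}
```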

Member

@benwtrent benwtrent left a comment

I tested over multiple data sets (single segment and multi). We end up with pretty good recall curves for different num_candidates and k.

It's obviously not the same as hnsw (just completely different scaling laws), but it's pretty close. I think this is good.

@iverase iverase merged commit 15ae296 into elastic:main Aug 14, 2025
32 of 33 checks passed
@iverase iverase deleted the visitRatio branch August 14, 2025 16:30
joshua-adams-1 pushed a commit to joshua-adams-1/elasticsearch that referenced this pull request Aug 15, 2025
…sit_percentage, related to the number of documents (elastic#132722)

This commit changes the way we budget how many vectors we are going to score during search.

Labels

>non-issue :Search Relevance/Vectors Vector search Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch v9.2.0


4 participants