docs/reference/mapping/types/dense-vector.asciidoc (2 additions & 2 deletions)
@@ -121,12 +121,12 @@ The three following quantization strategies are supported:
* `bbq` - experimental:[] Better binary quantization which reduces each dimension to a single bit precision. This reduces the memory footprint by 96% (or 32x) at a larger cost of accuracy. Generally, oversampling during query time and reranking can help mitigate the accuracy loss.

- When using a quantized format, you may want to oversample and rescore the results to improve accuracy. See <<dense-vector-knn-search-reranking, oversampling and rescoring>> for more information.
+ When using a quantized format, you may want to oversample and rescore the results to improve accuracy. See <<dense-vector-knn-search-rescoring, oversampling and rescoring>> for more information.

To use a quantized index, you can set your index type to `int8_hnsw`, `int4_hnsw`, or `bbq_hnsw`. When indexing `float` vectors, the current default
index type is `int8_hnsw`.

- Quantized vectors can use <<dense-vector-knn-search-reranking,oversampling and rescoring>> to improve accuracy on approximate kNN search results.
+ Quantized vectors can use <<dense-vector-knn-search-rescoring,oversampling and rescoring>> to improve accuracy on approximate kNN search results.

NOTE: Quantization will continue to keep the raw float vector values on disk for reranking, reindexing, and quantization improvements over the lifetime of the data.
This means disk usage will increase by ~25% for `int8`, ~12.5% for `int4`, and ~3.1% for `bbq` due to the overhead of storing the quantized and raw vectors.
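To make the `int8_hnsw`/`int4_hnsw`/`bbq_hnsw` options mentioned in this hunk concrete, here is a minimal mapping sketch (illustrative only, not part of this change); the index name, field name, and `dims` value are assumptions:

[source,console]
----
// Hypothetical names and dims; bbq_hnsw is the experimental quantized type described above.
PUT my-image-index
{
  "mappings": {
    "properties": {
      "image-vector": {
        "type": "dense_vector",
        "dims": 384,
        "index": true,
        "index_options": {
          "type": "bbq_hnsw"
        }
      }
    }
  }
}
----

Omitting `index_options` would fall back to the current default of `int8_hnsw` when indexing `float` vectors, as noted above.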
docs/reference/rest-api/common-parms.asciidoc (8 additions & 4 deletions)
@@ -1367,12 +1367,16 @@ tag::knn-rescore-vector[]
NOTE: Rescoring only makes sense for quantized vectors; when <<dense-vector-quantization,quantization>> is not used, the original vectors are used for scoring.
Rescore option will be ignored for non-quantized `dense_vector` fields.

- `num_candidates_factor`::
+ `oversample`::
(Required, float)

- Applies the specified oversample factor to the number of candidates on the approximate kNN search.
- The approximate kNN search will retrieve `num_candidates * num_candidates_factor` candidates per shard, and then use the original vectors for rescoring them.
+ Applies the specified oversample factor to `k` on the approximate kNN search.
+ The approximate kNN search will:
+
+ * Retrieve `num_candidates` candidates per shard.
+ * From these candidates, the top `k * oversample` candidates per shard will be rescored using the original vectors.
+ * The top `k` rescored candidates will be returned.

- See <<dense-vector-knn-search-reranking,oversampling and rescoring quantized vectors>> for details.
+ See <<dense-vector-knn-search-rescoring,oversampling and rescoring quantized vectors>> for details.
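As a quick illustration of the renamed parameter (not part of this change), here is a sketch of a kNN search that sets `rescore_vector.oversample`; the index name, field, and query vector are placeholders:

[source,console]
----
// Hypothetical request: with k=10 and oversample=2.0, the top 20 candidates per shard
// are rescored with the original vectors, and the top 10 are returned.
POST my-image-index/_search
{
  "knn": {
    "field": "image-vector",
    "query_vector": [0.12, -0.45, 0.91],
    "k": 10,
    "num_candidates": 100,
    "rescore_vector": {
      "oversample": 2.0
    }
  }
}
----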
docs/reference/search/search-your-data/knn-search.asciidoc (14 additions & 10 deletions)
@@ -1070,7 +1070,7 @@ the global top `k` matches across shards. You cannot set the
[discrete]
- [[dense-vector-knn-search-reranking]]
+ [[dense-vector-knn-search-rescoring]]
==== Oversampling and rescoring for quantized vectors

When using <<dense-vector-quantization,quantized vectors>> for kNN search, you can optionally rescore results to balance performance and accuracy, by doing:
@@ -1091,10 +1091,13 @@ Generally, we have found that:
* `bbq` requires rescoring except on exceptionally large indices or models specifically designed for quantization. We have found that between 3x-5x oversampling is generally sufficient. But for fewer dimensions or vectors that do not quantize well, higher oversampling may be required.

You can use the `rescore_vector` preview:[] option to automatically perform reranking.
- When a rescore `num_candidates_factor` parameter is specified, the approximate kNN search will retrieve the top `num_candidates * oversample` candidates per shard.
- It will then use the original vectors to rescore them, and return the top `k` results.
+ When a rescore `oversample` parameter is specified, the approximate kNN search will:
+
+ * Retrieve `num_candidates` candidates per shard.
+ * From these candidates, the top `k * oversample` candidates per shard will be rescored using the original vectors.
+ * The top `k` rescored candidates will be returned.

- Here is an example of using the `rescore_vector` option with the `num_candidates_factor` parameter:
+ Here is an example of using the `rescore_vector` option with the `oversample` parameter:

[source,console]
----
@@ -1106,7 +1109,7 @@ POST image-index/_search
    "k": 10,
    "num_candidates": 100,
    "rescore_vector": {
-       "num_candidates_factor": 2.0
+       "oversample": 2.0
    }
  },
  "fields": [ "title", "file-type" ]
@@ -1118,18 +1121,19 @@ POST image-index/_search
This example will:

- * Search using approximate kNN with `num_candidates` set to 200 (`num_candidates` * `num_candidates_factor`).
- * Rescore the top 200 candidates per shard using the original, non quantized vectors.
+ * Search using approximate kNN for the top 100 candidates.
+ * Rescore the top 20 candidates (`oversample` * `k`) per shard using the original, non quantized vectors.
+ * Return the top 10 (`k`) rescored candidates.
* Merge the rescored candidates from all shards, and return the top 10 (`k`) results.
0 commit comments