Test BBQ recall with different bit sizes #129729

carlosdelest · 2025-06-19T17:20:54Z

Related to #129421

Creates a new ES910BinaryQuantizedVectorsFormat that accepts variable bit sizes for indexing and querying.

KnnIndexTester and related classes have been modified to allow testing with different bit sizes. Testing is possible via the following configuration file:

{
  "doc_vectors" : "/path_to_vectors",
  "query_vectors" : "/path_to_queries",
  "num_docs" : 100000,
  "num_queries" : 1000,
  "index_type" : "flat",
  "over_sampling_factor" : 1,
  "search_threads" : 40,
  "index_threads" : 40,
  "reindex" : true,
  "force_merge" : true,
  "vector_space" : "cosine",
  "quantize_bits" : 1,
  "vector_encoding" : "float32",
  "dimensions" : 768,
  "use_new_flat_vectors_format" : true,
  "quantize_query_bits" : 4
}

Where:

quantize_bits determines the quantization for vectors at index time
quantize_query_bits determines the quantization for queries
use_new_flat_vectors_format: true to allow testing the variable bit sizes, false to fall back to the previous vector format (useful for checking the baseline with quantize_bits: 1 and quantize_query_bits: 4)
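
To compare several bit sizes in one run, one config file per combination is needed. The following is a minimal sketch of how such files could be produced. It is not the generator added in this PR: the class name `BitSizeConfigSketch`, its methods, and the bit values in `main` are hypothetical and chosen only for illustration, and all other keys are copied from the example configuration above.

```java
import java.util.ArrayList;
import java.util.List;

class BitSizeConfigSketch {

    // Hypothetical helper: renders one config with the keys from the PR description.
    // Only the two bit-size values vary; everything else is copied from the example.
    static String config(int quantizeBits, int quantizeQueryBits) {
        return String.join("\n",
            "{",
            "  \"doc_vectors\" : \"/path_to_vectors\",",
            "  \"query_vectors\" : \"/path_to_queries\",",
            "  \"num_docs\" : 100000,",
            "  \"num_queries\" : 1000,",
            "  \"index_type\" : \"flat\",",
            "  \"over_sampling_factor\" : 1,",
            "  \"search_threads\" : 40,",
            "  \"index_threads\" : 40,",
            "  \"reindex\" : true,",
            "  \"force_merge\" : true,",
            "  \"vector_space\" : \"cosine\",",
            "  \"quantize_bits\" : " + quantizeBits + ",",
            "  \"vector_encoding\" : \"float32\",",
            "  \"dimensions\" : 768,",
            "  \"use_new_flat_vectors_format\" : true,",
            "  \"quantize_query_bits\" : " + quantizeQueryBits,
            "}"
        );
    }

    // One config per (index bits, query bits) pair, index bits varying slowest.
    static List<String> combinations(int[] indexBits, int[] queryBits) {
        List<String> configs = new ArrayList<>();
        for (int ib : indexBits) {
            for (int qb : queryBits) {
                configs.add(config(ib, qb));
            }
        }
        return configs;
    }

    public static void main(String[] args) {
        // Illustrative values only; which bit sizes the format accepts is not stated here.
        for (String c : combinations(new int[] {1, 2, 4}, new int[] {4, 8})) {
            System.out.println(c);
        }
    }
}
```

Each printed object can be saved to its own file and passed to KnnIndexTester in the same way as the single configuration above.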

benwtrent · 2025-07-01T16:36:47Z

.../src/main/java/org/elasticsearch/index/codec/vectors/es910/OffHeapBinarizedVectorValues.java

+        this.byteBuffer = ByteBuffer.allocate(dimension);
+        this.binaryValue = byteBuffer.array();
+        this.binaryQuantizer = quantizer;
+        this.discretizedDimensions = BQVectorUtils.discretize(dimension, 64);


This doesn't seem needed; you are just storing the raw bytes.

benwtrent · 2025-07-01T16:38:13Z

...main/java/org/elasticsearch/index/codec/vectors/es910/ES910BinaryQuantizedVectorsWriter.java

+        OffHeapBinarizedQueryVectorValues(IndexInput data, int dimension, int size) {
+            this.slice = data;
+            this.dimension = dimension;
+            this.size = size;
+            // 4x the quantized binary dimensions
+            int binaryDimensions = (BQVectorUtils.discretize(dimension, 64) / 8) * BQSpaceUtils.B_QUERY;
+            this.byteBuffer = ByteBuffer.allocate(binaryDimensions);
+            this.binaryValue = byteBuffer.array();
+            // + 1 for the quantized sum


I am not sure if this is used, but it's incorrect: you aren't packing the bits, so binaryDimensions is just dimensions.
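
The size difference behind this comment can be checked with a little arithmetic. The sketch below is an illustration, not code from the PR: `QueryBufferSizeSketch` and its methods are hypothetical, and the local `discretize` is a stand-in that assumes `BQVectorUtils.discretize(value, bucket)` rounds `value` up to the next multiple of `bucket`. The packed formula mirrors the line in the diff above; the raw size assumes one byte per dimension, as the review comment describes.

```java
class QueryBufferSizeSketch {

    // Stand-in for BQVectorUtils.discretize (assumed behavior):
    // round value up to the next multiple of bucket.
    static int discretize(int value, int bucket) {
        return ((value + bucket - 1) / bucket) * bucket;
    }

    // Buffer size when each dimension's `bits`-bit value is bit-packed,
    // matching (discretize(dimension, 64) / 8) * B_QUERY in the diff.
    static int packedBytes(int dimension, int bits) {
        return (discretize(dimension, 64) / 8) * bits;
    }

    // Buffer size when each dimension is stored as one unpacked byte.
    static int rawBytes(int dimension) {
        return dimension;
    }

    public static void main(String[] args) {
        int dimension = 768; // from the example configuration
        int queryBits = 4;   // "4x the quantized binary dimensions" in the diff comment
        System.out.println("packed bytes: " + packedBytes(dimension, queryBits)); // 384
        System.out.println("raw bytes:    " + rawBytes(dimension));               // 768
    }
}
```

For 768 dimensions and 4 query bits, the packed formula allocates 384 bytes while unpacked storage needs 768, so a buffer sized with the packed formula would be too small for unpacked values.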

carlosdelest added 4 commits June 17, 2025 18:54

First version

688ea6d

Add CLI args for KnnIndexTester

c4c2218

Set correctly number of bits for query and index

7496e6a

Add suffix for index bits

1add01b

elasticsearchmachine added v9.1.0 v9.2.0 and removed v9.1.0 labels Jun 19, 2025

Merge remote-tracking branch 'origin/main' into non-issue/bbq-multipl…

2e56e90

…e-quantization-bits # Conflicts: # qa/vector/src/main/java/org/elasticsearch/test/knn/CmdLineArgs.java # server/src/main/java/org/elasticsearch/index/mapper/vectors/DenseVectorFieldMapper.java

benwtrent reviewed Jul 1, 2025

carlosdelest and others added 9 commits July 1, 2025 18:53

Fix merge

7a9015a

Don't use discretize to calculate dimensions, removed quantized dimen…

81b5409

…sions

Use bit scale for both index and queries

5108d96

Fix tests

98469e9

[CI] Auto commit changes from spotless

49a4c6c

Fix setting the query bits / index bits statically and printing them out

b3dab7e

Add a JSON config generator to generate JSON config files for multipl…

73566aa

…e option combinations

Merge remote-tracking branch 'carlosdelest/non-issue/bbq-multiple-qua…

b045209

…ntization-bits' into non-issue/bbq-multiple-quantization-bits

[CI] Auto commit changes from spotless

fbe28d9

elasticsearchmachine added v9.3.0 and removed v9.2.0 labels Oct 2, 2025
