[DiskBBQ] Quantize centroids using 7 bits instead of 4 bits #132261

iverase · 2025-07-31T11:28:35Z

We have introduced a Int7 vectors scorer that shows it is faster that the int4 scorer in the microbenchmarks benchmarks. In addition it should provide more accurate results. The only trade off is that it requires two more bytes per vector so it will increase slightly the serialization size.

All in all, it is a better trade off so let's move our quantization strategy for centroids from int 4 to int 7.

elasticsearchmachine · 2025-07-31T11:28:59Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

john-wagster

lgtm

* upstream/main: (822 commits) Improve Semantic Text Exists Query Tests (elastic#132283) Make hierarchical k-means over centroids cheaper (elastic#132316) Remove unnecessary listener.delegateFailure in IndexShard#ensureMutable (elastic#132294) Add missing release note (elastic#132319) Unmute elastic#131803 (elastic#132295) Include bytes for live docs in ShardFieldStats (elastic#132232) Fix default missing index sort value of data_nanos pre 7.14 (elastic#132162) [DiskBBQ] Quantize centroids using 7 bits instead of 4 bits (elastic#132261) Use panamized version for windows in Int7VectorScorer (elastic#132311) Mute org.elasticsearch.xpack.ml.integration.AutodetectMemoryLimitIT testTooManyByAndOverFields elastic#132310 Mute org.elasticsearch.xpack.ml.integration.AutodetectMemoryLimitIT testManyDistinctOverFields elastic#132308 Update 8.17 version to 8.17.10 (elastic#132303) Mute org.elasticsearch.datastreams.DataStreamsClientYamlTestSuiteIT test {p0=data_stream/10_basic/Create hidden data stream with match all template} elastic#132298 Add random queries to logsdb data generation tests (elastic#132109) ES|QL Inference runner refactoring (elastic#131986) Add basic example to linear-retriever.md (elastic#132196) Refactor RemoteClusterService to be multi-project aware (elastic#131894) ESQL: Mark csv-spec tests (elastic#132098) Mute org.elasticsearch.common.logging.JULBridgeTests testThrowable elastic#132280 Bump versions after 8.19.0 release ...

iverase added 2 commits July 31, 2025 12:23

[DiskBBQ] Quantizize centroids using 7 bits instead of 4 bits

2d74e36

spotless

5dbe8c9

iverase requested review from benwtrent and john-wagster July 31, 2025 11:28

iverase added >non-issue :Search Relevance/Vectors Vector search v9.2.0 labels Jul 31, 2025

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Jul 31, 2025

iverase changed the title ~~[DiskBBQ] Quantizize centroids using 7 bits instead of 4 bits~~ [DiskBBQ] Quantize centroids using 7 bits instead of 4 bits Jul 31, 2025

john-wagster approved these changes Jul 31, 2025

View reviewed changes

iverase merged commit e087048 into elastic:main Aug 1, 2025
33 checks passed

iverase deleted the int7DiskBBQ branch August 1, 2025 09:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DiskBBQ] Quantize centroids using 7 bits instead of 4 bits #132261

[DiskBBQ] Quantize centroids using 7 bits instead of 4 bits #132261

Uh oh!

iverase commented Jul 31, 2025

Uh oh!

elasticsearchmachine commented Jul 31, 2025

Uh oh!

john-wagster left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[DiskBBQ] Quantize centroids using 7 bits instead of 4 bits #132261

[DiskBBQ] Quantize centroids using 7 bits instead of 4 bits #132261

Uh oh!

Conversation

iverase commented Jul 31, 2025

Uh oh!

elasticsearchmachine commented Jul 31, 2025

Uh oh!

john-wagster left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants