Adding new experimental bbq index types #114439

benwtrent · 2024-10-09T17:49:01Z

new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties.

docs/reference/search/search-your-data/knn-search.asciidoc

mayya-sharipova · 2024-10-11T18:34:41Z

docs/reference/search/search-your-data/knn-search.asciidoc

+        "knn": { <2>
+          "query_vector": [0.04283529, 0.85670587, -0.51402352, 0],
+          "field": "my_int4_vector",
+          "num_candidates": 20 <3>


We can use "k" instead of num_candidates to retrieve 20 results.

I can, but I would just set it to k. I don't see the value.

docs/reference/search/search-your-data/knn-search.asciidoc

mayya-sharipova

Accept all the math in scorers which I don't completely follow, the rest LGTM.

Thanks @benwtrent. Great addition!

…ex-types

elasticsearchmachine · 2024-10-14T16:41:25Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

elasticsearchmachine · 2024-10-14T18:57:32Z

Hi @benwtrent, I've created a changelog YAML for you.

benwtrent · 2024-10-14T20:19:35Z

@elasticmachine update branch

elasticsearchmachine · 2024-10-15T00:14:32Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 114439

benwtrent · 2024-10-15T00:25:19Z

Did a larger scale benchmark over 134,705,698 1024 Coherev3 vectors on a single 64 GB node (about 32GB of off-heap allocated).

This was done with rally track: https://github.com/elastic/rally-tracks/tree/master/msmarco-v2-vector

I would expect rescoring per shard to have a slightly bigger latency hit, but better recall.

Took about 12hrs to ingest with aggressive merging enabled.

fmt: knn-recall-k-num_candidates-number_rescored

Here are some of the results:

Shows that oversampling and rescoring helps, but can hurt qps.
knn-recall-10-20_no_rescore
recall: 0.45
Avg Nodes Visited: 15,915.211
78% best ndgc
Single client latency: 9ms
Multi-client QPS: 1,134.649

knn-recall-10-100-50.
Its interesting that even at 5x oversampling, its latency isn't much worse and at 10x num candidates, it only visits 2x more vectors.
Recall: 0.704
Avg Nodes Visited: 36,079.801
90% of best NDGC
Multi-client QPS: 451.596 (30ms latency)
Single client latency: 18ms

knn-recall-10-1000-200: its pretty neat to see that even visiting >100k vectors over multiple segments has 42ms latency (we ain't even done optimizing this stuff yet).
recall: 0.895
AvgNodesVisited: 115,598.117
97% best ndgc
Single client latency: 42.534ms
Multi-client QPS: 167.806 (93ms)

benwtrent · 2024-10-15T00:30:04Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties. (cherry picked from commit 6c752ab)

new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties.

…4783) * Adding new bbq index types behind a feature flag (#114439) new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties. (cherry picked from commit 6c752ab) * spotless

new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties.

flobernd · 2025-02-03T13:08:33Z

Hi @benwtrent, would it be possible to add these types to the specification?

The types are currently missing in the client and there is no documentation available for them either.

Happy to add the new enum members myself, if somebody could provide a proper documentation string 🙂

Adding new bbq index types behind a feature flag

7649ff8

benwtrent added >non-issue cloud-deploy Publish cloud docker image for Cloud-First-Testing :Search Relevance/Vectors Vector search v8.16.0 v9.0.0 labels Oct 9, 2024

benwtrent added 3 commits October 9, 2024 16:28

adding tests, correcting scoring

1f857c9

adding docs

0c311aa

fixing docs

2c66653

mayya-sharipova reviewed Oct 11, 2024

View reviewed changes

docs/reference/search/search-your-data/knn-search.asciidoc Show resolved Hide resolved

mayya-sharipova reviewed Oct 11, 2024

View reviewed changes

docs/reference/search/search-your-data/knn-search.asciidoc Outdated Show resolved Hide resolved

mayya-sharipova approved these changes Oct 11, 2024

View reviewed changes

benwtrent added 2 commits October 14, 2024 12:11

Merge remote-tracking branch 'upstream/main' into feature/add-bbq-ind…

9ecd815

…ex-types

addressing PR comments, adding experimental tags

b33ee9b

benwtrent marked this pull request as ready for review October 14, 2024 16:41

elasticsearchmachine added the Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch label Oct 14, 2024

benwtrent added >feature and removed >non-issue labels Oct 14, 2024

Update docs/changelog/114439.yaml

70f3c44

Merge branch 'main' into feature/add-bbq-index-types

13f15f3

benwtrent added the auto-backport Automatically create backport pull requests when merged label Oct 15, 2024

benwtrent merged commit 6c752ab into elastic:main Oct 15, 2024
17 checks passed

elasticsearchmachine added the backport pending label Oct 15, 2024

benwtrent deleted the feature/add-bbq-index-types branch October 15, 2024 00:26

benwtrent mentioned this pull request Oct 15, 2024

[8.x] Adding new bbq index types behind a feature flag (#114439) #114783

Merged

davidkyle pushed a commit that referenced this pull request Oct 15, 2024

Adding new bbq index types behind a feature flag (#114439)

b6c90d8

new index types of bbq_hnsw and bbq_flat which utilize the better binary quantization formats. A 32x reduction in memory, with nice recall properties.

benwtrent removed the backport pending label Oct 15, 2024

benwtrent changed the title ~~Adding new bbq index types behind a feature flag~~ Adding new experimental bbq index types Oct 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding new experimental bbq index types #114439

Adding new experimental bbq index types #114439

Uh oh!

benwtrent commented Oct 9, 2024 •

edited

Loading

Uh oh!

Uh oh!

mayya-sharipova Oct 11, 2024

Uh oh!

benwtrent Oct 14, 2024

Uh oh!

Uh oh!

mayya-sharipova left a comment •

edited

Loading

Uh oh!

elasticsearchmachine commented Oct 14, 2024

Uh oh!

elasticsearchmachine commented Oct 14, 2024

Uh oh!

benwtrent commented Oct 14, 2024

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 15, 2024

Uh oh!

benwtrent commented Oct 15, 2024

Uh oh!

benwtrent commented Oct 15, 2024

Uh oh!

flobernd commented Feb 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Adding new experimental bbq index types #114439

Adding new experimental bbq index types #114439

Uh oh!

Conversation

benwtrent commented Oct 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mayya-sharipova Oct 11, 2024

Choose a reason for hiding this comment

Uh oh!

benwtrent Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mayya-sharipova left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

elasticsearchmachine commented Oct 14, 2024

Uh oh!

elasticsearchmachine commented Oct 14, 2024

Uh oh!

benwtrent commented Oct 14, 2024

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 15, 2024

💔 Backport failed

Uh oh!

benwtrent commented Oct 15, 2024

Uh oh!

benwtrent commented Oct 15, 2024

💚 All backports created successfully

Questions ?

Uh oh!

flobernd commented Feb 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

benwtrent commented Oct 9, 2024 •

edited

Loading

mayya-sharipova left a comment •

edited

Loading