@iverase (Contributor) commented Sep 1, 2025

We currently use a fixed oversampling of 20% of the total number of centroids. This works well for a low or medium number of centroids but becomes quite slow as the number of centroids increases. Now that we have introduced a visit ratio, we can make the oversampling depend on it and oversample the number of centroids we expect to visit by a constant factor.

Local experiments show no loss of recall and a nice speed-up when the number of centroids is high.
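
To make the difference concrete, here is a rough sketch of the two budgets described above. It is not the code from this change: the class name, the method names, and the factor of 2 are assumptions made for illustration, and only the 20% figure comes from the description.

```java
// Illustrative sketch only; names and the oversample factor are hypothetical,
// not taken from the Elasticsearch source.
final class CentroidOversampling {

    // Assumed constant multiplier; the description does not state the real value.
    static final int ASSUMED_OVERSAMPLE_FACTOR = 2;

    // Previous behaviour as described: a fixed 20% of all centroids.
    static int fixedBudget(int numCentroids) {
        return Math.max(1, numCentroids / 5);
    }

    // New behaviour as described: oversample the number of centroids we expect
    // to visit (visitRatio * numCentroids) by a constant factor, never asking
    // for more centroids than exist and never for fewer than one.
    static int ratioBudget(int numCentroids, double visitRatio) {
        long expectedVisited = Math.round(visitRatio * numCentroids);
        long budget = ASSUMED_OVERSAMPLE_FACTOR * expectedVisited;
        return (int) Math.max(1L, Math.min((long) numCentroids, budget));
    }

    public static void main(String[] args) {
        int numCentroids = 100_000;
        double visitRatio = 0.01;
        System.out.println("fixed budget: " + fixedBudget(numCentroids));
        System.out.println("ratio budget: " + ratioBudget(numCentroids, visitRatio));
    }
}
```

With many centroids and a small visit ratio, the ratio-based budget stays proportional to the work we actually intend to do, whereas the fixed budget keeps growing with the total number of centroids.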

@elasticsearchmachine added the Team:Search Relevance label Sep 1, 2025
@elasticsearchmachine (Collaborator)

Pinging @elastic/es-search-relevance (Team:Search Relevance)

@benwtrent self-requested a review September 2, 2025 13:04
@iverase merged commit 0080d2c into elastic:main Sep 2, 2025
33 checks passed
@iverase deleted the centroid_versample branch September 2, 2025 15:05
phananh1010 added a commit to phananh1010/elasticsearch that referenced this pull request Sep 11, 2025
BASE=3c264cff967750ff7cb581defa42b0d69bb31fc6
HEAD=0e24fc739f9daa5b003d20fb0fffaf197449db67
Branch=main
Labels

>non-issue, :Search Relevance/Vectors, Team:Search Relevance, v9.2.0

3 participants