Skip to content

Conversation

ChrisHegarty
Copy link
Contributor

@ChrisHegarty ChrisHegarty commented Sep 21, 2024

Upgrade to Lucene 9.12

This PR will be squash-merged, so as to simplify the backport to the 8.x branch.

benwtrent and others added 30 commits May 30, 2024 12:58
…torScorerFactoryTests (#109303)

This commit adds a couple of missing supported checks to org.elasticsearch.vec.VectorScorerFactoryTests. Without these the test try, and fail, to run on Windows.
Updated LuceneDocument to take advantage of looking up feature values on existing features and selecting the max when parsing multi-value sparse vectors
Just doing the minimal fixes to making this pass tests again and
compile. One trivial constructor adjustment and a few adjustments to
task count expectations in some tests from having Lucene fork less.
@ChrisHegarty
Copy link
Contributor Author

Lucene 9.12.0 has been released. This PR is ready of final review. I cannot officially review in GH, but it LGTM.

@ChrisHegarty ChrisHegarty changed the title Upgrade to Lucene 9.12 Upgrade to Lucene 9.12.0 Sep 28, 2024
@benwtrent
Copy link
Member

My only slight concern is the flat vector formats. Now Lucene 9.12 treats them like exposed normal codecs we need to make sure there aren't any weird name conflicts, etc. with our flat formats.

I don't think there should be since ours have a different name. Just wanting to be sure.

Copy link
Member

@javanna javanna left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left a note on a TODO, I missed it when reviewing the corresponding PR. We can also address this post-merge. LGTM otherwise.

@ChrisHegarty ChrisHegarty added the auto-backport Automatically create backport pull requests when merged label Oct 1, 2024
@ChrisHegarty ChrisHegarty merged commit 32dde26 into main Oct 1, 2024
27 checks passed
@ChrisHegarty ChrisHegarty deleted the lucene_snapshot_9_12 branch October 1, 2024 07:39
ChrisHegarty added a commit that referenced this pull request Oct 1, 2024
This commit upgrades to Lucene 9.12.0.

Co-authored-by: Adrien Grand <[email protected]>
Co-authored-by: Armin Braun <[email protected]>
Co-authored-by: Benjamin Trent <[email protected]>
Co-authored-by: Chris Hegarty <[email protected]>
Co-authored-by: John Wagster <[email protected]>
Co-authored-by: Luca Cavanna <[email protected]>
Co-authored-by: Mayya Sharipova <[email protected]>
@ChrisHegarty ChrisHegarty removed backport pending auto-backport Automatically create backport pull requests when merged labels Oct 1, 2024
ChrisHegarty added a commit that referenced this pull request Oct 1, 2024
This commit upgrades to Lucene 9.12.0.

Co-authored-by: Adrien Grand <[email protected]>
Co-authored-by: Armin Braun <[email protected]>
Co-authored-by: Benjamin Trent <[email protected]>
Co-authored-by: John Wagster <[email protected]>
Co-authored-by: Luca Cavanna <[email protected]>
Co-authored-by: Mayya Sharipova <[email protected]>
@elastic elastic deleted a comment from elasticsearchmachine Oct 1, 2024
matthewabbott pushed a commit to matthewabbott/elasticsearch that referenced this pull request Oct 4, 2024
This commit upgrades to Lucene 9.12.0.

Co-authored-by: Adrien Grand <[email protected]>
Co-authored-by: Armin Braun <[email protected]>
Co-authored-by: Benjamin Trent <[email protected]>
Co-authored-by: Chris Hegarty <[email protected]>
Co-authored-by: John Wagster <[email protected]>
Co-authored-by: Luca Cavanna <[email protected]>
Co-authored-by: Mayya Sharipova <[email protected]>
elasticsearchmachine pushed a commit that referenced this pull request Oct 8, 2024
This copies the work from apache/lucene#13651
into Elasticsearch. The main reason for the copy is to simply allow it
to be deployed & used in Elasticsearch prior to Elasticsearch upgrading
to Lucene 10. At which point, we will then use the format as provided by
Lucene.

This is currently blocked by two pieces of work:

blocked by: #112933 blocked
by: #113333

After the format is merged, then code will be added for integration
tests & integration with Elasticsearch through new index format types in
the API.
benwtrent added a commit to benwtrent/elasticsearch that referenced this pull request Oct 8, 2024
…ic#113491)

This copies the work from apache/lucene#13651
into Elasticsearch. The main reason for the copy is to simply allow it
to be deployed & used in Elasticsearch prior to Elasticsearch upgrading
to Lucene 10. At which point, we will then use the format as provided by
Lucene.

This is currently blocked by two pieces of work:

blocked by: elastic#112933 blocked
by: elastic#113333

After the format is merged, then code will be added for integration
tests & integration with Elasticsearch through new index format types in
the API.
elasticsearchmachine pushed a commit that referenced this pull request Oct 8, 2024
…) (#114362)

This copies the work from apache/lucene#13651
into Elasticsearch. The main reason for the copy is to simply allow it
to be deployed & used in Elasticsearch prior to Elasticsearch upgrading
to Lucene 10. At which point, we will then use the format as provided by
Lucene.

This is currently blocked by two pieces of work:

blocked by: #112933 blocked
by: #113333

After the format is merged, then code will be added for integration
tests & integration with Elasticsearch through new index format types in
the API.

Co-authored-by: Elastic Machine <[email protected]>
matthewabbott pushed a commit to matthewabbott/elasticsearch that referenced this pull request Oct 10, 2024
…ic#113491)

This copies the work from apache/lucene#13651
into Elasticsearch. The main reason for the copy is to simply allow it
to be deployed & used in Elasticsearch prior to Elasticsearch upgrading
to Lucene 10. At which point, we will then use the format as provided by
Lucene.

This is currently blocked by two pieces of work:

blocked by: elastic#112933 blocked
by: elastic#113333

After the format is merged, then code will be added for integration
tests & integration with Elasticsearch through new index format types in
the API.
davidkyle pushed a commit to davidkyle/elasticsearch that referenced this pull request Oct 13, 2024
…ic#113491)

This copies the work from apache/lucene#13651
into Elasticsearch. The main reason for the copy is to simply allow it
to be deployed & used in Elasticsearch prior to Elasticsearch upgrading
to Lucene 10. At which point, we will then use the format as provided by
Lucene.

This is currently blocked by two pieces of work:

blocked by: elastic#112933 blocked
by: elastic#113333

After the format is merged, then code will be added for integration
tests & integration with Elasticsearch through new index format types in
the API.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

:Search/Search Search-related issues that do not fall into other categories Team:Search Meta label for search team test-arm Pull Requests that should be tested against arm agents test-full-bwc Trigger full BWC version matrix tests test-update-serverless test-windows Trigger CI checks on Windows >upgrade v8.16.0 v9.0.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants