Commit e47fff1

Merge branch 'main' into version-sets
2 parents 131a81d + ff3950d commit e47fff1

101 files changed

+3101
-2598
lines changed

Lines changed: 21 additions & 0 deletions
@@ -0,0 +1,21 @@
+---
+name: Comment on PR for .asciidoc changes
+
+on:
+  # We need to use pull_request_target to be able to comment on PRs from forks
+  pull_request_target:
+    types:
+      - synchronize
+      - opened
+      - reopened
+    branches:
+      - main
+      - master
+      - "9.0"
+
+jobs:
+  comment-on-asciidoc-change:
+    permissions:
+      contents: read
+      pull-requests: write
+    uses: elastic/docs-builder/.github/workflows/comment-on-asciidoc-changes.yml@main

branches.json

Lines changed: 3 additions & 0 deletions
@@ -10,6 +10,9 @@
     {
       "branch": "9.0"
     },
+    {
+      "branch": "8.18"
+    },
     {
       "branch": "8.17"
     },

build-tools-internal/version.properties

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 elasticsearch = 9.0.0
-lucene = 10.0.0
+lucene = 10.1.0
 
 bundled_jdk_vendor = openjdk
 bundled_jdk = 23+37@3c5b90190c68498b986a97f276efd28a

docs/Versions.asciidoc

Lines changed: 2 additions & 2 deletions
@@ -1,8 +1,8 @@
 
 include::{docs-root}/shared/versions/stack/{source_branch}.asciidoc[]
 
-:lucene_version: 10.0.0
-:lucene_version_path: 10_0_0
+:lucene_version: 10.1.0
+:lucene_version_path: 10_1_0
 :jdk: 11.0.2
 :jdk_major: 11
 :build_type: tar

docs/changelog/119308.yaml

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+pr: 119308
+summary: Upgrade to Lucene 10.1.0
+area: Search
+type: upgrade
+issues: []

docs/changelog/121193.yaml

Lines changed: 18 additions & 0 deletions
@@ -0,0 +1,18 @@
+pr: 121193
+summary: Enable LOOKUP JOIN in non-snapshot builds
+area: ES|QL
+type: enhancement
+issues:
+ - 121185
+highlight:
+  title: Enable LOOKUP JOIN in non-snapshot builds
+  body: |-
+    This effectively releases LOOKUP JOIN into tech preview. Docs will
+    follow in a separate PR.
+
+    - Enable the lexing/grammar for LOOKUP JOIN in non-snapshot builds.
+    - Remove the grammar for the unsupported `| JOIN ...` command (without `LOOKUP` as first keyword). The way the lexer modes work, otherwise we'd also have to enable `| JOIN ...` syntax on non-snapshot builds and would have to add additional validation to provide appropriate error messages.
+    - Remove grammar for `LOOKUP JOIN index AS ...` because qualifiers are not yet supported. Otherwise we'd have to put in additional validation as well to prevent such queries.
+
+    Also fix https://github.com/elastic/elasticsearch/issues/121185
+  notable: true

docs/reference/search/profile.asciidoc

Lines changed: 6 additions & 6 deletions
@@ -176,7 +176,7 @@ The API returns the following result:
             "time_in_nanos": 775274,
             "children" : [
               {
-                "name": "SimpleTopScoreDocCollector",
+                "name": "TopScoreDocCollector",
                 "reason": "search_top_hits",
                 "time_in_nanos": 775274
               }
@@ -537,7 +537,7 @@ Looking at the previous example:
     "time_in_nanos": 775274,
     "children" : [
       {
-        "name": "SimpleTopScoreDocCollector",
+        "name": "TopScoreDocCollector",
         "reason": "search_top_hits",
         "time_in_nanos": 775274
       }
@@ -551,7 +551,7 @@ Looking at the previous example:
 
 
 We see a top-level collector named `QueryPhaseCollector` which holds a child
-`SimpleTopScoreDocCollector`. `SimpleTopScoreDocCollector` is the default
+`TopScoreDocCollector`. `TopScoreDocCollector` is the default
 "scoring and sorting" `Collector` used by {es}. The `reason` field attempts
 to give a plain English description of the class name. The `time_in_nanos`
 is similar to the time in the Query tree: a wall-clock time inclusive of all
@@ -751,7 +751,7 @@ The API returns the following result:
             "time_in_nanos": 1945072,
             "children": [
               {
-                "name": "SimpleTopScoreDocCollector",
+                "name": "TopScoreDocCollector",
                 "reason": "search_top_hits",
                 "time_in_nanos": 22577
               },
@@ -788,7 +788,7 @@ major portions of the query are represented:
 2. The second `TermQuery` (message:search) represents the `post_filter` query.
 
 The Collector tree is fairly straightforward, showing how a single
-QueryPhaseCollector that holds the normal scoring SimpleTopScoreDocCollector
+QueryPhaseCollector that holds the normal scoring TopScoreDocCollector
 used to collect top hits, as well as BucketCollectorWrapper to run all scoped
 aggregations.
 
@@ -1332,7 +1332,7 @@ One of the `dfs.knn` sections for a shard looks like the following:
     "rewrite_time" : 1275732,
     "collector" : [
       {
-        "name" : "SimpleTopScoreDocCollector",
+        "name" : "TopScoreDocCollector",
         "reason" : "search_top_hits",
         "time_in_nanos" : 17163
       }

docs/reference/search/search-your-data/semantic-text-hybrid-search

Lines changed: 15 additions & 29 deletions
@@ -113,6 +113,7 @@ POST _tasks/<task_id>/_cancel
 ==== Perform hybrid search
 
 After reindexing the data into the `semantic-embeddings` index, you can perform hybrid search by using <<rrf,reciprocal rank fusion (RRF)>>. RRF is a technique that merges the rankings from both semantic and lexical queries, giving more weight to results that rank high in either search. This ensures that the final results are balanced and relevant.
+To extract the most relevant fragments from the original text and query, you can use the <<highlighting,highlight parameter>>:
 
 [source,console]
 ------------------------------------------------------------
@@ -142,6 +143,13 @@ GET semantic-embeddings/_search
         }
       ]
     }
+  },
+  "highlight": {
+    "fields": {
+      "semantic_text": {
+        "number_of_fragments": 2 <5>
+      }
+    }
   }
 }
 ------------------------------------------------------------
@@ -150,7 +158,7 @@ GET semantic-embeddings/_search
 <2> Lexical search is performed on the `content` field using the specified phrase.
 <3> The second `standard` retriever refers to the semantic search.
 <4> The `semantic_text` field is used to perform the semantic search.
-
+<5> Specifies the maximum number of fragments to return. See <<semantic-text-highlighting, semantic text highlighting>> for a more complete example.
 
 After performing the hybrid search, the query will return the top 10 documents that match both semantic and lexical search criteria. The results include detailed information about each document:
 
@@ -178,36 +186,14 @@ After performing the hybrid search, the query will return the top 10 documents t
         "_score": 0.032786883,
         "_rank": 1,
         "_source": {
-          "semantic_text": {
-            "inference": {
-              "inference_id": "my-elser-endpoint",
-              "model_settings": {
-                "task_type": "sparse_embedding"
-              },
-              "chunks": [
-                {
-                  "text": "What so many out there do not realize is the importance of what you do after you work out. You may have done the majority of the work, but how you treat your body in the minutes and hours after you exercise has a direct effect on muscle soreness, muscle strength and growth, and staying hydrated. Cool Down. After your last exercise, your workout is not over. The first thing you need to do is cool down. Even if running was all that you did, you still should do light cardio for a few minutes. This brings your heart rate down at a slow and steady pace, which helps you avoid feeling sick after a workout.",
-                  "embeddings": {
-                    "exercise": 1.571044,
-                    "after": 1.3603843,
-                    "sick": 1.3281639,
-                    "cool": 1.3227621,
-                    "muscle": 1.2645415,
-                    "sore": 1.2561599,
-                    "cooling": 1.2335974,
-                    "running": 1.1750668,
-                    "hours": 1.1104802,
-                    "out": 1.0991782,
-                    "##io": 1.0794281,
-                    "last": 1.0474665,
-                    (...)
-                  }
-                }
-              ]
-            }
-          },
           "id": 8408852,
           "content": "What so many out there do not realize is the importance of (...)"
+        },
+        "highlight" : {
+          "semantic_text" : [
+            "... fragment_1 ...",
+            "... fragment_2 ..."
+          ]
         }
       }
     ]
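
Reviewer note on the hybrid search hunks above: the doc text describes RRF as merging the lexical and semantic rankings. A minimal Python sketch of that fusion follows, assuming the commonly cited formula score(d) = sum over rankings of 1 / (rank_constant + rank(d)) with a rank constant of 60. The function name and the extra document IDs (101, 102, 103) are hypothetical and only illustrate the arithmetic; this is not Elasticsearch's implementation.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, rank_constant=60):
    """Fuse several best-first lists of document IDs into one scored list.

    Assumption: each document earns 1 / (rank_constant + rank) per list it
    appears in, with ranks starting at 1.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (rank_constant + rank)
    # Highest fused score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical result lists: document 8408852 ranks first in both retrievers.
lexical = [8408852, 101, 102]
semantic = [8408852, 102, 103]
fused = reciprocal_rank_fusion([lexical, semantic])
```

Under these assumptions the top document scores 1/61 + 1/61, roughly 0.0327869, which is consistent with the `_score` of 0.032786883 shown for the `_rank: 1` hit in the example response.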

docs/reference/troubleshooting/common-issues/disk-usage-exceeded.asciidoc

Lines changed: 5 additions & 3 deletions
@@ -127,12 +127,14 @@ its {cloud}/ec-api-console.html[Elasticsearch API Console] to later
 with this resolution flow on {ess}, kindly reach out to
 https://support.elastic.co[Elastic Support] for assistance.
 
-== Prevent watermark errors
+[discrete]
+[[fix-watermark-errors-prevent]]
+=== Prevent watermark errors
 
-To avoid watermark errors in future, , perform one of the following actions:
+To avoid watermark errors in future, perform one of the following actions:
 
 * If you're using {ess}, {ece}, or {eck}: Enable <<xpack-autoscaling,autoscaling>>.
 
 * Set up {kibana-ref}/kibana-alerts.html[stack monitoring alerts] on top of
 <<monitor-elasticsearch-cluster,{es} monitoring>> to be notified before
-the flood-stage watermark is reached.
+the flood-stage watermark is reached.
