
Conversation

prwhelan
Member

@prwhelan prwhelan commented Jul 22, 2025

Flag updates from Inference so Serverless can detect them.
Swap tests to set adaptive allocations rather than num_allocations so they pass in serverless.

@prwhelan prwhelan added the >test, :ml, Team:ML, and v9.2.0 labels Jul 22, 2025
@elasticsearchmachine elasticsearchmachine added the serverless-linked label Jul 22, 2025
@prwhelan prwhelan marked this pull request as ready for review July 22, 2025 18:55
@elasticsearchmachine
Collaborator

Pinging @elastic/ml-core (Team:ML)

Contributor

@jan-elastic jan-elastic left a comment


LGTM

@prwhelan prwhelan changed the title [ML] Use adaptive allocations in test [ML] Flag updates from Inference Jul 23, 2025
@prwhelan prwhelan requested a review from jan-elastic July 23, 2025 20:54

public void setFromInference(boolean fromInference) {
    this.fromInference = fromInference;
    this.isInternal = fromInference;
}
Contributor


It looks confusing that setFromInference also sets isInternal.

import org.elasticsearch.xpack.core.ml.utils.ExceptionsHelper;

import java.io.IOException;
import java.util.Objects;
Contributor


I'm missing a bit of context: why do we need to distinguish between these cases?

Contributor


Is there a corresponding Serverless PR?

Member Author


Yeah, let me ping you with the internal documentation

Member Author


> I'm missing a bit of context: why do we need to distinguish between these cases?

We need to allow updates to num_allocations in serverless that originate from the AdaptiveAllocationsScalerService (ADAPTIVE_ALLOCATIONS), but we want to disallow updates from users (API and INFERENCE). The only alternative I thought of was refactoring AdaptiveAllocationsScalerService to update directly rather than through the API, but that felt more intrusive.
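
For illustration, a rough sketch of the kind of serverless-side gate this enables (the validator and accessor names here are hypothetical; the real check lives in the internal Serverless codebase):

    // Hypothetical serverless-side validation enabled by the new Source flag:
    // autoscaler-driven updates to num_allocations pass, while user-originated
    // updates (REST API or Inference API) are rejected.
    void validateNumAllocationsUpdate(UpdateTrainedModelDeploymentAction.Request request) {
        if (request.getNumberOfAllocations() != null
            && request.getSource() != UpdateTrainedModelDeploymentAction.Source.ADAPTIVE_ALLOCATIONS) {
            throw new ElasticsearchStatusException(
                "[number_of_allocations] cannot be set in serverless",
                RestStatus.BAD_REQUEST
            );
        }
    }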

// we changed over from a boolean to an enum
// when it was a boolean, true came from adaptive allocations and false came from the rest api
// treat "inference" as if it came from the api
out.writeBoolean(isInternal());
Contributor


Do we need to determine if source == Source.ADAPTIVE_ALLOCATIONS here, since this will return true for Source.INFERENCE as well?

Member Author


Previously, we set the boolean to true if the source was either the inference update API or the adaptive allocations autoscaler. out.writeBoolean(isInternal()) preserves this logic (I think). It means the stream reader will treat an inference API call as an adaptive allocations call, but that only affects serverless, which is only a mixed cluster during a rolling update.
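
For what it's worth, a sketch of the wire mapping being described (the version-check call shape is assumed from the usual pattern, not copied from this PR):

    // Serializing: new nodes get the full enum; old nodes get the legacy
    // boolean, where both INFERENCE and ADAPTIVE_ALLOCATIONS collapse to true.
    if (out.getTransportVersion().supports(INFERENCE_UPDATE_ML)) {
        out.writeEnum(source);
    } else {
        out.writeBoolean(isInternal());
    }

    // Deserializing from an old node: true can only be read back as
    // ADAPTIVE_ALLOCATIONS, which is why an inference call looks like an
    // adaptive allocations call in a mixed cluster.
    source = in.getTransportVersion().supports(INFERENCE_UPDATE_ML)
        ? in.readEnum(Source.class)
        : (in.readBoolean() ? Source.ADAPTIVE_ALLOCATIONS : Source.API);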


public boolean isInternal() {
-    return isInternal;
+    return source == Source.INFERENCE || source == Source.ADAPTIVE_ALLOCATIONS;
}
Contributor


Can you confirm that we do want Source.INFERENCE here for all the usages of isInternal() below?

Member Author


Confirmed! Yeah, the inference update code previously set isInternal to true (back when the boolean existed).
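
For context, the before/after at the call sites looks roughly like this (setter names assumed for illustration):

    // Before: the inference update code flipped the bare boolean.
    request.setIsInternal(true);

    // After: callers declare their origin explicitly.
    request.setSource(UpdateTrainedModelDeploymentAction.Source.INFERENCE);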


public static final ParseField TIMEOUT = new ParseField("timeout");

private static final TransportVersion INFERENCE_UPDATE_ML = TransportVersion.fromName("inference_update_ml");
Contributor


I think this name INFERENCE_UPDATE_ML isn't particularly clear.

What about something like UPDATE_TRAINED_MODEL_DEPLOYMENT_REQUEST_SOURCE or so? 🤷

import static org.elasticsearch.xpack.core.ml.action.StartTrainedModelDeploymentAction.Request.NUMBER_OF_ALLOCATIONS;

public class UpdateTrainedModelDeploymentAction extends ActionType<CreateTrainedModelAssignmentAction.Response> {
public enum Source {
Contributor


Isn't it cleaner to move this into Request (so UpdateTrainedModelDeploymentAction.Request.Source)?

Contributor

@jan-elastic jan-elastic left a comment


LGTM, just nitpicking

public enum Source {
    API,
    ADAPTIVE_ALLOCATIONS,
    INFERENCE
Member


nit: INFERENCE is pretty vague; from the usage it appears to mean the request comes from the Inference API.

Suggested change:
-    INFERENCE
+    INFERENCE_API

@prwhelan prwhelan merged commit 4290a8e into elastic:main Oct 13, 2025
34 checks passed
