[ML] Partial fix for deployment disappearing #137216

jonathan-buttner · 2025-10-27T18:35:59Z

WIP

This PR is just to show the changes I made to be able to test the issue here: #137134

To make the reproduction faster I temporarily changed the code to allow the times to be shorter:

PUT /_cluster/settings
{
  "persistent": {
    "xpack.ml.trained_models.adaptive_allocations.scale_to_zero_time": "10s",
    "xpack.ml.trained_models.adaptive_allocations.scale_up_cooldown_time": "10s",
    "logger.org.elasticsearch.xpack.ml.inference.assignment": "DEBUG"
  }
}

Then we can follow the steps in the issue to reproduce, which are:

Create deployment via creating inference endpoint

PUT _inference/rerank/mytest-old
{
    "service": "elasticsearch",
    "service_settings": {
        "num_threads": 1,
        "model_id": ".rerank-v1",
        "adaptive_allocations": {
            "enabled": true,
            "min_number_of_allocations": 0,
            "max_number_of_allocations": 2
        }
    }
}

Wait for mytest-old to scale to zero ~10 seconds

GET _ml/trained_models/_stats

Create a new deployment via inference endpoint, mytest-old should still exist, but it will have an allocation which is not intended.

PUT _inference/rerank/mytest-new3
{
    "service": "elasticsearch",
    "service_settings": {
        "num_threads": 1,
        "model_id": ".rerank-v1",
        "adaptive_allocations": {
            "enabled": true,
            "min_number_of_allocations": 0,
            "max_number_of_allocations": 2
        }
    }
}

GET _ml/trained_models/_stats

elasticsearchmachine · 2025-10-27T18:36:40Z

Hi @jonathan-buttner, I've created a changelog YAML for you.

Partial fix for deployment disappearing

0f14434

jonathan-buttner added >bug :ml Machine learning Team:ML Meta label for the ML team v9.3.0 labels Oct 27, 2025

jonathan-buttner mentioned this pull request Oct 27, 2025

[ML] Old trained model deployment got deleted unexpectedly after a new one is added through inference API #137134

Closed

Update docs/changelog/137216.yaml

e1b519c

[CI] Auto commit changes from spotless

6f4c50b

jonathan-buttner closed this Nov 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ML] Partial fix for deployment disappearing #137216

[ML] Partial fix for deployment disappearing #137216

Uh oh!

jonathan-buttner commented Oct 27, 2025 •

edited

Loading

Uh oh!

elasticsearchmachine commented Oct 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[ML] Partial fix for deployment disappearing #137216

[ML] Partial fix for deployment disappearing #137216

Uh oh!

Conversation

jonathan-buttner commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Oct 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jonathan-buttner commented Oct 27, 2025 •

edited

Loading