[ML] Use internal user for internal inference action #128327

jonathan-buttner · 2025-05-22T16:56:08Z

When we added the proxy action, it introduced a bug: InferenceAction.Request moved to an internal action instead of a monitor action. A few places still use InferenceAction.Request directly. In those places we need to execute the client with INFERENCE_ORIGIN to indicate that we're acting as the internal user. Otherwise, a read-only user (with only the monitor_inference privilege) gets an authorization error when attempting to make inference requests.
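
The effect of the origin can be illustrated with a toy model. The Python sketch below is not Elasticsearch code: the `authorize` function, the `User` class, the origin string value, and the privilege-to-action patterns are hypothetical simplifications. Only the action name and the `monitor_inference`, `manage`, and `all` privilege names come from this PR. It shows why the same internal action is rejected when it runs as the calling user but allowed when it runs under an internal origin.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase
from typing import Optional

# Action name taken from the error message in this PR.
INTERNAL_INFERENCE_ACTION = "cluster:internal/xpack/inference"
# Stand-in for the INFERENCE_ORIGIN constant; the string value is an assumption.
INFERENCE_ORIGIN = "inference"

# Hypothetical, simplified mapping from cluster privilege to action patterns.
PRIVILEGE_PATTERNS = {
    "monitor_inference": ["cluster:monitor/xpack/inference*"],
    "manage": ["cluster:*"],
    "all": ["*"],
}


@dataclass(frozen=True)
class User:
    name: str
    cluster_privileges: tuple


def authorize(action: str, user: User, origin: Optional[str] = None) -> bool:
    """Return True if the action may run.

    Simplification: a request that carries an internal origin is treated as
    issued by the internal user and is not checked against the caller's
    privileges.
    """
    if origin is not None:
        return True
    return any(
        fnmatchcase(action, pattern)
        for privilege in user.cluster_privileges
        for pattern in PRIVILEGE_PATTERNS.get(privilege, [])
    )


read_only = User("test_read_user", ("monitor_inference",))

# Without the origin: the internal action runs as the calling user.
print(authorize(INTERNAL_INFERENCE_ACTION, read_only))  # prints False
# With the origin: the same action runs as the internal user.
print(authorize(INTERNAL_INFERENCE_ACTION, read_only, origin=INFERENCE_ORIGIN))  # prints True
```

In this model, the fix corresponds to passing the origin at the call sites that use the internal action directly, rather than widening what `monitor_inference` grants.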

Reproducing the issue

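The issue can be reproduced as follows. Run the setup requests as an admin, then run the search and ES|QL requests as a user that has only the monitor_inference cluster privilege: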

PUT _inference/rerank/coherererank
{
    "service": "cohere",
    "service_settings": {
        "api_key": "api_key",
        "model_id": "rerank-v3.5"
    }
}

PUT jon/_doc/1?pretty
{
    "title": "The Terminator",
    "overview": "A cyborg is sent back in time to kill Sarah Connor."
}

PUT jon/_doc/2?pretty
{
    "title": "Terminator 2: Judgment Day",
    "overview": "A cyborg is sent back in time to protect John Connor."
}

PUT jon/_doc/3?pretty
{
    "title": "Terminator Genisys",
    "overview": "A cyborg is sent back in time to protect Sarah Connor."
}

GET jon/_search <-- This should result in a permissions error
{
    "_source": [
        "title"
    ],
    "retriever": {
        "text_similarity_reranker": {
            "retriever": {
                "standard": {
                    "query": {
                        "multi_match": {
                            "fields": [
                                "title",
                                "overview"
                            ],
                            "query": "terminator arnold"
                        }
                    }
                }
            },
            "field": "title",
            "inference_text": "terminator arnold",
            "inference_id": "coherererank"
        }
    }
}

POST /_query?format=txt <--- this should also fail
{
    "query": "FROM jon | KEEP title, overview | SORT title DESC | LIMIT 10 | RERANK \"terminator arnold\" ON title WITH coherererank"
}

Error

{
    "error": {
        "root_cause": [
            {
                "type": "status_exception",
                "reason": "[text_similarity_reranker] search failed - retrievers '[standard]' returned errors. All failures are attached as suppressed exceptions.",
                "suppressed": [
                    {
                        "type": "search_phase_execution_exception",
                        "reason": "Computing updated ranks for results failed",
                        "phase": "rank-feature",
                        "grouped": true,
                        "failed_shards": []
                    }
                ]
            }
        ],
        "type": "status_exception",
        "reason": "[text_similarity_reranker] search failed - retrievers '[standard]' returned errors. All failures are attached as suppressed exceptions.",
        "suppressed": [
            {
                "type": "search_phase_execution_exception",
                "reason": "Computing updated ranks for results failed",
                "phase": "rank-feature",
                "grouped": true,
                "failed_shards": [],
                "caused_by": {
                    "type": "security_exception",
                    "reason": "action [cluster:internal/xpack/inference] is unauthorized for user [test_read_user] with effective roles [test_read], this action is granted by the cluster privileges [manage,all]"
                }
            }
        ]
    },
    "status": 403
}
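
When checking a reproduction, the relevant detail is the `security_exception` nested under `caused_by`. As a convenience, a small helper can pull it out of the response body. This is only a sketch: the function name `find_security_exception` is hypothetical, and the sample body is an abbreviated copy of the response above.

```python
import json
from typing import Any, Optional


def find_security_exception(node: Any) -> Optional[str]:
    """Depth-first search of an error body for a security_exception reason."""
    if isinstance(node, dict):
        if node.get("type") == "security_exception":
            return node.get("reason")
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_security_exception(child)
        if found is not None:
            return found
    return None


# Abbreviated copy of the 403 response shown above.
body = json.loads("""
{
    "error": {
        "type": "status_exception",
        "suppressed": [
            {
                "type": "search_phase_execution_exception",
                "reason": "Computing updated ranks for results failed",
                "caused_by": {
                    "type": "security_exception",
                    "reason": "action [cluster:internal/xpack/inference] is unauthorized for user [test_read_user] with effective roles [test_read], this action is granted by the cluster privileges [manage,all]"
                }
            }
        ]
    },
    "status": 403
}
""")

reason = find_security_exception(body)
print(reason)
```

If the helper returns None for a 403 response, the failure is probably unrelated to the authorization problem described here.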

Testing that it works correctly

To verify the fix, we can do the following:

Create a role with only monitor_inference

POST _security/role/test_read?pretty
{
    "cluster": ["monitor_inference"],
    "indices": [
      {
        "names": [
          "jon*"
        ],
        "privileges": [
          "read"
        ],
        "allow_restricted_indices": false
      }
    ]
}

Create a user that uses the role

POST _security/user/test_read_user?pretty
{
    "password": "password",
    "roles": [
      "test_read"
    ]
}

Perform the request with the test_read_user

GET jon/_search
{
    "_source": [
        "title"
    ],
    "retriever": {
        "text_similarity_reranker": {
            "retriever": {
                "standard": {
                    "query": {
                        "multi_match": {
                            "fields": [
                                "title",
                                "overview"
                            ],
                            "query": "terminator arnold"
                        }
                    }
                }
            },
            "field": "title",
            "inference_text": "terminator arnold",
            "inference_id": "coherererank"
        }
    }
}

POST /_query?format=txt
{
    "query": "FROM jon | KEEP title, overview | SORT title DESC | LIMIT 10 | RERANK \"terminator arnold\" ON title WITH coherererank"
}
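
Outside of a console, the requests have to be authenticated as test_read_user explicitly. The sketch below builds the ES|QL request with HTTP basic authentication using only the Python standard library. The base URL `http://localhost:9200` and the helper name `build_request` are assumptions for illustration (a secured cluster would typically use HTTPS); the user name, password, and query come from the steps above.

```python
import base64
import json
import urllib.request

# Assumption: a local development cluster reachable over plain HTTP.
BASE_URL = "http://localhost:9200"


def build_request(path: str, body: dict, user: str, password: str) -> urllib.request.Request:
    """Build a POST request with a JSON body and a basic-auth header."""
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url=f"{BASE_URL}/{path.lstrip('/')}",
        data=json.dumps(body).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
    )


esql_request = build_request(
    "_query?format=txt",
    {
        "query": 'FROM jon | KEEP title, overview | SORT title DESC | LIMIT 10 '
                 '| RERANK "terminator arnold" ON title WITH coherererank'
    },
    user="test_read_user",
    password="password",
)

# Sending is left commented out because it needs a running cluster:
# with urllib.request.urlopen(esql_request) as response:
#     print(response.status, response.read().decode("utf-8"))
print(esql_request.get_method(), esql_request.full_url)
```

With the fix applied, sending this request as test_read_user should succeed instead of returning the 403 shown earlier.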

elasticsearchmachine · 2025-05-22T16:56:33Z

Hi @jonathan-buttner, I've created a changelog YAML for you.

elasticsearchmachine · 2025-05-22T18:16:56Z

Pinging @elastic/ml-core (Team:ML)

elasticsearchmachine · 2025-05-22T19:00:45Z

💔 Backport failed

Status	Branch	Result
❌	8.19	Commit could not be cherrypicked due to conflicts
❌	9.0	Commit could not be cherrypicked due to conflicts
❌	8.18	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 128327

* Using correct origin for inference action
* Update docs/changelog/128327.yaml
* [CI] Auto commit changes from spotless

---------

Co-authored-by: elasticsearchmachine <[email protected]>

(cherry picked from commit 19e18a9)

# Conflicts:
# x-pack/plugin/esql/src/main/java/org/elasticsearch/xpack/esql/inference/InferenceRunner.java

jonathan-buttner · 2025-05-22T19:15:07Z

💚 All backports created successfully

Status	Branch	Result
✅	9.0
✅	8.19
✅	8.18

Questions?

Please refer to the Backport tool documentation

Using correct origin for inference action

2323d00

jonathan-buttner added the >bug, :ml, Team:ML, auto-backport, v8.19.0, v9.1.0, v9.0.3, and v8.18.3 labels May 22, 2025

jonathan-buttner and others added 2 commits May 22, 2025 12:56

Update docs/changelog/128327.yaml

eff5669

[CI] Auto commit changes from spotless

eef31d2

jonathan-buttner linked an issue May 22, 2025 that may be closed by this pull request

Performing rerank inference requests with monitor_inference fails with authorization error #128328

Closed

jonathan-buttner marked this pull request as ready for review May 22, 2025 18:16

prwhelan approved these changes May 22, 2025

jonathan-buttner merged commit 19e18a9 into elastic:main May 22, 2025
18 checks passed

elasticsearchmachine added the backport pending label May 22, 2025

This was referenced May 22, 2025

[9.0] [ML] Use internal user for internal inference action (#128327) #128330

Merged

[8.19] [ML] Use internal user for internal inference action (#128327) #128332

Merged

jonathan-buttner mentioned this pull request May 22, 2025

[8.18] [ML] Use internal user for internal inference action (#128327) #128333

Merged

davidkyle mentioned this pull request May 23, 2025

[ML] Yaml test that runs inference as a non-admin user #128363

Merged
