[Inference API ] Deprecate task_settings in favour of parameters #114176

maxhniebergall · 2024-10-04T21:16:58Z

Summary

This change deprecates task_settings and adds "parameters" as an alias for it. When creating an inference endpoint, users can pass an object named "parameters" in place of "task_settings"; the object accepts all of the same fields as before. If the user passes "task_settings" to the create endpoint API, the operation will still succeed, but there will be a warning asking them to migrate to "parameters".

The response to the create endpoint API includes both "parameters" and "task_settings" objects with identical fields, so that users can update their programs to expect "parameters" instead of "task_settings" without breaking backwards compatibility. The response to GET _inference/all will likewise include both "parameters" and "task_settings" until at least 9.0.
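As a rough illustration of the client-side migration described above, the sketch below reads an endpoint's settings from a response while tolerating servers that only return "task_settings". The helper name `read_parameters` and the sample dictionaries are hypothetical and not part of this PR; only the two field names come from the change.

```python
from typing import Any, Mapping


def read_parameters(endpoint: Mapping[str, Any]) -> dict:
    """Return an endpoint's settings, preferring "parameters" over "task_settings".

    Hypothetical client helper: it assumes only that a response may contain
    either key, or both with identical contents.
    """
    # Prefer the new key when the server returns it.
    if "parameters" in endpoint:
        return dict(endpoint["parameters"])
    # Fall back to the deprecated key; default to no settings at all.
    return dict(endpoint.get("task_settings", {}))


# Response shape after this change: both keys are present and identical.
new_style = {
    "parameters": {"input_type": "ingest"},
    "task_settings": {"input_type": "ingest"},
}
# Response shape from a server that predates the alias.
old_style = {"task_settings": {"input_type": "ingest"}}

print(read_parameters(new_style))
print(read_parameters(old_style))
```

A client written this way keeps working whether or not the deprecated key is eventually dropped from responses.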

TODO

- We will need to update the clients
- We will need to update the docs
  - The docs should specify the above information and provide examples like the ones below.
- Filter out empty task settings/parameters
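The last TODO item is not spelled out in this PR. One possible reading is that empty "task_settings" and "parameters" objects should be omitted from responses. The sketch below shows that reading on a plain dictionary; `drop_empty_settings` is a hypothetical name, and this is not the server-side implementation.

```python
def drop_empty_settings(endpoint: dict) -> dict:
    """Return a copy of an endpoint without empty settings objects.

    Assumption: "filter out" means omitting "task_settings" and "parameters"
    when they are empty objects. All other fields are kept unchanged.
    """
    settings_keys = ("task_settings", "parameters")
    return {
        key: value
        for key, value in endpoint.items()
        if not (key in settings_keys and value == {})
    }


elser = {
    "inference_id": "elser_endpoint2",
    "task_type": "sparse_embedding",
    "service": "elasticsearch",
    "task_settings": {},
    "parameters": {},
}
print(drop_empty_settings(elser))
```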

Examples of create endpoint API:

Put ELSER

Request:

{
    "service": "elasticsearch",
    "service_settings": {
        "model_id": ".elser_model_2",
        "num_allocations": 1,
        "num_threads": 1
    }
}

Response:

{
    "inference_id": "elser_endpoint2",
    "task_type": "sparse_embedding",
    "service": "elasticsearch",
    "service_settings": {
        "num_allocations": 1,
        "num_threads": 1,
        "model_id": ".elser_model_2"
    },
    "task_settings": {},
    "parameters": {}
}

Put cohere

Request:

{
    "service": "cohere",
    "service_settings": {
        "model_id": "embed-english-v3.0",
        "api_key": "<REDACTED>"
    },
    "task_settings": {
        "input_type": "ingest"
    }
}

Response:

{
    "inference_id": "testss",
    "task_type": "text_embedding",
    "service": "cohere",
    "service_settings": {
        "similarity": "dot_product",
        "dimensions": 1024,
        "model_id": "embed-english-v3.0",
        "rate_limit": {
            "requests_per_minute": 10000
        },
        "embedding_type": "float"
    },
    "task_settings": {
        "input_type": "ingest"
    },
    "parameters": {
        "input_type": "ingest"
    }
}

Request:

{
    "service": "cohere",
    "service_settings": {
        "model_id": "embed-english-v3.0",
        "api_key": "<REDACTED>"
    },
    "parameters": {
        "input_type": "ingest"
    }
}

Response:

{
    "inference_id": "tests2",
    "task_type": "text_embedding",
    "service": "cohere",
    "service_settings": {
        "similarity": "dot_product",
        "dimensions": 1024,
        "model_id": "embed-english-v3.0",
        "rate_limit": {
            "requests_per_minute": 10000
        },
        "embedding_type": "float"
    },
    "task_settings": {
        "input_type": "ingest"
    },
    "parameters": {
        "input_type": "ingest"
    }
}
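The two Cohere requests above differ only in the name of the settings object. A caller migrating existing request bodies could rename the key mechanically, as in this minimal sketch. The function name `migrate_request_body` is hypothetical, and rejecting a body that contains both keys is an assumption of this sketch; the PR does not state how the server treats that case.

```python
import copy


def migrate_request_body(body: dict) -> dict:
    """Return a copy of a create-endpoint body using "parameters".

    The input body is left unmodified.
    """
    migrated = copy.deepcopy(body)
    if "task_settings" in migrated:
        if "parameters" in migrated:
            # Assumption: treat a body with both keys as ambiguous.
            raise ValueError('body contains both "task_settings" and "parameters"')
        # Move the settings object to the new key name.
        migrated["parameters"] = migrated.pop("task_settings")
    return migrated


old_body = {
    "service": "cohere",
    "service_settings": {"model_id": "embed-english-v3.0"},
    "task_settings": {"input_type": "ingest"},
}
new_body = migrate_request_body(old_body)
print(new_body)
```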

Get _inference/all

Response:

{
    "endpoints": [
        {
            "inference_id": ".elser-2",
            "task_type": "sparse_embedding",
            "service": "elasticsearch",
            "service_settings": {
                "num_threads": 1,
                "model_id": ".elser_model_2",
                "adaptive_allocations": {
                    "enabled": true,
                    "min_number_of_allocations": 1,
                    "max_number_of_allocations": 8
                }
            },
            "task_settings": {},
            "parameters": {}
        },
        {
            "inference_id": "elser_endpoint1",
            "task_type": "sparse_embedding",
            "service": "elasticsearch",
            "service_settings": {
                "num_allocations": 1,
                "num_threads": 1,
                "model_id": ".elser_model_2"
            },
            "task_settings": {},
            "parameters": {}
        },
        {
            "inference_id": "testss",
            "task_type": "text_embedding",
            "service": "cohere",
            "service_settings": {
                "similarity": "dot_product",
                "dimensions": 1024,
                "model_id": "embed-english-v3.0",
                "rate_limit": {
                    "requests_per_minute": 10000
                },
                "embedding_type": "float"
            },
            "task_settings": {
                "input_type": "ingest"
            },
            "parameters": {
                "input_type": "ingest"
            }
        }
    ]
}
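Since every endpoint in the response above carries identical "task_settings" and "parameters" objects, a test or client could check that invariant directly. The following is a small sketch under that assumption; `mismatched_endpoints` is a hypothetical helper and the sample response is abbreviated from the one above.

```python
def mismatched_endpoints(response: dict) -> list:
    """Return the inference_ids whose two settings objects differ."""
    return [
        endpoint.get("inference_id")
        for endpoint in response.get("endpoints", [])
        if endpoint.get("parameters") != endpoint.get("task_settings")
    ]


# Abbreviated copy of the GET _inference/all response shown above.
response = {
    "endpoints": [
        {"inference_id": ".elser-2", "task_settings": {}, "parameters": {}},
        {
            "inference_id": "testss",
            "task_settings": {"input_type": "ingest"},
            "parameters": {"input_type": "ingest"},
        },
    ]
}
print(mismatched_endpoints(response))
```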

elasticsearchmachine · 2024-10-04T21:17:23Z

Hi @maxhniebergall, I've created a changelog YAML for you. Note that since this PR is labelled >deprecation, you need to update the changelog YAML to fill out the extended information sections.

server/src/main/java/org/elasticsearch/inference/ModelConfigurations.java

...ore/src/main/java/org/elasticsearch/xpack/core/inference/action/PutInferenceModelAction.java

x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferenceIndex.java

...plugin/inference/src/main/java/org/elasticsearch/xpack/inference/registry/ModelRegistry.java

server/src/main/java/org/elasticsearch/inference/InferenceService.java

elasticsearchmachine · 2024-10-07T21:29:17Z

Pinging @elastic/ml-core (Team:ML)

Max Hniebergall added 4 commits October 4, 2024 17:11

Add parameters to ModelConfigurations; rename task_settings constant

50d8912

Add endpointVersion exception in index mappings

714535e

Add mappings for endpoint_version

7f11346

start to handle mixed cluster failures for new mapping

6432240

maxhniebergall added >deprecation :ml Machine learning v8.16.0 v9.0.0 labels Oct 4, 2024

maxhniebergall requested review from dan-rubinstein and davidkyle October 4, 2024 21:16

Update docs/changelog/114176.yaml

84c75f1

davidkyle reviewed Oct 7, 2024

Max Hniebergall added 3 commits October 7, 2024 11:53

Only write EndpointVersion to index

388c475

Refactor EndpointVersion into enum instead of string

104ab7c

undo renaming task_settings field name

8c9c4f4

dan-rubinstein reviewed Oct 7, 2024

Max Hniebergall added 4 commits October 7, 2024 15:13

Remove endpoint_version

6de692b

precommit

fc72d8b

Add deprecation warning and integration tests

7bcec6c

Merge branch 'main' of github.com:elastic/elasticsearch into deprecateTaskSettings. Conflicts: server/src/main/java/org/elasticsearch/TransportVersions.java

4f4be9e

maxhniebergall marked this pull request as ready for review October 7, 2024 21:28

elasticsearchmachine added the Team:ML Meta label for the ML team label Oct 7, 2024

maxhniebergall and others added 5 commits October 7, 2024 17:37

Update 114176.yaml

e139224

spotless

7d6ab96

Ignore deprecation warnings on put in inference tests

081a1a8

Merge branch 'main' into deprecateTaskSettings

916d746

spotless

d3d211a

maxhniebergall changed the title ~~[Inference API ] Add endpoint_version to deprecate task settings~~ [Inference API ] Deprecate task_settings in favour of parameters Oct 8, 2024

maxhniebergall closed this Oct 10, 2024
