[ML] EIS Unified chat completions integration #118301

jonathan-buttner · 2024-12-09T21:32:32Z

WIP

This PR implements the changes to add chat completion support to EIS within the inference plugin.

In its current state, this PR is hard-coded to work against OpenAI.

Testing

Run Elasticsearch with the following feature flags enabled:

run-es -Des.inference_unified_feature_flag_enabled=true -Des.elastic_inference_service_feature_flag_enabled=true

Create an endpoint

PUT http://localhost:9200/_inference/completion/test
{
    "service": "elastic",
    "service_settings": {
        "api_key": "key",
        "model_id": "gpt-4o"
    }
}

Send request

POST http://localhost:9200/_inference/completion/test/_unified
{
    "model": "gpt-4o",
    "messages": [
        {
            "role": "user",
            "content": "What is the weather like in Boston today?"
        }
    ],
    "stop": ["none"],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "description": "Get the current weather in a given location",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "location": {
                            "type": "string",
                            "description": "The city and state, e.g. San Francisco, CA"
                        },
                        "unit": {
                            "type": "string",
                            "enum": ["celsius", "fahrenheit"]
                        }
                    },
                    "required": ["location"]
                }
            }
        }
    ],
    "tool_choice": "auto"
}
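
For scripted testing, the two requests above could be built along these lines. This is only a sketch, not part of the PR: the helper names (`build_create_endpoint_request`, `build_unified_request`, `send`) are hypothetical, the host, paths, and body fields are copied from the requests above, and it assumes the local node accepts unauthenticated requests. The `stop` field is left out for brevity.

```python
import json
import urllib.request

# Host and port are taken from the requests in the PR description.
BASE_URL = "http://localhost:9200"


def _json_request(method, path, body):
    """Build (but do not send) a JSON request against the local node."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method=method,
    )


def build_create_endpoint_request(endpoint_id, api_key, model_id):
    """Hypothetical helper: mirrors the 'Create an endpoint' request."""
    body = {
        "service": "elastic",
        "service_settings": {"api_key": api_key, "model_id": model_id},
    }
    return _json_request("PUT", f"/_inference/completion/{endpoint_id}", body)


def build_unified_request(endpoint_id, model_id, user_content, tools=None):
    """Hypothetical helper: mirrors the 'Send request' call to _unified."""
    body = {
        "model": model_id,
        "messages": [{"role": "user", "content": user_content}],
    }
    if tools:
        # Only include tool fields when tool definitions are supplied.
        body["tools"] = tools
        body["tool_choice"] = "auto"
    return _json_request(
        "POST", f"/_inference/completion/{endpoint_id}/_unified", body
    )


def send(request):
    """Send a prepared request and return the raw response body as text."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

With a node running as described under Testing, `send(build_create_endpoint_request("test", "key", "gpt-4o"))` followed by `send(build_unified_request("test", "gpt-4o", "What is the weather like in Boston today?"))` would issue the same two calls. The response is returned as raw text because its exact shape is not described here.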

elasticsearchmachine · 2024-12-09T21:32:57Z

Hi @jonathan-buttner, I've created a changelog YAML for you.

jonathan-buttner · 2025-01-07T21:54:46Z

Closing in favor of: #118871

jonathan-buttner and others added 4 commits December 6, 2024 16:02

Starting completion model

ccec39b

Adding model

467747f

initial implementation of request and response handling, manager, and…

69ba46d

… entity

Working response from openai

39e2c27

jonathan-buttner added the >enhancement, :ml (Machine learning), Team:ML (Meta label for the ML team), v9.0.0, and v8.18.0 labels Dec 9, 2024

Update docs/changelog/118301.yaml

7984b69

jonathan-buttner added 4 commits December 10, 2024 08:34

Fixing comment

be588f4

Adding some initial tests

38a58f9

Merge branch 'main' of github.com:elastic/elasticsearch into ml-eis-i…

cad6f1e

…ntegration

Moving tests around

2e4fb05

jaybcee mentioned this pull request Dec 17, 2024

[Elastic Inference Service] Add ElasticInferenceService Unified ChatCompletions Integration #118871

Merged

jonathan-buttner closed this Jan 7, 2025
