Conversation


@benironside benironside commented Nov 7, 2025

Resolves #3474 by creating a tutorial for how to connect a custom LLM running in vLLM to Elastic.

Technical reviewers, I left a few questions for you in comments. Also:

  • Has this been tested with the Obs/Search Assistant, or are these instructions security-only?
  • Is this supported in v9.0+?
  • @dhru42 I could use some insight into how the use-case for this guide differs from the existing self-managed LLM guide

@benironside benironside self-assigned this Nov 7, 2025

github-actions bot commented Nov 7, 2025


1. Configure your host server with the necessary GPU resources.
2. Run the desired model in a vLLM container.
3. Use a reverse proxy like Nginx to securely expose the endpoint to {{ecloud}}.
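
To make step 3 more concrete, here is a minimal sketch of an Nginx server block, assuming vLLM is listening on localhost port 8000 and TLS certificates are already in place. The hostname, certificate paths, and API-key check are placeholders for illustration, not part of the official guide:

```nginx
# /etc/nginx/conf.d/vllm.conf (illustrative only; adjust names, paths, and auth to your environment)
server {
    listen 443 ssl;
    server_name llm.example.com;                          # placeholder hostname

    ssl_certificate     /etc/nginx/certs/fullchain.pem;   # placeholder certificate paths
    ssl_certificate_key /etc/nginx/certs/privkey.pem;

    location /v1/ {
        # Forward requests to the vLLM OpenAI-compatible server on the same host
        proxy_pass http://127.0.0.1:8000;
        proxy_set_header Host $host;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;

        # Optional: require a shared secret so only your Elastic deployment can reach the endpoint
        # if ($http_x_api_key != "YOUR_SECRET") { return 401; }
    }
}
```
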
benironside (Contributor Author) commented:
Is it just Elastic Cloud that this works with? Not other deployment types?

1. When you want to invoke a tool, never describe the call in text.
2. Always return the invocation in the `tool_calls` field.
3. The `content` field must remain empty for any assistant message that performs a tool call.
4. Only use tool calls defined in the "tools" parameter.
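
To illustrate rules 2 and 3, this is roughly what a conforming assistant message looks like in the OpenAI-compatible chat format that vLLM serves; the tool name and arguments here are made up for the example:

```json
{
  "role": "assistant",
  "content": "",
  "tool_calls": [
    {
      "id": "call_abc123",
      "type": "function",
      "function": {
        "name": "example_search_tool",
        "arguments": "{\"query\": \"failed logins in the last 24 hours\"}"
      }
    }
  ]
}
```
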
benironside (Contributor Author) commented:

Note to self: Following https://github.com/elastic/sdh-security-team/issues/1417 to confirm if this system prompt fix works

A reviewer replied:
Since 9.1.7 it seems this is no longer needed, but we can keep it until we change the recommended model. More important in this case is to make sure they add

feature_flags.overrides:
  securitySolution.inferenceChatModelDisabled: true

to config/kibana.yml, otherwise Mistral is not going to work with the Security Assistant (more details in the linked SDH above).
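
For anyone following along, this is how the override might sit in config/kibana.yml (shown on its own here; merge it with any existing feature_flags.overrides block in your file):

```yaml
# config/kibana.yml
# Disables the inference chat model path so the Security Assistant works with Mistral
# (see the linked SDH issue for details)
feature_flags.overrides:
  securitySolution.inferenceChatModelDisabled: true
```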

dhru42 commented Nov 10, 2025

> @dhru42 I could use some insight into how the use-case for this guide differs from the existing self-managed LLM guide

Can we make the existing page generic, then link to two methods?

  1. Connect to your own local LLM with LM Studio (exists already)
  2. Connect to your own local LLM with vLLM (the Google doc I shared)

benironside (Contributor Author) replied:

> @dhru42 I could use some insight into how the use-case for this guide differs from the existing self-managed LLM guide
>
> Can we make the existing page generic, then link to two methods?
>
> 1. Connect to your own local LLM with LM Studio (exists already)
> 2. Connect to your own local LLM with vLLM ([the Google doc I shared](https://docs.google.com/document/d/1pGKBECl6T4LdFhctAWURRZJrdEC8qKRVWz0bWnylN4s/edit?usp=sharing))

Yeah, that makes sense. @dhru42 I'm still curious to better understand the different use cases that each option addresses. How should users pick which one to set up?

After docs on-week (this week), I'll work on it and sync up with Patryk and/or Garrett to discuss details.

dhru42 commented Nov 12, 2025

@benironside there's a formatting issue. Could you ensure that all the steps are reflected as shown in the docs? Otherwise it LGTM.

[screenshot showing the formatting issue]

@benironside benironside requested a review from spong November 14, 2025 23:08
@benironside benironside marked this pull request as ready for review November 19, 2025 21:20
@benironside benironside requested review from a team as code owners November 19, 2025 21:20
2. Run the following terminal command to start the vLLM server, download the model, and expose it on port 8000:

```bash
docker run --name Mistral-Small-3.2-24B --gpus all \
```
A reviewer commented:
Is this something we will be able to update shortly? I mean, we should avoid recommending Mistral-Small-3.2-24B as it has a lot of issues with Security Assistant tool calling.

benironside (Contributor Author) replied:
We can update this any time. For now, since this model isn't recommended, I replaced it with [YOUR_MODEL_ID]. Make sense to you?

The reviewer replied:
I think it's going to be less confusing if we stick with the previous version and just update it with a new model, because the list of params depends on the model ID.
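
For reference, here is a sketch of what the full command could look like once a recommended model is settled on. The container name, port, volume mount, and model ID below are illustrative placeholders, and the exact flags will depend on the chosen model:

```bash
# Illustrative only: substitute the recommended model ID and any model-specific flags
docker run --name vllm-server --gpus all \
  -p 8000:8000 \
  -v ~/.cache/huggingface:/root/.cache/huggingface \
  -e HUGGING_FACE_HUB_TOKEN=YOUR_HF_TOKEN \
  vllm/vllm-openai:latest \
  --model YOUR_MODEL_ID

# Quick check that the OpenAI-compatible endpoint is up
curl http://localhost:8000/v1/models
```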


github-actions bot commented Nov 24, 2025

Vale Linting Results

Summary: 3 suggestions found

💡 Suggestions (3)
| File | Line | Rule | Message |
|------|------|------|---------|
| solutions/security/ai/connect-to-own-local-llm.md | 14 | Elastic.Capitalization | 'Connect to your own local LLM using LM Studio' should use sentence-style capitalization. |
| solutions/security/ai/connect-to-vLLM.md | 21 | Elastic.Capitalization | 'Connect vLLM to' should use sentence-style capitalization. |
| solutions/security/ai/connect-to-vLLM.md | 93 | Elastic.WordChoice | Consider using 'can, might' instead of 'may', unless the term is in the UI. |

@bmorelli25 bmorelli25 (Member) left a comment:

This is a good start, but right now the page feels like it's trying to be a guide and an example. If you pick a single type of content, it'll be more useful and easier to follow. I think you should structure this page as a guide, similar to these:

The example content, like the Server info, is probably still useful, but could be folded into the relevant step.

Thoughts?

@bmorelli25 bmorelli25 (Member) left a comment:

This looks great! Just two more small comments for your consideration.

🚢 🚢

@benironside benironside enabled auto-merge (squash) November 25, 2025 21:39
@benironside benironside disabled auto-merge November 25, 2025 21:39
@benironside benironside enabled auto-merge (squash) November 25, 2025 21:57
@benironside benironside merged commit 59c7441 into main Nov 25, 2025
7 of 8 checks passed
@benironside benironside deleted the 3474-vLLM-guide branch November 25, 2025 21:59