feat(llm): Add vLLM Support #683

par4m · 2025-07-02T07:43:09Z

Implements #659

Since vLLM doesn't support MacOS, models can only be run on CPU.

Choose a model with less than 1.5B Parameters and at least 2400 token support, for testing DeepSeek Coder 1.3B is used on MacOS M4

Install vLLM: (python < 3.13)

pip install vllm

Run vLLM Server

vllm serve deepseek-ai/deepseek-coder-1.3b-instruct

cocoindex.LlmSpec(
    api_type=cocoindex.LlmApiType.VLLM,
    model="deepseek-ai/deepseek-coder-1.3b-instruct",
    address="http://127.0.0.1:8000/v1",                    # /v1 is mandatory 
)

Also vLLM Supports Embedding the same way as OpenAI, a new PR for it would be better - https://docs.vllm.ai/en/v0.6.6/serving/openai_compatible_server.html#embeddings-api

More info - https://docs.vllm.ai/en/v0.6.6/serving/openai_compatible_server.html#

docs/docs/ai/llm.mdx

par4m · 2025-07-03T03:31:56Z

Not sure why formatting failed can you please rerun the checks

badmonster0 · 2025-07-03T07:16:12Z

Not sure why formatting failed can you please rerun the checks

Should be caused by a format issue of some example code. fixed in #687.

The check still fails as it's not merged with the latest main yet, but it's safe to merge. I'll merge now.

badmonster0 · 2025-08-19T02:08:33Z

thank you @par4m ! new release note is out and we made a section for you, we love your contribution!!
https://cocoindex.io/blogs/cocoindex-changelog-2025-08-18#par4m ❤️

par4m added 3 commits July 2, 2025 13:06

add vllm

2410892

fix

45db82c

fix formatting

2ea1601

badmonster0 approved these changes Jul 3, 2025

View reviewed changes

docs/docs/ai/llm.mdx Show resolved Hide resolved

add to table

a266104

badmonster0 merged commit c1ce446 into cocoindex-io:main Jul 3, 2025
21 of 24 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(llm): Add vLLM Support #683

feat(llm): Add vLLM Support #683

par4m commented Jul 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

par4m commented Jul 3, 2025

Uh oh!

badmonster0 commented Jul 3, 2025

Uh oh!

Uh oh!

badmonster0 commented Aug 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat(llm): Add vLLM Support #683

feat(llm): Add vLLM Support #683

Conversation

par4m commented Jul 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

par4m commented Jul 3, 2025

Uh oh!

badmonster0 commented Jul 3, 2025

Uh oh!

Uh oh!

badmonster0 commented Aug 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

par4m commented Jul 2, 2025 •

edited

Loading