✨ vLLM Backend integration #42
Closed
Summary
This PR extends the Deepsparse Backend implementation PR. The base branch is `parfeniukink/features/deepsparse-backend`.

- `vllm` is added to the optional dependencies.
- The `VllmBackend` class encapsulates the vLLM integration (sketched after this list).
- `guidellm/backend/vllm` is available only if the Python version and the runtime platform pass validation (see the import-guard sketch after this list).
- `vllm` tests are skipped if the platform is not Linux (see the pytest sketch after this list).
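Since the list above is terse, here are minimal sketches of what these pieces typically look like. They are illustrative only: the wrapper is written against the public vLLM API (`vllm.LLM` / `vllm.SamplingParams`), but the module paths, the Python version bound, and helper names are assumptions, not code from this PR.

```python
# Illustrative sketch of a backend wrapper around the public vLLM API;
# the real VllmBackend in this PR may differ.
from vllm import LLM, SamplingParams

class VllmBackend:
    """Encapsulates the vLLM engine behind a small generate() interface."""

    def __init__(self, model: str):
        self._llm = LLM(model=model)

    def generate(self, prompt: str, max_tokens: int = 128) -> str:
        params = SamplingParams(max_tokens=max_tokens)
        # vLLM returns one RequestOutput per prompt; take its first completion.
        outputs = self._llm.generate([prompt], params)
        return outputs[0].outputs[0].text
```

The availability gate is usually an import-time guard in the package's `__init__`; the version bound and submodule name below are placeholders:

```python
# guidellm/backend/vllm/__init__.py -- illustrative guard, not the actual file
import sys

# Export the backend only when the runtime passes validation; the 3.8 bound
# and the submodule name "vllm_backend" are assumptions.
if sys.platform == "linux" and sys.version_info >= (3, 8):
    from .vllm_backend import VllmBackend

    __all__ = ["VllmBackend"]
else:
    __all__ = []
```

And the test-skipping bullet maps naturally to a module-level pytest marker:

```python
# Skip every vllm test in the module when not running on Linux.
import sys
import pytest

pytestmark = pytest.mark.skipif(
    sys.platform != "linux",
    reason="vllm backend is only supported on Linux",
)
```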
Usage
This is an example of a command you can use in your terminal:
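The command itself was not preserved in this description. Assuming a `guidellm` console entry point that accepts the two flags documented below directly, an invocation might look like:

```bash
# Hypothetical invocation; only --data and --model are documented in this PR.
guidellm --data=openai_humaneval --model=/local/path/my_model
```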
- `--data=openai_humaneval`: determines the dataset.
- `--model=/local/path/my_model`: determines the local path to the model object. If not specified, the environment variable is used.

Environment configuration
The model can also be set with the `GUIDELLM__LLM_MODEL` environment variable. If neither the CLI value nor the environment variable is set, the default is used. Currently, the default model is `mistralai/Mistral-7B-Instruct-v0.3`.
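A minimal sketch of this resolution order (CLI flag, then environment variable, then built-in default); the function name is illustrative, not the actual guidellm API:

```python
# Hypothetical helper showing the precedence this PR describes:
# CLI flag > GUIDELLM__LLM_MODEL env var > built-in default.
import os
from typing import Optional

DEFAULT_MODEL = "mistralai/Mistral-7B-Instruct-v0.3"

def resolve_model(cli_value: Optional[str] = None) -> str:
    """Return the model path/id, falling back from CLI to env to default."""
    if cli_value:
        return cli_value
    return os.environ.get("GUIDELLM__LLM_MODEL", DEFAULT_MODEL)
```

With this precedence, `resolve_model("/local/path/my_model")` ignores the environment, while `resolve_model(None)` falls back to `GUIDELLM__LLM_MODEL` and finally to the Mistral default.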