Resolve the issue of getting different results from transformers evaluation and vLLM evaluation #44

RadoslawPlawecki · 2025-11-28T18:55:58Z

Description

The transformers evaluation and the vLLM evaluation produced different results for the same inputs. To resolve this, the following environment variables were added to interference_vllm.py.

os.environ["VLLM_USE_V1"] = "0"
os.environ["CUBLAS_WORKSPACE_CONFIG"] = ":4096:8"
os.environ["VLLM_ENABLE_V1_MULTIPROCESSING"] = "0"

With these changes in place, all tests pass.
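For reviewers who want to reproduce the setup outside interference_vllm.py, here is a minimal sketch of applying the same three variables from one place. The names `DETERMINISM_ENV` and `apply_determinism_env` are hypothetical and not part of this PR. It also assumes the variables should be set before vLLM (and CUDA) is initialized, which is the usual safe ordering but is not something this PR verifies.

```python
import os

# Values copied from this PR's description; the dict name is hypothetical.
DETERMINISM_ENV = {
    "VLLM_USE_V1": "0",
    "CUBLAS_WORKSPACE_CONFIG": ":4096:8",
    "VLLM_ENABLE_V1_MULTIPROCESSING": "0",
}


def apply_determinism_env(env=os.environ):
    """Write the variables into `env` and return the values they replaced.

    A variable that was previously unset maps to None in the returned dict,
    so the caller can restore the earlier state if needed.
    """
    previous = {name: env.get(name) for name in DETERMINISM_ENV}
    env.update(DETERMINISM_ENV)
    return previous


# Assumption: call this before importing vllm, so the settings are already
# in the process environment when the engine and CUDA libraries start up.
previous = apply_determinism_env()
# import vllm  # the import would follow here in a real script
```

Keeping the values in a single dictionary makes it easy to log exactly which settings an evaluation run used, and the returned `previous` mapping lets a test restore the environment afterwards.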

DzmitryPihulski

The main issue here is that the chat template is not being used. I will add it in a future version.

codecov · 2025-12-01T07:48:57Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

added environment variables to interference_vllm.py

ebbc9b7

RadoslawPlawecki requested a review from DzmitryPihulski as a code owner November 28, 2025 18:55

RadoslawPlawecki added 2 commits November 28, 2025 20:32

delete coverage.xml

8bb7ea0

update evaluator.py

a51a18e

DzmitryPihulski assigned RadoslawPlawecki Nov 29, 2025

DzmitryPihulski approved these changes Dec 1, 2025

DzmitryPihulski merged commit 008de41 into LLMSQL:main Dec 1, 2025
4 checks passed
