fix: don't set reasoning effort for non-reasoning models #97
Conversation
Signed-off-by: Huamin Chen <[email protected]>
Pull Request Overview
This PR fixes a regression introduced when reasoning mode support was added, specifically ensuring that reasoning effort is not set for models that don't support reasoning features.
- Refactored the reasoning mode implementation from hardcoded model family detection to a configuration-driven approach
- Added model reasoning configurations to define which models support reasoning and their specific syntax requirements
- Updated test coverage to verify that unknown/unsupported models don't receive reasoning fields
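The core of the fix can be sketched as a small guard: only attach reasoning fields when the model is explicitly configured as reasoning-capable. This is an illustrative sketch, not the PR's actual code — the type and function names (`ReasoningConfig`, `applyReasoning`, `EffortParam`) are hypothetical stand-ins for the real types in `src/semantic-router/pkg/config/config.go` and `reason_mode_selector.go`.

```go
package main

import "fmt"

// ReasoningConfig is a hypothetical stand-in for one config-driven entry
// describing whether a model supports reasoning and which request field
// its family expects.
type ReasoningConfig struct {
	SupportsReasoning bool
	EffortParam       string // e.g. "reasoning_effort"
}

// applyReasoning sets the reasoning field on the request body only when the
// model is configured as reasoning-capable. Unknown or non-reasoning models
// are left untouched, which is the regression this PR fixes.
func applyReasoning(body map[string]any, cfg map[string]ReasoningConfig, model, effort string) {
	rc, ok := cfg[model]
	if !ok || !rc.SupportsReasoning {
		return // do not set reasoning effort for non-reasoning models
	}
	body[rc.EffortParam] = effort
}

func main() {
	cfg := map[string]ReasoningConfig{
		"deepseek-v3.1": {SupportsReasoning: true, EffortParam: "reasoning_effort"},
	}

	unknown := map[string]any{"model": "phi-2"}
	applyReasoning(unknown, cfg, "phi-2", "high")
	fmt.Println(unknown) // map[model:phi-2] — no reasoning field added

	known := map[string]any{"model": "deepseek-v3.1"}
	applyReasoning(known, cfg, "deepseek-v3.1", "high")
	fmt.Println(known) // reasoning_effort is set for the configured model
}
```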
Reviewed Changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `config/config.yaml` | Adds model reasoning configurations defining supported models and their reasoning syntax |
| `src/semantic-router/pkg/config/config.go` | Implements config types and pattern-matching logic for model reasoning configurations |
| `src/semantic-router/pkg/extproc/reason_mode_selector.go` | Refactors reasoning logic to use the config-driven approach instead of hardcoded model detection |
| `src/semantic-router/pkg/extproc/reason_mode_selector_test.go` | Updates tests to use the new config-driven approach and adds comprehensive test coverage |
| `src/semantic-router/pkg/extproc/reasoning_integration_test.go` | Updates integration tests to verify proper behavior for unknown models and config-driven reasoning |
Let's stop cycling through reasoning string matches, patterns, and similar workarounds, and find a better approach for the auto-reasoning logic. The model name can be very dynamic since it is set in the vLLM startup options; for example, I can set `dsv31` as the model name for deepseek-v3.1. We should instead have fixed group/family names for the models we support, e.g.:

deepseek, gpt-oss, qwen

The model name itself can be very flexible: for the deepseek family we could register `ds-v3`, `self-hosted/ds`, `official/deepseek`, or anything else, since that is just the model name used in the request body. All we care about is the model's reasoning family — if it is empty, we simply treat the model as not supporting hybrid reasoning.
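The family mapping described above can be sketched as a simple config-driven lookup. This is a minimal illustration, not the project's actual API — `reasoningFamilyFor` and the alias map are assumed names:

```go
package main

import "fmt"

// reasoningFamilyFor maps an arbitrary, operator-chosen model name (whatever
// was passed to vLLM at startup) to a fixed reasoning family from config.
// An empty result means the model is not known to support hybrid reasoning,
// so no reasoning fields should be set.
func reasoningFamilyFor(families map[string]string, model string) string {
	return families[model]
}

func main() {
	// Operators can register any alias they launched vLLM with.
	families := map[string]string{
		"ds-v3":             "deepseek",
		"self-hosted/ds":    "deepseek",
		"official/deepseek": "deepseek",
		"qwen3-32b":         "qwen",
	}
	fmt.Println(reasoningFamilyFor(families, "ds-v3"))         // deepseek
	fmt.Println(reasoningFamilyFor(families, "unknown-model")) // "" — no hybrid reasoning
}
```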
@Xunzhuo Model family is a good idea. It is added to the new config now.
Review comments addressed; merging this so I can open other PRs that use the new config.
What type of PR is this?
Fix: if the model doesn't support reasoning, don't set the reasoning effort in the API call.
What this PR does / why we need it:
This fixes a regression introduced when reasoning mode support was added.
Which issue(s) this PR fixes:
Fixes #
Release Notes: Yes/No