Skip to content

[None][fix] Replace assertions with warnings for unsupported logits/logprobs in speculative sampler#12547

Open
yifjiang wants to merge 1 commit intoNVIDIA:mainfrom
yifjiang:fix/spec-sampler-no-crash-v2
Open

[None][fix] Replace assertions with warnings for unsupported logits/logprobs in speculative sampler#12547
yifjiang wants to merge 1 commit intoNVIDIA:mainfrom
yifjiang:fix/spec-sampler-no-crash-v2

Conversation

@yifjiang
Copy link

Summary

Test plan

  • Send a request with logprobs=True to a model serving with speculative decoding (e.g. MTP) — verify server logs a warning instead of crashing
  • Verify the response completes successfully (without logprobs)
  • Existing speculative decoding tests pass

🤖 Generated with Claude Code

…ogprobs in speculative sampler

When return_context_logits, return_generation_logits, or
return_log_probs is requested with speculative decoding, the server
crashes with an AssertionError. Replace these assertions with warnings
so the server stays alive and the request completes without the
unsupported fields.

Signed-off-by: yifjiang <19356972+yifjiang@users.noreply.github.com>
@yifjiang yifjiang requested a review from a team as a code owner March 25, 2026 17:07
@yifjiang yifjiang requested a review from ziyixiong-nv March 25, 2026 17:07
@svc-trtllm-gh-bot svc-trtllm-gh-bot added the Community want to contribute PRs initiated from Community label Mar 25, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Community want to contribute PRs initiated from Community

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants