fix: disable EAGLE3 speculative decoding for gpt-oss-120b
Streaming responses consistently dropped the last 1-2 tokens because of an EAGLE3 bug in vLLM v0.12.0. Non-streaming responses were unaffected.