I've been using GPT-OSS 20B and 120B a lot recently, and I'm wondering whether Optillm helps with models that are already trained for reasoning (R1, Qwen3-Thinking, GPT-OSS, o3, etc.). These models are already trained with their own internal chain of thought and verification, so I suspect inference-time search strategies won't help much with filtering and pruning search paths. But maybe they would help a little, since they let the request think for longer and spend more tokens? What are your thoughts? And is there any research or are there benchmarks to validate this?
Yes, inference-time techniques are still useful for reasoning/thinking models. We have a number of them in the repo that demonstrate this; for instance, see autothink and deepthink. Even with reasoning models, both sequential inference-time scaling (generating more tokens) and parallel inference-time scaling (combining multiple parallel responses) improve accuracy. This is similar to how Grok-Heavy and Gemini-DeepThink work. In addition, there is work on making the reasoning more efficient, as we did in AutoThink: https://huggingface.co/blog/codelion/autothink
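To make the parallel-scaling idea concrete, here is a minimal sketch of self-consistency-style majority voting: sample several independent responses and keep the most common final answer. The `sample_answer` stub is hypothetical; a real setup would replace it with calls to the model (e.g. an OpenAI-compatible endpoint with temperature > 0), and the 70% accuracy it simulates is just an illustrative assumption.

```python
import random
from collections import Counter

def sample_answer(prompt: str, rng: random.Random) -> str:
    # Hypothetical stand-in for one model call; a real implementation
    # would query the model with temperature > 0 to get diverse samples.
    # Here we simulate a model that answers correctly ~70% of the time.
    return "42" if rng.random() < 0.7 else "41"

def majority_vote(prompt: str, n: int = 16, seed: int = 0) -> str:
    """Parallel inference-time scaling: draw n independent samples
    and return the most common final answer (majority voting)."""
    rng = random.Random(seed)
    answers = [sample_answer(prompt, rng) for _ in range(n)]
    return Counter(answers).most_common(1)[0][0]

print(majority_vote("What is 6 * 7?"))  # the majority answer wins
```

Even though any single sample is wrong ~30% of the time here, the aggregated answer is wrong far less often, which is the core reason parallel scaling still pays off on top of a model's own internal reasoning.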