Replies: 2 comments
-
You need to generate fake but plausible thinking blocks and mask them out so you don't override the model's thinking skills, but you still need the responses to respect the thinking blocks.
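A minimal sketch of that masking idea, assuming a Qwen3-style <think> ... </think> format and hand-built labels (the model name and placeholder thought text are illustrative assumptions, not an Axolotl API):

```python
# Splice a fake-but-plausible <think> block in front of the visible answer,
# then mask its tokens so they never contribute to the loss.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")

fake_thought = "<think>\nThe user wants a short factual answer.\n</think>\n"
answer = "Paris is the capital of France."

thought_ids = tokenizer(fake_thought, add_special_tokens=False)["input_ids"]
answer_ids = tokenizer(answer, add_special_tokens=False)["input_ids"]

input_ids = thought_ids + answer_ids
labels = [-100] * len(thought_ids) + answer_ids  # -100 is ignored by the CE loss
```

The thinking tokens still appear in the context the model conditions on, but only the answer tokens carry gradient, which is roughly what "mask them out" means here.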
-
@chingf, re: the linked PR, it was accidentally removed in a refactor and I've made a PR to add it back here: #2837. As for non-thinking mode, I haven't looked too much into it, so maybe Eric knows more about it.
-
I'd like to finetune a Qwen3 model in non-thinking mode. Is there a way to do this?
In another setting I'd usually do this by passing thinking=False as an argument into tokenizer.apply_chat_template. It seems like this was briefly supported after this pull request: #2694, but the change made by that pull request to src/axolotl/utils/schemas/config.py has since been deleted. Is there another suggested way to make sure the model is in non-thinking mode?
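For context, outside of Axolotl that usually looks like the sketch below, assuming the Qwen3 chat template exposes an enable_thinking kwarg (the exact kwarg name depends on the chat template shipped with the model; nothing here is an Axolotl config option):

```python
# Build a non-thinking prompt with plain transformers by passing the template
# kwarg through apply_chat_template.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen3-0.6B")

messages = [{"role": "user", "content": "What is the capital of France?"}]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=False,  # suppresses the <think>...</think> block in the prompt
)
print(text)
```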