Commit d585a14

prefill cannot be used with thinking models
1 parent bd859ed commit d585a14

1 file changed: +1 -1 lines changed

tools/server/utils.hpp

Lines changed: 1 addition & 1 deletion
@@ -766,7 +766,7 @@ static json oaicompat_chat_params_parse(
         /* TODO: test this properly */
         inputs.reasoning_format = COMMON_REASONING_FORMAT_NONE;
 
-        if (inputs.chat_template_kwargs.find("enable_thinking") != inputs.chat_template_kwargs.end()) {
+        if (inputs.enable_thinking || inputs.chat_template_kwargs.find("enable_thinking") != inputs.chat_template_kwargs.end()) {
             throw std::runtime_error("Assistant response prefill is incompatible with enable_thinking.");
         }
 
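
The patched condition can be exercised in isolation with a minimal sketch. The `prefill_inputs` struct and the two helper functions below are hypothetical stand-ins, not the server's real types; the sketch assumes `enable_thinking` is a plain `bool` and that `chat_template_kwargs` behaves like a string-keyed map. Only the condition and the error message are taken from the diff.

```cpp
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for the server's template inputs; only the two
// fields read by the guard are modeled (assumed types, not the real struct).
struct prefill_inputs {
    bool enable_thinking = false;
    std::map<std::string, std::string> chat_template_kwargs;
};

// Same condition as the patched line: true when prefill must be rejected.
// Note that the map lookup tests for the key's presence, not its value.
bool prefill_conflicts_with_thinking(const prefill_inputs & inputs) {
    return inputs.enable_thinking ||
           inputs.chat_template_kwargs.find("enable_thinking") != inputs.chat_template_kwargs.end();
}

// Throws with the same message as the patched code when the guard fires.
void check_prefill(const prefill_inputs & inputs) {
    if (prefill_conflicts_with_thinking(inputs)) {
        throw std::runtime_error("Assistant response prefill is incompatible with enable_thinking.");
    }
}
```

Under these assumptions, the pre-patch check would have let a request through when `inputs.enable_thinking` was true but no `enable_thinking` key was passed in `chat_template_kwargs`; the added `||` clause closes that gap. Because the lookup only tests for the key, a request that passes `enable_thinking` with a false value in `chat_template_kwargs` is still rejected, both before and after this change.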
