
Conversation

Collaborator

@ochafik ochafik commented May 30, 2025

Fixes #13867

(Updated/simplified the webui accordingly. Edit: reverted the UI changes; will let @ngxson update it as a follow-up.)
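For readers following along, here is a minimal TypeScript sketch of what consuming the new streaming shape might look like on the client side: reasoning arrives in separate `reasoning_content` deltas instead of inline `<think>` tags. The exact chunk shape is assumed to follow the OpenAI-compatible streaming format; the type and function names are illustrative, not part of this PR.

```ts
// Sketch: accumulating a streamed assistant message when reasoning is sent
// as separate `reasoning_content` deltas rather than inline <think> tags.
interface StreamDelta {
  content?: string | null;
  reasoning_content?: string | null;
}

interface AccumulatedMessage {
  content: string;
  reasoningContent: string;
}

// Append whichever fields the current delta carries onto the running message.
function applyDelta(msg: AccumulatedMessage, delta: StreamDelta): AccumulatedMessage {
  return {
    content: msg.content + (delta.content ?? ''),
    reasoningContent: msg.reasoningContent + (delta.reasoning_content ?? ''),
  };
}

// Example: three chunks as they might arrive over SSE.
const chunks: StreamDelta[] = [
  { reasoning_content: 'The user asks for 2+2. ' },
  { reasoning_content: 'That is 4.' },
  { content: 'The answer is 4.' },
];

const finalMsg = chunks.reduce(applyDelta, { content: '', reasoningContent: '' });
console.log(finalMsg.reasoningContent); // "The user asks for 2+2. That is 4."
console.log(finalMsg.content);          // "The answer is 4."
```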

@github-actions github-actions bot added the testing, examples, and server labels May 30, 2025
@ochafik ochafik marked this pull request as ready for review May 31, 2025 00:02
@ochafik ochafik requested a review from ngxson as a code owner May 31, 2025 00:02
@github-actions github-actions bot added the python label May 31, 2025
Collaborator

@ngxson ngxson left a comment


I think I will redo the frontend changes in a more structured way; could you maybe revert these changes?

For now, I think it's worth adding a per-request reasoning format control.
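Purely as an illustration of the suggestion above, a per-request control could take the form of an extra field on the chat completion request body. The `reasoning_format` request field shown here is hypothetical and is not something this PR adds; this PR only touches the launch-time server setting.

```ts
// Hypothetical per-request reasoning format override (illustrative only,
// not part of this PR): a client-side request payload with the extra field.
const request = {
  model: 'any-model',
  messages: [{ role: 'user', content: 'Why is the sky blue?' }],
  stream: true,
  reasoning_format: 'deepseek', // hypothetical field, illustrative only
};

console.log(JSON.stringify(request, null, 2));
```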

Comment on lines 58 to 68
// for reasoning model, we split the message into content and thought
// TODO: implement this as remark/rehype plugin in the future
const { content, thought, isThinking }: SplitMessage = useMemo(() => {
if (msg.content === null || msg.role !== 'assistant') {
return { content: msg.content };
}
let actualContent = '';
let thought = '';
let isThinking = false;
let thinkSplit = msg.content.split('<think>', 2);
actualContent += thinkSplit[0];
Collaborator

If the user doesn't explicitly enable the deepseek reasoning format (which is the default behavior), this code is there to make sure the thinking is always parsed.

Unless we can control the thinking format per request, I don't think we can remove this code block.
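For context, here is a self-contained sketch of what the client-side splitting in the quoted hunk amounts to (the diff excerpt above is truncated). The function and type names are illustrative, not the exact webui code.

```ts
// Illustrative completion of the truncated hunk: split an assistant message
// into visible content and "thought" when the model emits inline
// <think>...</think> tags in its content.
interface SplitResult {
  content: string;
  thought: string;
  isThinking: boolean; // true while the closing </think> has not arrived yet
}

function splitThinkTags(raw: string): SplitResult {
  const openIdx = raw.indexOf('<think>');
  if (openIdx === -1) {
    return { content: raw, thought: '', isThinking: false };
  }
  const before = raw.slice(0, openIdx);
  const rest = raw.slice(openIdx + '<think>'.length);
  const closeIdx = rest.indexOf('</think>');
  if (closeIdx === -1) {
    // Still streaming the thought; nothing visible after it yet.
    return { content: before, thought: rest, isThinking: true };
  }
  return {
    content: before + rest.slice(closeIdx + '</think>'.length),
    thought: rest.slice(0, closeIdx),
    isThinking: false,
  };
}

console.log(splitThinkTags('<think>step 1</think>Answer: 4'));
// { content: 'Answer: 4', thought: 'step 1', isThinking: false }
```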

Collaborator Author

reverted UI changes


export type PendingMessage = Omit<Message, 'content'> & {
content: string | null;
reasoningContent: string | null;
Collaborator

Tbh I'm not very confident about adding this, because it is not future-proof.

If you look at how ChatGPT structures its messages, the content is an array instead of a string, and that is what I wanted to do in the near future. It would allow different message parts; for example, a message could contain reasoning, a text response, and a tool call.
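As a rough sketch of the array-of-parts direction described above (the part shapes are assumed for illustration, not an existing webui type):

```ts
// Sketch: message content as an array of typed parts, so reasoning, text,
// and tool calls can coexist in one assistant message.
type MessagePart =
  | { type: 'reasoning'; text: string }
  | { type: 'text'; text: string }
  | { type: 'tool_call'; name: string; arguments: string };

interface StructuredMessage {
  role: 'assistant' | 'user' | 'system';
  parts: MessagePart[];
}

const example: StructuredMessage = {
  role: 'assistant',
  parts: [
    { type: 'reasoning', text: 'The user wants the weather, so call the tool.' },
    { type: 'tool_call', name: 'get_weather', arguments: '{"city":"Paris"}' },
    { type: 'text', text: 'Let me check the weather in Paris.' },
  ],
};
console.log(example.parts.length); // 3
```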

"(default: deepseek)",
[](common_params & params, const std::string & value) {
/**/ if (value == "deepseek") { params.reasoning_format = COMMON_REASONING_FORMAT_DEEPSEEK; }
else if (value == "deepseek-legacy") { params.reasoning_format = COMMON_REASONING_FORMAT_DEEPSEEK_LEGACY; }
Collaborator

should we add a help message to explain this mode?

@ochafik ochafik merged commit c9bbc77 into ggml-org:master Jun 2, 2025
48 checks passed
furyhawk pushed a commit to furyhawk/llama.cpp that referenced this pull request Jun 6, 2025
… diffs) (ggml-org#13933)

* server: update deepseek reasoning format (now in reasoning_content diffs), add legacy option for compat
* update unit/test_tool_call.py::test_thoughts

Labels

examples, python (python script changes), server, testing (Everything test related)


Development

Successfully merging this pull request may close these issues.

Misc. bug: Reasoning content is not separated when streaming
