fix: support both <think> and <thinking> tags for LM Studio GPT-OSS models #6751

roomote · 2025-08-06T15:12:56Z

This PR fixes the issue where LM Studio GPT-OSS models were showing raw model responses instead of properly formatted thinking blocks.

Problem

LM Studio GPT-OSS models use <thinking> tags for their reasoning content, but the current implementation only looks for <think> tags. This causes the thinking blocks to not be parsed correctly, resulting in the raw response being displayed.

Solution

Created a new MultiTagXmlMatcher utility that can handle multiple XML tag names
Updated the LM Studio handler to use MultiTagXmlMatcher with both ["think", "thinking"] tags
Added comprehensive tests for the new functionality

Testing

Added unit tests for MultiTagXmlMatcher covering various scenarios
Added specific tests in the LM Studio handler test suite for both <think> and <thinking> tag parsing
All tests pass successfully

Fixes #6750

Important

Adds support for <thinking> tags in LM Studio GPT-OSS models by introducing MultiTagXmlMatcher and updating LmStudioHandler.

Behavior:
- Updates LmStudioHandler in lm-studio.ts to use MultiTagXmlMatcher for parsing both <think> and <thinking> tags.
- Ensures reasoning content is correctly extracted and displayed.
Utilities:
- Introduces MultiTagXmlMatcher in multi-tag-xml-matcher.ts to handle multiple XML tags.
Testing:
- Adds unit tests for MultiTagXmlMatcher in multi-tag-xml-matcher.spec.ts.
- Updates lmstudio.spec.ts to test <think> and <thinking> tag handling in LmStudioHandler.

^{This description was created by}^{for 924a793. You can customize this summary. It will automatically update as commits are pushed.}

…odels - Created MultiTagXmlMatcher utility to handle multiple XML tag names - Updated LM Studio handler to parse both <think> and <thinking> tags - Added comprehensive tests for the new functionality - Fixes #6750

ellipsis-dev · 2025-08-06T15:16:33Z

src/utils/multi-tag-xml-matcher.ts

+				if (char === "<") {
+					// Emit any text before the tag
+					if (i > this.lastEmittedIndex) {
+						this.emit(false, this.buffer.substring(this.lastEmittedIndex, i))


When a '<' is encountered in TEXT state, the code unconditionally emits the preceding text. This can duplicate text that’s being collected as part of a matched tag. Consider emitting plain text only when depth is 0 (i.e. not inside a matching tag).

ellipsis-dev · 2025-08-06T15:16:33Z

src/utils/multi-tag-xml-matcher.ts

+	constructor(
+		private tagNames: string[],
+		private transform?: (chunks: XmlMatcherResult) => Result,
+		private position = 0,


The constructor parameter 'position' is declared but never used. Consider removing it if not needed.

Suggested change

private position = 0,

FringeNet · 2025-08-06T15:17:02Z

Does not use tags.

curl -X POST http://localhost:1234/v1/chat/completions   -H "Content-Type: application/json"   -d '{
    "model": "unsloth/gpt-oss-20b-GGUF/gpt-oss-20b-Q4_K_M.gguf",
    "messages": [{"role": "user", "content": "What is 1+1?"}],
    "reasoning_effort": "high"
  }'

Returns:

{
  "id": "chatcmpl-abxh306fwqfvcdb3zprm0d",
  "object": "chat.completion",
  "created": 1754493323,
  "model": "gpt-oss-20b@q4_k_m",
  "choices": [
    {
      "index": 0,
      "logprobs": null,
      "finish_reason": "stop",
      "message": {
        "role": "assistant",
        "content": "The user asks a simple math question: \"What is 1+1?\" The answer is 2. Provide the result and maybe mention basic addition. Probably just answer 2.\n\nWe should comply with policy: It's a direct factual answer. No disallowed content. We can provide short response.\\(1 + 1 = 2\\)."
      }
    }
  ],
  "usage": {
    "prompt_tokens": 74,
    "completion_tokens": 69,
    "total_tokens": 143
  },
  "stats": {},
  "system_fingerprint": "gpt-oss-20b@q4_k_m"

roomote

I reviewed my own code and found bugs I didn't know I wrote.

roomote · 2025-08-06T15:17:40Z

src/utils/multi-tag-xml-matcher.ts

+
+			// If we're inside a matched tag, collect the content
+			if (this.depth > 0 && this.state === "TEXT" && i >= this.lastEmittedIndex) {
+				this.matchedContent += char


Is this intentional? The character-by-character content collection in the matched tag could be inefficient for large responses. Consider buffering larger chunks for better performance:

Suggested change

this.matchedContent += char

// If we're inside a matched tag, collect the content

if (this.depth > 0 && this.state === "TEXT") {

const remainingContent = this.buffer.substring(i)

this.matchedContent += remainingContent

i = this.buffer.length - 1

}

roomote · 2025-08-06T15:17:40Z

src/utils/multi-tag-xml-matcher.ts

+								this.lastEmittedIndex = i + 1
+								this.matchedContent = "" // Reset matched content
+							}
+							this.depth++


Could we handle nested tags of the same type? The current implementation might not correctly parse cases like <think>outer <think>inner</think> outer</think>. The depth tracking seems to assume all matched tags are the same, but with multiple tag names, nested different tags would work while nested same tags might not.

roomote · 2025-08-06T15:17:41Z

src/utils/__tests__/multi-tag-xml-matcher.spec.ts

+		const emptyBlocks = allResults.filter((r) => r.matched && r.data === "")
+		expect(emptyBlocks.length).toBeGreaterThan(0)
+	})
+})


Could we add tests for streaming behavior with partial chunks? For example, when a tag is split across chunks like receiving <thi in one chunk and nking>content</thinking> in the next. This would ensure the matcher handles real-world streaming scenarios correctly.

roomote · 2025-08-06T15:17:41Z

src/utils/multi-tag-xml-matcher.ts

+				} else if (char !== "/" || this.currentTag.length > 0) {
+					this.currentTag += char
+				} else {
+					this.currentTag += char


This code appears to be duplicated. Could we simplify it to:

Suggested change

this.currentTag += char

} else {

this.currentTag += char

}

roomote · 2025-08-06T15:17:41Z

src/utils/multi-tag-xml-matcher.ts

+	constructor(
+		private tagNames: string[],
+		private transform?: (chunks: XmlMatcherResult) => Result,
+		private position = 0,


Is the position parameter needed? It's accepted in the constructor but never used in the implementation, unlike the original XmlMatcher. If it's not needed, we could remove it to avoid confusion.

roomote · 2025-08-06T15:17:41Z

src/utils/multi-tag-xml-matcher.ts

@@ -0,0 +1,141 @@
+import { XmlMatcherResult } from "./xml-matcher"
+
+/**


Could we add JSDoc with usage examples? It would help developers understand how to use this with different tag combinations:

Suggested change

/**

/**

* A multi-tag XML matcher that can match multiple tag names.

* This is useful for handling different thinking tag formats from various models.

*

* @example

* // Match both <think> and <thinking> tags

* const matcher = new MultiTagXmlMatcher(['think', 'thinking'])

* const results = matcher.update('<think>Hello</think> world <thinking>Hi</thinking>')

*/

daniel-lxs · 2025-08-07T00:52:41Z

Not a proper fix, closing for now

roomote bot requested review from cte, jr and mrubens as code owners August 6, 2025 15:12

github-project-automation bot moved this to Triage in Roo Code Roadmap Aug 6, 2025

github-project-automation bot added this to Roo Code Roadmap Aug 6, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Aug 6, 2025

github-project-automation bot added this to Roo Code Roadmap Aug 6, 2025

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. bug Something isn't working labels Aug 6, 2025

roomote bot mentioned this pull request Aug 6, 2025

[BUG] LM-Studio GPT-OSS Harmony Rendering #6750

Closed

ellipsis-dev bot reviewed Aug 6, 2025

View reviewed changes

roomote bot commented Aug 6, 2025

View reviewed changes

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Aug 6, 2025

daniel-lxs closed this Aug 7, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Aug 7, 2025

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Aug 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: support both <think> and <thinking> tags for LM Studio GPT-OSS models #6751

fix: support both <think> and <thinking> tags for LM Studio GPT-OSS models #6751

Uh oh!

roomote bot commented Aug 6, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

ellipsis-dev bot Aug 6, 2025

Uh oh!

ellipsis-dev bot Aug 6, 2025

Uh oh!

FringeNet commented Aug 6, 2025

Uh oh!

roomote bot left a comment

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

roomote bot Aug 6, 2025

Uh oh!

daniel-lxs commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

-				this.matchedContent += char
+			// If we're inside a matched tag, collect the content
+			if (this.depth > 0 && this.state === "TEXT") {
+				const remainingContent = this.buffer.substring(i)
+				this.matchedContent += remainingContent
+				i = this.buffer.length - 1
+			}

		@@ -0,0 +1,141 @@
		import { XmlMatcherResult } from "./xml-matcher"

		/**

-/**
+/**
+ * A multi-tag XML matcher that can match multiple tag names.
+ * This is useful for handling different thinking tag formats from various models.
+ *
+ * @example
+ * // Match both <think> and <thinking> tags
+ * const matcher = new MultiTagXmlMatcher(['think', 'thinking'])
+ * const results = matcher.update('<think>Hello</think> world <thinking>Hi</thinking>')
+ */

fix: support both <think> and <thinking> tags for LM Studio GPT-OSS models #6751

fix: support both <think> and <thinking> tags for LM Studio GPT-OSS models #6751

Uh oh!

Conversation

roomote bot commented Aug 6, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Testing

Uh oh!

ellipsis-dev bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

ellipsis-dev bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

FringeNet commented Aug 6, 2025

Uh oh!

roomote bot left a comment

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 6, 2025

Choose a reason for hiding this comment

Uh oh!

daniel-lxs commented Aug 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

roomote bot commented Aug 6, 2025 •

edited by ellipsis-dev bot

Loading