
Conversation

@roomote roomote bot commented Oct 9, 2025

Description

This PR attempts to address Issue #8575, where Roo Code errors out when using the Kwaipilot/KAT-Dev Q8 model with LM Studio or Jan.ai.

Changes

Enhanced Error Handling

  • Added specific error detection for connection failures (ECONNREFUSED/ENOTFOUND)
  • Added model not found error handling with clear user guidance
  • Added context length exceeded error detection with actionable advice
  • Improved error messages to be more user-friendly and actionable

Test Coverage

  • Added comprehensive test cases for all new error scenarios
  • Tests cover connection errors, model not found, and context length issues

Technical Details

The changes enhance the error handling in the LmStudioHandler class to provide more specific and helpful error messages when issues occur. This should help users:

  1. Understand when LM Studio is not running or accessible
  2. Know when a model is not loaded in LM Studio
  3. Recognize context length limitations and how to address them

Limitations

While this PR improves error handling significantly, the original issue report lacks specific error details. The implementation addresses common error scenarios that could cause Roo Code to "error out" with local models, but without the exact error messages from the Kwaipilot model, we cannot guarantee this fully resolves the specific compatibility issue.

Testing

  • ✅ All existing tests pass
  • ✅ New tests added for error scenarios
  • ✅ Linting and type checking pass

Next Steps

If this doesn't fully resolve the issue, we may need:

  • Specific error logs from users experiencing the Kwaipilot model issue
  • Model-specific configuration or handling for Kwaipilot/KAT-Dev models

Fixes #8575

Feedback Welcome

This PR represents an attempt to improve the user experience when errors occur with LM Studio models. Feedback and testing with the actual Kwaipilot/KAT-Dev model would be greatly appreciated.


Important

Improves error handling in LmStudioHandler for connection, model not found, and context length errors, with comprehensive test coverage.

  • Error Handling:
    • LmStudioHandler in lm-studio.ts now detects connection errors (ECONNREFUSED, ENOTFOUND) and throws specific messages.
    • Handles "model not found" errors with guidance to ensure the model is loaded.
    • Detects context length errors and advises on using models with larger context windows.
    • Uses enhanced error handler for other errors.
  • Testing:
    • lmstudio.spec.ts adds tests for connection errors, model not found, context length, and generic API errors.
    • Tests ensure specific error messages are thrown for each scenario.

This description was created by Ellipsis for e407b1e. You can customize this summary. It will automatically update as commits are pushed.

- Add specific error detection for connection failures
- Add model not found error handling
- Add context length exceeded error handling
- Provide clearer error messages for debugging
- Update tests to cover new error scenarios

Fixes #8575
@roomote roomote bot requested review from cte, jr and mrubens as code owners October 9, 2025 03:51
@dosubot dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files.) and bug (Something isn't working) labels Oct 9, 2025

@roomote roomote bot left a comment

Reviewing my own code feels like debugging a mirror - every flaw reflects back with uncomfortable clarity.

errorMessage.includes("context") ||
errorMessage.includes("token") ||
errorMessage.includes("length")
) {

P1: Overly broad error detection pattern

The context length error detection will incorrectly match ANY error containing "context", "token", or "length". This creates false positives for unrelated errors like:

  • "Invalid token format"
  • "Response length too short"
  • "Authentication token expired"

Consider using more specific patterns that match actual LM Studio context length errors, such as checking for phrases like "context length exceeded", "maximum context", or "token limit".

Suggested change
-) {
+// Check for context length errors
+if (
+	errorMessage.toLowerCase().includes("context length exceeded") ||
+	errorMessage.toLowerCase().includes("maximum context") ||
+	errorMessage.toLowerCase().includes("token limit exceeded")
+) {

Comment on lines 99 to 133
} catch (error) {
	// Handle specific error cases
	const errorMessage = error instanceof Error ? error.message : String(error)

	// Check for connection errors
	if (errorMessage.includes("ECONNREFUSED") || errorMessage.includes("ENOTFOUND")) {
		throw new Error(
			`Cannot connect to LM Studio at ${this.options.lmStudioBaseUrl || "http://localhost:1234"}. Please ensure LM Studio is running and the server is started.`,
		)
	}

	// Check for model not found errors
	if (
		errorMessage.includes("model") &&
		(errorMessage.includes("not found") || errorMessage.includes("does not exist"))
	) {
		throw new Error(
			`Model "${this.getModel().id}" not found in LM Studio. Please ensure the model is loaded in LM Studio.`,
		)
	}

	// Check for context length errors
	if (
		errorMessage.includes("context") ||
		errorMessage.includes("token") ||
		errorMessage.includes("length")
	) {
		throw new Error(
			`Context length exceeded for model "${this.getModel().id}". Please load the model with a larger context window in LM Studio, or use a different model that supports longer contexts.`,
		)
	}

	// Use the enhanced error handler for other OpenAI-like errors
	throw handleOpenAIError(error, this.providerName)
}

P2: Code duplication - extract error handling logic

The error handling logic (lines 100-132) is duplicated in completePrompt() (lines 219-251). This violates DRY principles and makes maintenance harder. Consider extracting this into a private method like handleLmStudioError(error: unknown): never that both methods can use.
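
A minimal sketch of such a helper, assembled from the checks quoted in the hunk above (the method name handleLmStudioError is illustrative, and the context-length patterns use the narrower forms suggested in the P1 comment):

	// Hypothetical private method on LmStudioHandler; the `never` return type
	// documents that it always throws.
	private handleLmStudioError(error: unknown): never {
		const errorMessage = error instanceof Error ? error.message : String(error)

		// Connection refused / host not found
		if (errorMessage.includes("ECONNREFUSED") || errorMessage.includes("ENOTFOUND")) {
			throw new Error(
				`Cannot connect to LM Studio at ${this.options.lmStudioBaseUrl || "http://localhost:1234"}. Please ensure LM Studio is running and the server is started.`,
			)
		}

		// Model not loaded in LM Studio
		if (
			errorMessage.includes("model") &&
			(errorMessage.includes("not found") || errorMessage.includes("does not exist"))
		) {
			throw new Error(
				`Model "${this.getModel().id}" not found in LM Studio. Please ensure the model is loaded in LM Studio.`,
			)
		}

		// Context window exhausted (narrowed patterns per the P1 comment)
		const lower = errorMessage.toLowerCase()
		if (
			lower.includes("context length exceeded") ||
			lower.includes("maximum context") ||
			lower.includes("token limit exceeded")
		) {
			throw new Error(
				`Context length exceeded for model "${this.getModel().id}". Please load the model with a larger context window in LM Studio, or use a different model that supports longer contexts.`,
			)
		}

		// Everything else goes through the shared OpenAI-style handler
		throw handleOpenAIError(error, this.providerName)
	}

Both createMessage() and completePrompt() could then reduce their catch blocks to a single this.handleLmStudioError(error) call.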

Comment on lines 172 to 182
} catch (error) {
	// If error was already processed and re-thrown above, just re-throw it
	if (error instanceof Error && error.message.includes("LM Studio")) {
		throw error
	}

	// Generic fallback error
	throw new Error(
-		"Please check the LM Studio developer logs to debug what went wrong. You may need to load the model with a larger context length to work with Roo Code's prompts.",
+		`LM Studio error: ${error instanceof Error ? error.message : String(error)}. Please check the LM Studio developer logs for more details.`,
	)
}

P2: Inconsistent error handling flow

This catch block re-checks if the error message contains "LM Studio", but the specific error handlers above already throw errors with "LM Studio" in the message. This means:

  1. Connection errors, model not found, and context errors will be re-thrown here (lines 174-176)
  2. Errors from handleOpenAIError() will hit the generic fallback (lines 179-181)

However, handleOpenAIError() returns errors with format "LM Studio completion error: ..." which DOES contain "LM Studio", so they'll also be re-thrown at line 175, never reaching the generic fallback.

The generic fallback at lines 179-181 appears unreachable. Consider simplifying this logic or removing the unreachable code.
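
One possible simplification, sketched under the assumption argued above (every inner handler already throws a final, user-facing message), is to drop the string re-check and rethrow unchanged:

	} catch (error) {
		// Every error thrown inside the try block has already been mapped to a
		// user-facing "LM Studio ..." message by the inner handlers, so a
		// generic fallback can never fire here. Rethrow as-is.
		throw error
	}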

	for await (const _chunk of stream) {
		// Should not reach here
	}
}).rejects.toThrow("LM Studio completion error")

P3: Test expects wrong error message

The test expects "LM Studio completion error" but based on the implementation:

  1. The error "Unknown API Error" doesn't match connection/model/context patterns
  2. It gets passed to handleOpenAIError() which returns "LM Studio completion error: Unknown API Error"
  3. This error contains "LM Studio" so it's re-thrown at line 174-176
  4. The actual error message should be "LM Studio completion error: Unknown API Error"

The test should verify the complete error message or use .toMatch() for partial matching.
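
For example, the assertion could pin the complete message, or make the partial match explicit with a regular expression (both forms illustrative, using the message derived above):

	// Exact message produced via handleOpenAIError():
	}).rejects.toThrow("LM Studio completion error: Unknown API Error")

	// Or explicit partial matching:
	}).rejects.toThrow(/LM Studio completion error/)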

Comment on lines +156 to 166
it("should handle generic API errors", async () => {
mockCreate.mockRejectedValueOnce(new Error("Unknown API Error"))

const stream = handler.createMessage(systemPrompt, messages)

await expect(async () => {
for await (const _chunk of stream) {
// Should not reach here
}
}).rejects.toThrow("LM Studio completion error")
})

P3: Missing test coverage for false positives

The tests don't verify that the error detection correctly distinguishes between different error types. Add tests for errors that contain "token", "context", or "length" but aren't actually context length errors, such as:

  • "Invalid token format"
  • "Authentication token expired"
  • "Response length mismatch"

These should NOT trigger the context length error message.
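
A sketch of one such negative test, reusing the fixtures from the quoted spec (the test name and assertion style are illustrative):

	it("should not misclassify unrelated token errors as context length errors", async () => {
		mockCreate.mockRejectedValueOnce(new Error("Invalid token format"))

		const stream = handler.createMessage(systemPrompt, messages)

		let caught: Error | undefined
		try {
			for await (const _chunk of stream) {
				// Should not reach here
			}
		} catch (error) {
			caught = error as Error
		}

		// The error should still surface, but not as the context-length message.
		expect(caught).toBeDefined()
		expect(caught?.message).not.toContain("Context length exceeded")
	})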

@hannesrudolph hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Oct 9, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Oct 28, 2025
@github-project-automation github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Oct 28, 2025
@hannesrudolph hannesrudolph reopened this Oct 28, 2025
@github-project-automation github-project-automation bot moved this from Done to New in Roo Code Roadmap Oct 28, 2025
@github-project-automation github-project-automation bot moved this from Done to Triage in Roo Code Roadmap Oct 28, 2025
@github-project-automation github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Oct 28, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Oct 28, 2025

roomote bot commented Oct 28, 2025

Code Review Summary

I've completed the review and identified the following issues:

Issues Found

  • P1: Overly broad context length error detection - Lines 121-125 and 240-244 match ANY error containing "context", "token", or "length", creating false positives for unrelated errors like "Invalid token format" or "Authentication token expired". Use more specific patterns.

  • P2: Code duplication - Error handling logic is duplicated between createMessage() (lines 100-132) and completePrompt() (lines 219-251). Extract into a shared private method.

  • P2: Inconsistent error handling flow - The outer catch blocks (lines 172-181, 254-263) re-check for "LM Studio" in error messages, but all errors from inner handlers already contain this text, making the generic fallback unreachable.

  • P3: Test expects partial error message - Line 165 expects "LM Studio completion error" but the actual message is "LM Studio completion error: Unknown API Error". Test should verify the complete message or use .toMatch().

  • P3: Missing test coverage for false positives - Tests don't verify that errors like "Invalid token format" don't incorrectly trigger context length errors.

Follow Along on Roo Code Cloud

try {
	results = await this.client.chat.completions.create(params)
} catch (error) {
	// Handle specific error cases

Consider refactoring duplicated error handling logic into a helper function. The same error checks (connection errors, model not found, and context length) appear in both createMessage and completePrompt, which can lead to maintenance challenges.

Comment on lines +120 to +125
// Check for context length errors
if (
	errorMessage.includes("context") ||
	errorMessage.includes("token") ||
	errorMessage.includes("length")
) {

The context length error detection will incorrectly match ANY error containing "context", "token", or "length". This creates false positives for unrelated errors like "Invalid token format", "Response length too short", or "Authentication token expired". Consider using more specific patterns that match actual LM Studio context length errors, such as checking for phrases like "context length exceeded", "maximum context", or "token limit".

Suggested change
-// Check for context length errors
-if (
-	errorMessage.includes("context") ||
-	errorMessage.includes("token") ||
-	errorMessage.includes("length")
-) {
+// Check for context length errors
+if (
+	errorMessage.toLowerCase().includes("context length exceeded") ||
+	errorMessage.toLowerCase().includes("maximum context") ||
+	errorMessage.toLowerCase().includes("token limit exceeded")
+) {

Comment on lines +100 to 132
// Handle specific error cases
const errorMessage = error instanceof Error ? error.message : String(error)

// Check for connection errors
if (errorMessage.includes("ECONNREFUSED") || errorMessage.includes("ENOTFOUND")) {
	throw new Error(
		`Cannot connect to LM Studio at ${this.options.lmStudioBaseUrl || "http://localhost:1234"}. Please ensure LM Studio is running and the server is started.`,
	)
}

// Check for model not found errors
if (
	errorMessage.includes("model") &&
	(errorMessage.includes("not found") || errorMessage.includes("does not exist"))
) {
	throw new Error(
		`Model "${this.getModel().id}" not found in LM Studio. Please ensure the model is loaded in LM Studio.`,
	)
}

// Check for context length errors
if (
	errorMessage.includes("context") ||
	errorMessage.includes("token") ||
	errorMessage.includes("length")
) {
	throw new Error(
		`Context length exceeded for model "${this.getModel().id}". Please load the model with a larger context window in LM Studio, or use a different model that supports longer contexts.`,
	)
}

// Use the enhanced error handler for other OpenAI-like errors
throw handleOpenAIError(error, this.providerName)

The error handling logic is duplicated in completePrompt() (lines 219-251). This violates DRY principles and makes maintenance harder. Consider extracting this into a private method like handleLmStudioError(error: unknown): never that both methods can use.

Comment on lines 172 to 181
} catch (error) {
	// If error was already processed and re-thrown above, just re-throw it
	if (error instanceof Error && error.message.includes("LM Studio")) {
		throw error
	}

	// Generic fallback error
	throw new Error(
-		"Please check the LM Studio developer logs to debug what went wrong. You may need to load the model with a larger context length to work with Roo Code's prompts.",
+		`LM Studio error: ${error instanceof Error ? error.message : String(error)}. Please check the LM Studio developer logs for more details.`,
	)

This catch block re-checks if the error message contains "LM Studio", but the specific error handlers above already throw errors with "LM Studio" in the message (connection, model not found, context errors), and handleOpenAIError() returns errors with format "LM Studio completion error: ...". All these errors will be re-thrown at lines 174-176, so the generic fallback at lines 179-181 appears unreachable. Consider simplifying this logic or removing the unreachable code.

	for await (const _chunk of stream) {
		// Should not reach here
	}
}).rejects.toThrow("LM Studio completion error")

The test expects "LM Studio completion error" but based on the implementation, the error "Unknown API Error" doesn't match connection/model/context patterns, so it gets passed to handleOpenAIError() which returns "LM Studio completion error: Unknown API Error". This error contains "LM Studio" so it's re-thrown at lines 174-176. The actual error message should be "LM Studio completion error: Unknown API Error". The test should verify the complete error message or use .toMatch() for partial matching.

Comment on lines +117 to 166
it("should handle connection errors", async () => {
const connectionError = new Error("connect ECONNREFUSED 127.0.0.1:1234")
mockCreate.mockRejectedValueOnce(connectionError)

const stream = handler.createMessage(systemPrompt, messages)

await expect(async () => {
for await (const _chunk of stream) {
// Should not reach here
}
}).rejects.toThrow("Please check the LM Studio developer logs to debug what went wrong")
}).rejects.toThrow("Cannot connect to LM Studio at http://localhost:1234")
})

it("should handle model not found errors", async () => {
const modelError = new Error("model 'local-model' not found")
mockCreate.mockRejectedValueOnce(modelError)

const stream = handler.createMessage(systemPrompt, messages)

await expect(async () => {
for await (const _chunk of stream) {
// Should not reach here
}
}).rejects.toThrow('Model "local-model" not found in LM Studio')
})

it("should handle context length errors", async () => {
const contextError = new Error("context length exceeded")
mockCreate.mockRejectedValueOnce(contextError)

const stream = handler.createMessage(systemPrompt, messages)

await expect(async () => {
for await (const _chunk of stream) {
// Should not reach here
}
}).rejects.toThrow("Context length exceeded")
})

it("should handle generic API errors", async () => {
mockCreate.mockRejectedValueOnce(new Error("Unknown API Error"))

const stream = handler.createMessage(systemPrompt, messages)

await expect(async () => {
for await (const _chunk of stream) {
// Should not reach here
}
}).rejects.toThrow("LM Studio completion error")
})

The tests don't verify that the error detection correctly distinguishes between different error types. Add tests for errors that contain "token", "context", or "length" but aren't actually context length errors, such as "Invalid token format", "Authentication token expired", or "Response length mismatch". These should NOT trigger the context length error message.

@roomote roomote bot left a comment

Review complete. Found 5 issues that should be addressed before merging.
