fix: use native Ollama API endpoints instead of OpenAI-compatible routes #7071

roomote · 2025-08-14T01:47:34Z

This PR fixes issue #7070 where Ollama models were incorrectly using OpenAI-compatible routes instead of native Ollama API endpoints.

Problem

When using models like gpt-oss:120b with Ollama, the plugin was trying to get completions from OpenAI routes (/v1) instead of the native Ollama /api/chat endpoint.

Solution

Replaced the OpenAI client library with direct axios calls to Ollama's native API
Now using /api/chat endpoint for chat completions instead of /v1 OpenAI-compatible endpoint
Properly handle streaming responses from Ollama's native API format
Maintain backward compatibility with existing configurations

Changes Made

Modified src/api/providers/ollama.ts:
- Removed dependency on OpenAI client
- Implemented direct HTTP calls to Ollama's /api/chat endpoint using axios
- Added proper message format conversion from Anthropic to Ollama format
- Implemented streaming response handling for Ollama's native format
- Added proper error handling for Ollama-specific errors
Updated tests:
- src/api/providers/__tests__/ollama.spec.ts: Updated to mock axios instead of OpenAI client
- src/api/providers/__tests__/ollama-timeout.spec.ts: Updated timeout tests to work with new implementation

Testing

All existing tests pass ✅
Timeout configuration tests pass ✅
Linting and type checking pass ✅

Fixes #7070

Important

This PR updates OllamaHandler to use native Ollama API endpoints with axios, replacing OpenAI-compatible routes, and updates tests to reflect these changes.

Behavior:
- Replaces OpenAI-compatible routes with native Ollama API endpoints in OllamaHandler.
- Uses /api/chat endpoint for chat completions.
- Handles streaming responses and error scenarios specific to Ollama.
- Maintains backward compatibility with existing configurations.
Implementation:
- Removes OpenAI client dependency, uses axios for HTTP requests in ollama.ts.
- Converts message formats from Anthropic to Ollama.
- Implements error handling for connection and model not found errors.
Testing:
- Updates ollama.spec.ts and ollama-timeout.spec.ts to mock axios and test new implementation.
- Verifies timeout configurations and error handling.
- Ensures all existing tests pass.

^{This description was created by}^{for 14c33f8. You can customize this summary. It will automatically update as commits are pushed.}

- Replace OpenAI client with direct axios calls to Ollama native API - Use /api/chat endpoint instead of /v1 OpenAI-compatible endpoint - Update tests to reflect the new native API implementation - Maintain backward compatibility with existing configurations Fixes #7070

ellipsis-dev · 2025-08-14T01:49:25Z

src/api/providers/ollama.ts

+					}) as const,
+			)
+
+			let buffer = ""


Unused variable 'buffer' declared here isn't used; please remove for clarity.

Suggested change

let buffer = ""

ellipsis-dev · 2025-08-14T01:49:25Z

src/api/providers/ollama.ts

-				for (const matcherChunk of matcher.update(delta.content)) {
-					yield matcherChunk
+
+		// Convert Anthropic messages to Ollama format


The inline conversion of Anthropic messages to Ollama format (lines 82–117) could be extracted into a utility function for improved readability and maintainability.

^{This comment was generated because it violated a code review rule: irule_tTqpIuNs8DV0QFGj.}

roomote

Reviewing my own code is like grading my own homework - I already know where I cut corners.

roomote · 2025-08-14T01:52:07Z

src/api/providers/ollama.ts

+					}) as const,
+			)
+
+			let buffer = ""


Is this buffer variable intentional? It's declared but never used. Could we remove it to keep the code clean?

roomote · 2025-08-14T01:52:07Z

src/api/providers/ollama.ts

+		const ollamaMessages: OllamaMessage[] = [{ role: "system", content: systemPrompt }]
+
+		// Convert messages to Ollama format
+		for (const message of messages) {


I notice we check for deepseek-r1 models and adjust temperature, but unlike the previous OpenAI implementation, we don't apply R1 format conversion to the messages. Is this intentional? DeepSeek R1 models might need special message formatting for optimal performance.

roomote · 2025-08-14T01:52:07Z

src/api/providers/ollama.ts

+				} else {
+					throw new Error(`Ollama completion error: ${error.message}`)
+				}
+			}
 			if (error instanceof Error) {


This error handling seems redundant - axios errors are already handled above. Could we remove this duplicate handler to simplify the error flow?

roomote · 2025-08-14T01:52:08Z

src/api/providers/ollama.ts

+	model: string
+	created_at: string
+	message: {
+		role: string


Consider using the more specific type "system" | "user" | "assistant" instead of string for better type safety?

roomote · 2025-08-14T01:52:08Z

src/api/providers/ollama.ts

-			apiKey: "ollama",
-			timeout: getApiRequestTimeout(),
-		})
+		this.baseUrl = this.options.ollamaBaseUrl || "http://localhost:11434"


The default Ollama URL appears in multiple places. Would it be cleaner to extract this to a constant like const DEFAULT_OLLAMA_BASE_URL = "http://localhost:11434"?

roomote bot requested review from mrubens, cte and jr as code owners August 14, 2025 01:47

github-project-automation bot added this to Roo Code Roadmap Aug 14, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Aug 14, 2025

github-project-automation bot added this to Roo Code Roadmap Aug 14, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Aug 14, 2025

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. bug Something isn't working labels Aug 14, 2025

ellipsis-dev bot reviewed Aug 14, 2025

View reviewed changes

roomote bot commented Aug 14, 2025

View reviewed changes

roomote bot mentioned this pull request Aug 14, 2025

Wrong routes for Ollama models #7070

Open

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Aug 14, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: use native Ollama API endpoints instead of OpenAI-compatible routes #7071

fix: use native Ollama API endpoints instead of OpenAI-compatible routes #7071

roomote bot commented Aug 14, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

ellipsis-dev bot Aug 14, 2025

Uh oh!

ellipsis-dev bot Aug 14, 2025

Uh oh!

roomote bot left a comment

Uh oh!

roomote bot Aug 14, 2025

Uh oh!

roomote bot Aug 14, 2025

Uh oh!

roomote bot Aug 14, 2025

Uh oh!

roomote bot Aug 14, 2025

Uh oh!

roomote bot Aug 14, 2025

Uh oh!

Uh oh!

fix: use native Ollama API endpoints instead of OpenAI-compatible routes #7071

Are you sure you want to change the base?

fix: use native Ollama API endpoints instead of OpenAI-compatible routes #7071

Conversation

roomote bot commented Aug 14, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Changes Made

Testing

Uh oh!

ellipsis-dev bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

ellipsis-dev bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot left a comment

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

roomote bot Aug 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

roomote bot commented Aug 14, 2025 •

edited by ellipsis-dev bot

Loading