Conversation

@daniel-lxs daniel-lxs (Collaborator) commented Aug 15, 2025

Description

This PR fixes issue #7070 where Ollama models (like gpt-oss:120b) were incorrectly using OpenAI-compatible routes instead of native Ollama API endpoints.

Problem

When using models like gpt-oss:120b with Ollama, the plugin tried to get completions from the OpenAI-compatible routes (/v1) instead of the native Ollama API endpoints. The issue was reported by @LivioGama, who noted that it "Does not happen on Kilo Code".

Solution

After investigating, I discovered that Kilo-Org/kilocode had already solved this by using the official ollama npm package for native API access. This PR adapts their approach to our codebase.

Changes Made:

  1. Added the official ollama npm package (v0.5.17) as a dependency
  2. Created src/api/providers/native-ollama.ts - New handler using the native Ollama SDK (a minimal sketch follows this list)
  3. Updated src/api/index.ts - Switched to use NativeOllamaHandler for ollama provider
  4. Added comprehensive tests in src/api/providers/__tests__/native-ollama.spec.ts
  5. Maintained backward compatibility - Old OllamaHandler remains available but unused
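
For context, the ollama package talks to Ollama's native /api/chat endpoint directly. Below is a minimal sketch of the streaming path; the class here is illustrative, not the PR's actual NativeOllamaHandler, which extends BaseProvider and handles many more options:

```ts
import { Ollama, type Message } from "ollama"

// Minimal sketch only: the real handler lives in src/api/providers/native-ollama.ts.
class MinimalOllamaHandler {
	private client: Ollama

	constructor(baseUrl = "http://localhost:11434") {
		// Native API client; no /v1 OpenAI compatibility routes involved.
		this.client = new Ollama({ host: baseUrl })
	}

	async *createMessage(model: string, messages: Message[]) {
		const stream = await this.client.chat({ model, messages, stream: true })
		for await (const chunk of stream) {
			if (chunk.message?.content) {
				yield { type: "text" as const, text: chunk.message.content }
			}
			if (chunk.done) {
				// The native API reports token counts on the final chunk.
				yield {
					type: "usage" as const,
					inputTokens: chunk.prompt_eval_count ?? 0,
					outputTokens: chunk.eval_count ?? 0,
				}
			}
		}
	}
}
```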

Key Features:

  • Direct communication with Ollama's native API
  • Proper error handling for Ollama-specific scenarios (service not running, model not found); see the error-mapping sketch after this list
  • Support for streaming responses with token usage tracking
  • DeepSeek R1 reasoning detection support
  • No more OpenAI compatibility layer overhead
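
The Ollama-specific error handling called out above distinguishes a stopped service from a missing model. A hedged sketch of that mapping (the handler's exact messages may differ):

```ts
import { Ollama } from "ollama"

// Sketch: map the two most common Ollama failure modes to actionable errors.
async function chatOrExplain(client: Ollama, model: string, prompt: string) {
	try {
		return await client.chat({
			model,
			messages: [{ role: "user", content: prompt }],
			stream: false,
		})
	} catch (err: any) {
		// Connection refused usually means the Ollama service is not running.
		if (err?.code === "ECONNREFUSED" || err?.cause?.code === "ECONNREFUSED") {
			throw new Error("Ollama service is not running; start it with `ollama serve`.")
		}
		// ollama-js throws a ResponseError carrying status_code for HTTP errors;
		// 404 here means the model has not been pulled.
		if (err?.status_code === 404) {
			throw new Error(`Model "${model}" not found; run \`ollama pull ${model}\`.`)
		}
		throw err
	}
}
```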

Testing

  • ✅ All new unit tests pass
  • ✅ TypeScript compilation succeeds
  • ✅ Linting passes

Credits

This solution was inspired by Kilo-Org/kilocode's implementation.

Related

  • Issue #7070: Wrong routes for Ollama models

Breaking Changes

None - the change is transparent to users and maintains full backward compatibility.


Important

Replaces OpenAI compatibility layer with native Ollama API for Ollama models, introducing NativeOllamaHandler and adding comprehensive tests.

  • Behavior:
    • Replaces OpenAI compatibility layer with native Ollama API for Ollama models in src/api/index.ts.
    • Introduces NativeOllamaHandler in native-ollama.ts for direct API communication.
    • Maintains backward compatibility by keeping old OllamaHandler.
  • Testing:
    • Adds tests for NativeOllamaHandler in native-ollama.spec.ts.
    • Tests cover message streaming, prompt completion, and error handling.
  • Dependencies:
    • Adds ollama npm package (v0.5.17) to package.json.

This description was created by Ellipsis for 5c83d3a.

- Implements native Ollama API using the official ollama npm package
- Fixes issue #7070 where models like gpt-oss:120b failed with OpenAI routes
- Based on the approach successfully used by Kilo-Org/kilocode
- Maintains backward compatibility by keeping old handler available
- Adds comprehensive tests for the new implementation

Credits: Solution inspired by Kilo-Org/kilocode's implementation

Fixes #7070
@daniel-lxs daniel-lxs requested review from mrubens, cte and jr as code owners August 15, 2025 22:41
@daniel-lxs daniel-lxs self-assigned this Aug 15, 2025
@dosubot dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files) and bug (Something isn't working) labels Aug 15, 2025
src/api/index.ts Outdated
@@ -13,7 +13,7 @@ import {
VertexHandler,
AnthropicVertexHandler,
OpenAiHandler,
OllamaHandler,
// OllamaHandler, // Replaced with NativeOllamaHandler

Remove the commented-out 'OllamaHandler' import if it’s no longer needed.

Suggested change
// OllamaHandler, // Replaced with NativeOllamaHandler

This comment was generated because it violated a code review rule: irule_Vw7dJWzvznOJagxS.

import { BaseProvider } from "./base-provider"
import type { ApiHandlerOptions } from "../../shared/api"
import { getOllamaModels } from "./fetchers/ollama"
import { getApiRequestTimeout } from "./utils/timeout-config"

Remove the unused import 'getApiRequestTimeout' to clean up the code.

Suggested change
import { getApiRequestTimeout } from "./utils/timeout-config"

@roomote roomote bot left a comment

Thank you for your contribution! I've reviewed the changes and found that the implementation is solid with good error handling and test coverage. I have some suggestions for improvement that could make the code even cleaner.

import { BaseProvider } from "./base-provider"
import type { ApiHandlerOptions } from "../../shared/api"
import { getOllamaModels } from "./fetchers/ollama"
import { getApiRequestTimeout } from "./utils/timeout-config"

Is this import needed? I don't see getApiRequestTimeout being used anywhere in this file. Could we remove it to keep the imports clean?

return ollamaMessages
}

const OLLAMA_TIMEOUT_MS = 3_600_000

This constant OLLAMA_TIMEOUT_MS is defined but never used. Is this intentional for future use, or should we either use it (perhaps pass it to the Ollama client configuration) or remove it?
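
If it is meant to be used, one option is to wire it into the client via the custom fetch that the ollama constructor accepts. A sketch, assuming Node 18+ for AbortSignal.timeout:

```ts
import { Ollama } from "ollama"

const OLLAMA_TIMEOUT_MS = 3_600_000

// Pass a fetch wrapper so every request to the native API carries the timeout.
const client = new Ollama({
	host: "http://localhost:11434",
	fetch: (input: RequestInfo | URL, init?: RequestInit) =>
		fetch(input, { ...init, signal: AbortSignal.timeout(OLLAMA_TIMEOUT_MS) }),
})
```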

}
}

async fetchModel() {

I notice the fetchModel() method fetches models but doesn't seem to be called anywhere in the codebase. Is this intentional for future use, or should it be integrated into the initialization flow to populate the models cache?
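
For what it's worth, the follow-up commit below does integrate it by calling fetchModel() from createMessage and completePrompt. A standalone sketch of that lazy-cache pattern using the ollama client's native list() call (the real handler uses the getOllamaModels fetcher instead; names here are illustrative):

```ts
import { Ollama } from "ollama"

// Sketch: populate a models cache lazily, so callers like createMessage can
// `await this.fetchModel()` before issuing a request.
class ModelAwareHandler {
	private client = new Ollama({ host: "http://localhost:11434" })
	private models = new Set<string>()

	constructor(private modelId: string) {}

	async fetchModel() {
		// list() hits Ollama's native /api/tags endpoint.
		const { models } = await this.client.list()
		this.models = new Set(models.map((m) => m.name))
		return { id: this.modelId, known: this.models.has(this.modelId) }
	}
}
```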

if (part.type === "image") {
// Handle base64 images only (Anthropic SDK uses base64)
if ("source" in part && part.source.type === "base64") {
return `data:${part.source.media_type};base64,${part.source.data}`

The image handling here returns a data URL string directly in the content field. Could we verify this is the correct format expected by Ollama's API for image inputs? The comment mentions base64 handling, but I want to make sure this aligns with Ollama's expectations.

expect(model.info).toBeDefined()
})
})
})

Great test coverage! Consider adding a few more edge case tests:

  • Image message conversion (testing the base64 image handling)
  • Tool message handling
  • Multiple concurrent requests
  • Network timeout scenarios

These would help ensure robustness in production scenarios.
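
As one concrete example, a hedged vitest sketch of a network-failure case; the handler's constructor options and the exact error message are assumptions about its API:

```ts
import { describe, it, expect, vi } from "vitest"
import { Ollama } from "ollama"
import { NativeOllamaHandler } from "../native-ollama"

vi.mock("ollama")

describe("NativeOllamaHandler network failures", () => {
	it("maps ECONNREFUSED to a friendly error", async () => {
		// Make the mocked client reject the way a down service would.
		vi.mocked(Ollama).mockImplementation(
			() =>
				({
					chat: vi
						.fn()
						.mockRejectedValue(Object.assign(new Error("fetch failed"), { code: "ECONNREFUSED" })),
				}) as unknown as Ollama,
		)
		const handler = new NativeOllamaHandler({ ollamaModelId: "gpt-oss:120b" })
		await expect(handler.completePrompt("hi")).rejects.toThrow(/Ollama/i)
	})
})
```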

- Remove unused imports (getApiRequestTimeout) and constants (OLLAMA_TIMEOUT_MS)
- Call fetchModel() in createMessage and completePrompt to load model info
- Fix image handling to use raw base64 strings instead of data URLs
- Remove commented-out OllamaHandler import
- Properly separate text and images in message conversion
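
For reference, Ollama's native chat API takes images as raw base64 strings in a per-message images array, with no data: URL prefix, which is what the image fix above switches to. A sketch of the separated conversion (the part type and helper name are illustrative):

```ts
import type { Message } from "ollama"

// Illustrative Anthropic-style content part shape.
type ContentPart =
	| { type: "text"; text: string }
	| { type: "image"; source: { type: "base64"; media_type: string; data: string } }

function toOllamaUserMessage(parts: ContentPart[]): Message {
	const texts: string[] = []
	const images: string[] = []
	for (const part of parts) {
		if (part.type === "text") {
			texts.push(part.text)
		} else if (part.source.type === "base64") {
			// Raw base64 only: no "data:<mime>;base64," prefix.
			images.push(part.source.data)
		}
	}
	return { role: "user", content: texts.join("\n"), images }
}
```
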
@daniel-lxs daniel-lxs moved this from Triage to PR [Needs Review] in Roo Code Roadmap Aug 16, 2025
@dosubot dosubot bot added the lgtm (This PR has been approved by a maintainer) label Aug 16, 2025
@mrubens mrubens merged commit f3864ff into main Aug 16, 2025
10 checks passed
@mrubens mrubens deleted the fix/ollama-native-api-7070 branch August 16, 2025 03:32
@github-project-automation github-project-automation bot moved this from PR [Needs Review] to Done in Roo Code Roadmap Aug 16, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Aug 16, 2025
fxcl added a commit to tameslabs/Roo-Cline that referenced this pull request Aug 16, 2025
* main: (70 commits)
  fix: use native Ollama API instead of OpenAI compatibility layer (RooCodeInc#7137)
  feat: add support for OpenAI gpt-5-chat-latest model (RooCodeInc#7058)
  Make enhance with task history default to true (RooCodeInc#7140)
  Bump cloud version to 0.16.0 (RooCodeInc#7135)
  Release: v1.51.0 (RooCodeInc#7130)
  Add an API for resuming tasks by ID (RooCodeInc#7122)
  Add support for task page event population (RooCodeInc#7117)
  fix: add type check before calling .match() on diffItem.content (RooCodeInc#6905) (RooCodeInc#6906)
  Fix: Enable save button for provider dropdown and checkbox changes (RooCodeInc#7113)
  fix: Use cline.cwd as primary source for workspace path in codebaseSearchTool (RooCodeInc#6902)
  Hotfix multiple folder workspace checkpoint (RooCodeInc#6903)
  fix: prevent XML entity decoding in diff tools (RooCodeInc#7107) (RooCodeInc#7108)
  Refactor task execution system: improve call stack management (RooCodeInc#7035)
  Changeset version bump (RooCodeInc#7104)
  feat(web): fill missing SEO-related values (RooCodeInc#7096)
  Update contributors list (RooCodeInc#6883)
  Release v3.25.15 (RooCodeInc#7103)
  fix: add /evals page to sitemap generation (RooCodeInc#7102)
  feat: implement sitemap generation in TypeScript and remove XML file (RooCodeInc#6206)
  fix: reset condensing state when switching tasks (RooCodeInc#6922)
  ...
@hannesrudolph hannesrudolph added the Issue/PR - Triage (New issue. Needs quick review to confirm validity and assign labels.) label Aug 16, 2025