
Commit 273ebee

Remove OllamaModel in favor of usage with OpenAIModel (#805)
1 parent: bb4c828

File tree

12 files changed: +85 −331 lines

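For downstream users, the change this commit implies looks roughly like the following sketch (the old `OllamaModel` signature is assumed from the removed module; `llama3.2` and Ollama's default local port `11434` are illustrative):

```diff
-from pydantic_ai.models.ollama import OllamaModel
-
-model = OllamaModel(model_name='llama3.2')
+from pydantic_ai.models.openai import OpenAIModel
+
+model = OpenAIModel(model_name='llama3.2', base_url='http://localhost:11434/v1')
```

This works because Ollama exposes an OpenAI-compatible API, so a dedicated model class was redundant.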

README.md

Lines changed: 1 addition & 1 deletion

```diff
@@ -37,7 +37,7 @@ We built PydanticAI with one simple aim: to bring that FastAPI feeling to GenAI
 Built by the team behind [Pydantic](https://docs.pydantic.dev/latest/) (the validation layer of the OpenAI SDK, the Anthropic SDK, LangChain, LlamaIndex, AutoGPT, Transformers, CrewAI, Instructor and many more).

 * __Model-agnostic__

-  Supports OpenAI, Anthropic, Gemini, Ollama, Groq, and Mistral, and there is a simple interface to implement support for [other models](https://ai.pydantic.dev/models/).
+  Supports OpenAI, Anthropic, Gemini, Deepseek, Ollama, Groq, Cohere, and Mistral, and there is a simple interface to implement support for [other models](https://ai.pydantic.dev/models/).

 * __Pydantic Logfire Integration__

   Seamlessly [integrates](https://ai.pydantic.dev/logfire/) with [Pydantic Logfire](https://pydantic.dev/logfire) for real-time debugging, performance monitoring, and behavior tracking of your LLM-powered applications.
```

docs/api/models/ollama.md

Lines changed: 0 additions & 75 deletions
This file was deleted.

docs/index.md

Lines changed: 1 addition & 1 deletion

```diff
@@ -17,7 +17,7 @@ We built PydanticAI with one simple aim: to bring that FastAPI feeling to GenAI
 Built by the team behind [Pydantic](https://docs.pydantic.dev/latest/) (the validation layer of the OpenAI SDK, the Anthropic SDK, LangChain, LlamaIndex, AutoGPT, Transformers, CrewAI, Instructor and many more).

 :fontawesome-solid-shapes:{ .md .middle .shapes-orange }&nbsp;<strong class="vertical-middle">Model-agnostic</strong><br>
-Supports OpenAI, Anthropic, Gemini, Ollama, Groq, and Mistral, and there is a simple interface to implement support for [other models](models.md).
+Supports OpenAI, Anthropic, Gemini, Deepseek, Ollama, Groq, Cohere, and Mistral, and there is a simple interface to implement support for [other models](models.md).

 :logfire-logo:{ .md .middle }&nbsp;<strong class="vertical-middle">Pydantic Logfire Integration</strong><br>
 Seamlessly [integrates](logfire.md) with [Pydantic Logfire](https://pydantic.dev/logfire) for real-time debugging, performance monitoring, and behavior tracking of your LLM-powered applications.
```

docs/models.md

Lines changed: 76 additions & 21 deletions

````diff
@@ -4,10 +4,11 @@ PydanticAI is Model-agnostic and has built in support for the following model p
 * [Anthropic](#anthropic)
 * Gemini via two different APIs: [Generative Language API](#gemini) and [VertexAI API](#gemini-via-vertexai)
 * [Ollama](#ollama)
+* [Deepseek](#deepseek)
 * [Groq](#groq)
 * [Mistral](#mistral)

-See [OpenAI-compatible models](#openai-compatible-models) for more examples on how to use models such as [OpenRouter](#openrouter), [Grok (xAI)](#grok-xai) and [DeepSeek](#deepseek) that support the OpenAI SDK.
+See [OpenAI-compatible models](#openai-compatible-models) for more examples on how to use models such as [OpenRouter](#openrouter), and [Grok (xAI)](#grok-xai) that support the OpenAI SDK.

 You can also [add support for other models](#implementing-custom-models).
@@ -304,26 +305,6 @@ agent = Agent(model)
 [`VertexAiRegion`][pydantic_ai.models.vertexai.VertexAiRegion] contains a list of available regions.

-## Ollama
-
-### Install
-
-To use [`OllamaModel`][pydantic_ai.models.ollama.OllamaModel], you need to either install [`pydantic-ai`](install.md), or install [`pydantic-ai-slim`](install.md#slim-install) with the `openai` optional group:
-
-```bash
-pip/uv-add 'pydantic-ai-slim[openai]'
-```
-
-**This is because internally, `OllamaModel` uses the OpenAI API.**
-
-### Configuration
-
-To use [Ollama](https://ollama.com/), you must first download the Ollama client, and then download a model using the [Ollama model library](https://ollama.com/library).
-
-You must also ensure the Ollama server is running when trying to make requests to it. For more information, please see the [Ollama documentation](https://github.com/ollama/ollama/tree/main/docs).
-
-For detailed setup and example, please see the [Ollama setup documentation](https://github.com/pydantic/pydantic-ai/blob/main/docs/api/models/ollama.md).
-
 ## Groq

 ### Install
@@ -456,6 +437,80 @@ model = OpenAIModel(
 ...
 ```

+### Ollama
+
+To use [Ollama](https://ollama.com/), you must first download the Ollama client, and then download a model using the [Ollama model library](https://ollama.com/library).
+
+You must also ensure the Ollama server is running when trying to make requests to it. For more information, please see the [Ollama documentation](https://github.com/ollama/ollama/tree/main/docs).
+
+#### Example local usage
+
+With `ollama` installed, you can run the server with the model you want to use:
+
+```bash {title="terminal-run-ollama"}
+ollama run llama3.2
+```
+
+(this will pull the `llama3.2` model if you don't already have it downloaded)
+
+Then run your code, here's a minimal example:
+
+```python {title="ollama_example.py"}
+from pydantic import BaseModel
+
+from pydantic_ai import Agent
+from pydantic_ai.models.openai import OpenAIModel
+
+
+class CityLocation(BaseModel):
+    city: str
+    country: str
+
+
+ollama_model = OpenAIModel(model_name='llama3.2', base_url='http://localhost:11434/v1')
+agent = Agent(ollama_model, result_type=CityLocation)
+
+result = agent.run_sync('Where were the olympics held in 2012?')
+print(result.data)
+#> city='London' country='United Kingdom'
+print(result.usage())
+"""
+Usage(requests=1, request_tokens=57, response_tokens=8, total_tokens=65, details=None)
+"""
+```
+
+#### Example using a remote server
+
+```python {title="ollama_example_with_remote_server.py"}
+from pydantic import BaseModel
+
+from pydantic_ai import Agent
+from pydantic_ai.models.openai import OpenAIModel
+
+ollama_model = OpenAIModel(
+    model_name='qwen2.5-coder:7b',  # (1)!
+    base_url='http://192.168.1.74:11434/v1',  # (2)!
+)
+
+
+class CityLocation(BaseModel):
+    city: str
+    country: str
+
+
+agent = Agent(model=ollama_model, result_type=CityLocation)
+
+result = agent.run_sync('Where were the olympics held in 2012?')
+print(result.data)
+#> city='London' country='United Kingdom'
+print(result.usage())
+"""
+Usage(requests=1, request_tokens=57, response_tokens=8, total_tokens=65, details=None)
+"""
+```
+
+1. The name of the model running on the remote server
+2. The url of the remote server
+
 ### OpenRouter

 To use [OpenRouter](https://openrouter.ai), first create an API key at [openrouter.ai/keys](https://openrouter.ai/keys).
````

mkdocs.yml

Lines changed: 0 additions & 1 deletion

```diff
@@ -54,7 +54,6 @@ nav:
     - api/models/vertexai.md
     - api/models/groq.md
     - api/models/mistral.md
-    - api/models/ollama.md
     - api/models/test.md
     - api/models/function.md
     - api/pydantic_graph/graph.md
```

pydantic_ai_slim/pydantic_ai/models/__init__.py

Lines changed: 2 additions & 24 deletions

```diff
@@ -12,9 +12,10 @@
 from dataclasses import dataclass, field
 from datetime import datetime
 from functools import cache
-from typing import TYPE_CHECKING, Literal
+from typing import TYPE_CHECKING

 import httpx
+from typing_extensions import Literal

 from .._parts_manager import ModelResponsePartsManager
 from ..exceptions import UserError
@@ -107,25 +108,6 @@
     'o1-mini-2024-09-12',
     'o1-preview',
     'o1-preview-2024-09-12',
-    'ollama:codellama',
-    'ollama:deepseek-r1',
-    'ollama:gemma',
-    'ollama:gemma2',
-    'ollama:llama3',
-    'ollama:llama3.1',
-    'ollama:llama3.2',
-    'ollama:llama3.2-vision',
-    'ollama:llama3.3',
-    'ollama:mistral',
-    'ollama:mistral-nemo',
-    'ollama:mixtral',
-    'ollama:phi3',
-    'ollama:phi4',
-    'ollama:qwen',
-    'ollama:qwen2',
-    'ollama:qwen2.5',
-    'ollama:qwq',
-    'ollama:starcoder2',
     'openai:chatgpt-4o-latest',
     'openai:gpt-3.5-turbo',
     'openai:gpt-3.5-turbo-0125',
@@ -356,10 +338,6 @@ def infer_model(model: Model | KnownModelName) -> Model:
         from .mistral import MistralModel

         return MistralModel(model[8:])
-    elif model.startswith('ollama:'):
-        from .ollama import OllamaModel
-
-        return OllamaModel(model[7:])
     elif model.startswith('anthropic'):
         from .anthropic import AnthropicModel
```
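The `infer_model` hunk above removes the `ollama:` branch from a simple prefix dispatch over model-name strings. A minimal self-contained sketch of that pattern (simplified and hypothetical — the real function lazily imports and returns model instances, not names):

```python
def infer_provider(model: str) -> tuple[str, str]:
    """Split a 'provider:name' model string into (provider class, bare model name).

    Illustrative only: after this commit there is no 'ollama:' entry, so
    prefixed Ollama names raise instead of dispatching to a dedicated class.
    """
    prefixes = {
        'openai:': 'OpenAIModel',
        'groq:': 'GroqModel',
        'mistral:': 'MistralModel',
    }
    for prefix, provider in prefixes.items():
        if model.startswith(prefix):
            # Strip the prefix, mirroring slices like model[8:] in the diff
            return provider, model[len(prefix):]
    raise ValueError(f'Unknown model: {model}')


print(infer_provider('openai:gpt-4o'))
#> ('OpenAIModel', 'gpt-4o')
```

With the branch gone, `'ollama:llama3.2'` falls through to the error path, which is why users now construct `OpenAIModel` with a `base_url` directly.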

pydantic_ai_slim/pydantic_ai/models/ollama.py

Lines changed: 0 additions & 123 deletions
This file was deleted.
