67 changes: 42 additions & 25 deletions docs/src/ai/models.md
@@ -1,12 +1,14 @@
---
title: AI Models and Pricing - Zed
description: AI models available via Zed Pro including Claude, GPT-5.2, Gemini 3.1 Pro, and Grok. Pricing, context windows, and tool call support.
description: AI models available via Zed Pro including Claude, GPT-5.4, Gemini 3.1 Pro, and Grok. Pricing, context windows, and tool call support.
---

# Models

Zed's plans offer hosted versions of major LLMs with higher rate limits than direct API access. Model availability is updated regularly. To use your own API keys instead, see [LLM Providers](./llm-providers.md). For general setup, see [Configuration](./configuration.md).

> **Note:** Claude Opus models and GPT-5.4 pro are not available on the [Student plan](./plans-and-usage.md#student).

| Model | Provider | Token Type | Provider Price per 1M tokens | Zed Price per 1M tokens |
| ---------------------- | --------- | ------------------- | ---------------------------- | ----------------------- |
| Claude Opus 4.5 | Anthropic | Input | $5.00 | $5.50 |
@@ -29,12 +31,20 @@ Zed's plans offer hosted versions of major LLMs with higher rate limits than dir
| | Anthropic | Output | $5.00 | $5.50 |
| | Anthropic | Input - Cache Write | $1.25 | $1.375 |
| | Anthropic | Input - Cache Read | $0.10 | $0.11 |
| GPT-5.2 | OpenAI | Input | $1.25 | $1.375 |
| | OpenAI | Output | $10.00 | $11.00 |
| | OpenAI | Cached Input | $0.125 | $0.1375 |
| GPT-5.2 Codex | OpenAI | Input | $1.25 | $1.375 |
| | OpenAI | Output | $10.00 | $11.00 |
| | OpenAI | Cached Input | $0.125 | $0.1375 |
| GPT-5.4 pro | OpenAI | Input | $30.00 | $33.00 |
| | OpenAI | Output | $180.00 | $198.00 |
| GPT-5.4 | OpenAI | Input | $2.50 | $2.75 |
| | OpenAI | Output | $15.00 | $16.50 |
| | OpenAI | Cached Input | $0.025 | $0.0275 |
| GPT-5.3-Codex | OpenAI | Input | $1.75 | $1.925 |
| | OpenAI | Output | $14.00 | $15.40 |
| | OpenAI | Cached Input | $0.175 | $0.1925 |
| GPT-5.2 | OpenAI | Input | $1.75 | $1.925 |
| | OpenAI | Output | $14.00 | $15.40 |
| | OpenAI | Cached Input | $0.175 | $0.1925 |
| GPT-5.2-Codex | OpenAI | Input | $1.75 | $1.925 |
| | OpenAI | Output | $14.00 | $15.40 |
| | OpenAI | Cached Input | $0.175 | $0.1925 |
| GPT-5 mini | OpenAI | Input | $0.25 | $0.275 |
| | OpenAI | Output | $2.00 | $2.20 |
| | OpenAI | Cached Input | $0.025 | $0.0275 |
@@ -43,8 +53,8 @@ Zed's plans offer hosted versions of major LLMs with higher rate limits than dir
| | OpenAI | Cached Input | $0.005 | $0.0055 |
| Gemini 3.1 Pro | Google | Input | $2.00 | $2.20 |
| | Google | Output | $12.00 | $13.20 |
| Gemini 3 Flash | Google | Input | $0.30 | $0.33 |
| | Google | Output | $2.50 | $2.75 |
| Gemini 3 Flash | Google | Input | $0.50 | $0.55 |
| | Google | Output | $3.00 | $3.30 |
| Grok 4 | X.ai | Input | $3.00 | $3.30 |
| | X.ai | Output | $15.00 | $16.50 |
| | X.ai | Cached Input | $0.75 | $0.825 |
@@ -65,7 +75,7 @@ As of February 19, 2026, Zed Pro serves newer model versions in place of the ret
- Claude Opus 4.1 → Claude Opus 4.5 or Claude Opus 4.6
- Claude Sonnet 4 → Claude Sonnet 4.5 or Claude Sonnet 4.6
- Claude Sonnet 3.7 (retired Feb 19) → Claude Sonnet 4.5 or Claude Sonnet 4.6
- GPT-5.1 and GPT-5 → GPT-5.2 or GPT-5.2 Codex
- GPT-5.1 and GPT-5 → GPT-5.2 or GPT-5.2-Codex
- Gemini 2.5 Pro → Gemini 3.1 Pro
- Gemini 3 Pro → Gemini 3.1 Pro
- Gemini 2.5 Flash → Gemini 3 Flash
@@ -80,21 +90,28 @@ Any usage of a Zed-hosted model will be billed at the Zed Price (rightmost colum

A context window is the maximum span of text and code an LLM can consider at once, including both the input prompt and output generated by the model.

| Model | Provider | Zed-Hosted Context Window |
| ----------------- | --------- | ------------------------- |
| Claude Opus 4.5 | Anthropic | 200k |
| Claude Opus 4.6 | Anthropic | 1M |
| Claude Sonnet 4.5 | Anthropic | 200k |
| Claude Sonnet 4.6 | Anthropic | 1M |
| Claude Haiku 4.5 | Anthropic | 200k |
| GPT-5.2 | OpenAI | 400k |
| GPT-5.2 Codex | OpenAI | 400k |
| GPT-5 mini | OpenAI | 400k |
| GPT-5 nano | OpenAI | 400k |
| Gemini 3.1 Pro | Google | 200k |
| Gemini 3 Flash | Google | 200k |

> Context window limits for hosted Gemini 3.1 Pro/3 Pro/Flash may increase in future releases.
| Model | Provider | Zed-Hosted Context Window |
| --------------------------- | --------- | ------------------------- |
| Claude Opus 4.5 | Anthropic | 200k |
| Claude Opus 4.6 | Anthropic | 1M |
| Claude Sonnet 4.5 | Anthropic | 200k |
| Claude Sonnet 4.6 | Anthropic | 1M |
| Claude Haiku 4.5 | Anthropic | 200k |
| GPT-5.4 pro | OpenAI | 400k |
| GPT-5.4 | OpenAI | 400k |
| GPT-5.3-Codex | OpenAI | 400k |
| GPT-5.2 | OpenAI | 400k |
| GPT-5.2-Codex | OpenAI | 400k |
| GPT-5 mini | OpenAI | 400k |
| GPT-5 nano | OpenAI | 400k |
| Gemini 3.1 Pro | Google | 200k |
| Gemini 3 Flash | Google | 200k |
| Grok 4 | X.ai | 128k |
| Grok 4 Fast | X.ai | 128k |
| Grok 4 Fast (Non-Reasoning) | X.ai | 128k |
| Grok Code Fast 1 | X.ai | 256k |

> Context window limits for hosted Gemini 3.1 Pro/3 Flash may increase in future releases.

Each Agent thread in Zed maintains its own context window.
The more prompts, attached files, and responses included in a session, the larger the context window grows.
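
To make the growth of a thread's context window concrete, here is a minimal sketch that tracks how much of a model's window remains as a session accumulates content. The window sizes come from the table above, read with "k" as 1,000 tokens and "M" as 1,000,000 tokens, which is an assumption. The function names and the per-item token counts are hypothetical and for illustration only; this is not how Zed itself accounts for context.

```python
# Zed-hosted context window sizes in tokens, taken from the table above.
# Assumption: "k" means 1,000 tokens and "M" means 1,000,000 tokens.
CONTEXT_WINDOW_TOKENS = {
    "Claude Opus 4.6": 1_000_000,
    "GPT-5.4": 400_000,
    "Gemini 3.1 Pro": 200_000,
    "Grok 4": 128_000,
}


def remaining_tokens(model: str, used_tokens: list[int]) -> int:
    """Return how many tokens are left in the window for one thread.

    `used_tokens` holds hypothetical token counts for each prompt,
    attached file, and response already in the thread.
    """
    window = CONTEXT_WINDOW_TOKENS[model]
    return max(window - sum(used_tokens), 0)


def fits(model: str, used_tokens: list[int], next_item_tokens: int) -> bool:
    """Check whether one more item still fits in the thread's window."""
    return next_item_tokens <= remaining_tokens(model, used_tokens)


# A hypothetical thread: two prompts plus a large attached file.
thread = [2_000, 100_000, 18_000]
print(remaining_tokens("Grok 4", thread))  # 128,000 - 120,000 = 8,000
print(fits("Grok 4", thread, 10_000))      # False: 10,000 > 8,000
print(fits("GPT-5.4", thread, 10_000))     # True: 280,000 tokens remain
```

Because each thread has its own window, a check like this would apply per thread rather than across all open threads.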