Skip to content

Commit 29fef5e

Browse files
committed
update model section
1 parent d279df9 commit 29fef5e

File tree

2 files changed

+30
-32
lines changed

2 files changed

+30
-32
lines changed

src/content/docs/ai-search/configuration/models/index.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ AI Search models follow a defined lifecycle to ensure stability and predictable
5151

5252
1. **Production:** The model is actively supported and recommended for use. It is included in Smart Defaults and receives ongoing updates and maintenance.
5353
2. **Announcement & Transition:** The model remains available but has been marked for deprecation. An end-of-life date is communicated through documentation, release notes, and other official channels. During this phase, users are encouraged to migrate to the recommended replacement model.
54-
3. **Automatic Upgrade (if applicable):** For some models, AI Search may automatically upgrade requests to a recommended replacement.
54+
3. **Automatic Upgrade (if applicable):** If you have selected the Smart Default option, AI Search will automatically upgrade requests to a recommended replacement.
5555
4. **End of life:** The model is no longer available. Any requests to the retired model return a clear error message, and the model is removed from documentation and Smart Defaults.
5656

5757
See models are their lifecycle status in [supported models](/ai-search/configuration/models/supported-models/).

src/content/docs/ai-search/configuration/models/supported-models.mdx

Lines changed: 29 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -18,41 +18,39 @@ Production models are the actively supported and recommended models that are sta
1818
### Text generation
1919
| Provider | Alias | Context window (tokens) |
2020
|---|---|---|
21-
| **Anthropic** | anthropic/claude-3-7-sonnet | 200,000 |
22-
| | anthropic/claude-sonnet-4 | 200,000 |
23-
| | anthropic/claude-opus-4 | 200,000 |
24-
| | anthropic/claude-3-5-haiku | 200,000 |
25-
| **Cerebras** | cerebras/qwen-3-235b-a22b-instruct | 64,000 |
26-
| | cerebras/qwen-3-235b-a22b-thinking | 65,000 |
27-
| | cerebras/llama-3.3-70b | 65,000 |
28-
| | cerebras/llama-4-maverick-17b-128e-instruct | 8,000 |
29-
| | cerebras/llama-4-scout-17b-16e-instruct | 8,000 |
30-
| | cerebras/gpt-oss-120b | 64,000 |
31-
| **Google AI Studio** | google-ai-studio/gemini-2.5-flash | 1,048,576 |
32-
| | google-ai-studio/gemini-2.5-pro | 1,048,576 |
33-
| **Grok (x.ai)** | grok/grok-4 | 256,000 |
34-
| **Groq** | groq/llama-3.3-70b-versatile | 131,072 |
35-
| | groq/llama-3.1-8b-instant | 131,072 |
36-
| **OpenAI** | openai/gpt-5 | 400,000 |
37-
| | openai/gpt-5-mini | 400,000 |
38-
| | openai/gpt-5-nano | 400,000 |
39-
| **Workers AI** | @cf/meta/llama-3.3-70b-instruct-fp8-fast | 24,000 |
40-
| | @cf/meta/llama-3.1-8b-instruct-fast | 60,000 |
41-
| | @cf/meta/llama-3.1-8b-instruct-fp8 | 32,000 |
42-
| | @cf/meta/llama-4-scout-17b-16e-instruct | 131,000 |
43-
| | @cf/qwen/qwen3-30b-a3b-fp8 | 32,000 |
44-
| | @cf/moonshotai/kimi-k2-instruct | 128,000 |
21+
| **Anthropic** | `anthropic/claude-3-7-sonnet` | 200,000 |
22+
| | `anthropic/claude-sonnet-4` | 200,000 |
23+
| | `anthropic/claude-opus-4` | 200,000 |
24+
| | `anthropic/claude-3-5-haiku` | 200,000 |
25+
| **Cerebras** | `cerebras/qwen-3-235b-a22b-instruct` | 64,000 |
26+
| | `cerebras/qwen-3-235b-a22b-thinking` | 65,000 |
27+
| | `cerebras/llama-3.3-70b` | 65,000 |
28+
| | `cerebras/llama-4-maverick-17b-128e-instruct` | 8,000 |
29+
| | `cerebras/llama-4-scout-17b-16e-instruct` | 8,000 |
30+
| | `cerebras/gpt-oss-120b` | 64,000 |
31+
| **Google AI Studio** | `google-ai-studio/gemini-2.5-flash` | 1,048,576 |
32+
| | `google-ai-studio/gemini-2.5-pro` | 1,048,576 |
33+
| **Grok (x.ai)** | `grok/grok-4` | 256,000 |
34+
| **Groq** | `groq/llama-3.3-70b-versatile` | 131,072 |
35+
| | `groq/llama-3.1-8b-instant` | 131,072 |
36+
| **OpenAI** | `openai/gpt-5` | 400,000 |
37+
| | `openai/gpt-5-mini` | 400,000 |
38+
| | `openai/gpt-5-nano` | 400,000 |
39+
| **Workers AI** | `@cf/meta/llama-3.3-70b-instruct-fp8-fast` | 24,000 |
40+
| | `@cf/meta/llama-3.1-8b-instruct-fast` | 60,000 |
41+
| | `@cf/meta/llama-3.1-8b-instruct-fp8` | 32,000 |
42+
| | `@cf/meta/llama-4-scout-17b-16e-instruct` | 131,000 |
43+
| | `@cf/qwen/qwen3-30b-a3b-fp8` | 32,000 |
44+
| | `@cf/moonshotai/kimi-k2-instruct` | 128,000 |
4545

4646
### Embedding
4747
| Provider | Alias | Vector dims | Input tokens | Metric |
4848
|---|---|---|---|---|
49-
| **Google AI Studio** | google-ai-studio/gemini-embedding-001 | 1,536 | 512 | cosine |
50-
| **OpenAI** | openai/text-embedding-3-small | 1,536 | 512 | cosine |
51-
| | openai/text-embedding-3-large | 1,536 | 512 | cosine |
52-
| **Workers AI** | @cf/baai/bge-m3 | 1,024 | 512 | cosine |
53-
| | @cf/baai/bge-large-en-v1.5 | 1,024 | 512 | cosine |
54-
| | @cf/google/embeddinggemma-300m | 768 | 512 | cosine |
55-
| | @cf/qwen/qwen3-embedding-0.6b | 1,024 | 512 | cosine |
49+
| **Google AI Studio** | `google-ai-studio/gemini-embedding-001` | 1,536 | 2048 | cosine |
50+
| **OpenAI** | `openai/text-embedding-3-small` | 1,536 | 8192 | cosine |
51+
| | `openai/text-embedding-3-large` | 1,536 | 8192 | cosine |
52+
| **Workers AI** | `@cf/baai/bge-m3` | 1,024 | 512 | cosine |
53+
| | `@cf/baai/bge-large-en-v1.5` | 1,024 | 512 | cosine |
5654

5755
## Transition models
5856

0 commit comments

Comments
 (0)