reranking documentation

aninibread · aninibread · commit 25664988a40d · 2025-10-15T19:08:57.000-04:00
diff --git a/src/content/docs/ai-search/configuration/index.mdx b/src/content/docs/ai-search/configuration/index.mdx
@@ -22,6 +22,7 @@ The table below lists all available configuration options:
 | [Query rewrite system prompt](/ai-search/configuration/system-prompt/)         | yes                     | Custom system prompt to guide query rewriting behavior                                     |
 | [Match threshold](/ai-search/configuration/retrieval-configuration/)           | yes                     | Minimum similarity score required for a vector match                                       |
 | [Maximum number of results](/ai-search/configuration/retrieval-configuration/) | yes                     | Maximum number of vector matches returned (`top_k`)                                        |
+| [Reranking](/ai-search/configuration/reranking/) | yes | Reorder retrieved results by semantic relevance using a reranking model after initial retrieval |
 | [Generation model](/ai-search/configuration/models/)                           | yes                     | Model used to generate the final response                                                  |
 | [Generation system prompt](/ai-search/configuration/system-prompt/)            | yes                     | Custom system prompt to guide response generation                                          |
 | [Similarity caching](/ai-search/configuration/cache/)                          | yes                     | Enable or disable caching of responses for similar (not just exact) prompts                |
diff --git a/src/content/docs/ai-search/configuration/models/supported-models.mdx b/src/content/docs/ai-search/configuration/models/supported-models.mdx
@@ -50,6 +50,11 @@ Production models are the actively supported and recommended models that are sta
 | **Workers AI** | `@cf/baai/bge-m3` | 1,024 | 512 | cosine |
 |  | `@cf/baai/bge-large-en-v1.5` | 1,024 | 512 | cosine |
 
+### Reranking
+| Provider | Alias | Input tokens | 
+|---|---|---|
+| **Workers AI** | `@cf/baai/bge-reranker-base` | 512 | 
+
 ## Transition models
 
 There are currently no models marked for end-of-life.
diff --git a/src/content/docs/ai-search/configuration/reranking.mdx b/src/content/docs/ai-search/configuration/reranking.mdx
@@ -0,0 +1,61 @@
+---
+pcx_content_type: concept
+title: Reranking
+sidebar:
+  order: 4
+---
+
+Reranking can improve the quality of AI Search results by reordering retrieved documents based on semantic relevance to the user’s query. It applies a secondary model after retrieval to rerank the top results before they are returned.
+
+## How it works
+
+By default, reranking is **disabled** for all AI Search instances. You can enable it during creation or later from the settings page.
+
+When enabled, AI Search will:
+
+1. Retrieve a set of relevant results from your index, constrained by your `max_num_results` and `score_threshold` parameters.
+2. Pass those results through a [reranking model](/ai-search/configuration/models/supported-models/).
+3. Return the reranked results, which the text generation model can use for answer generation.
+
+Reranking helps improve accuracy, especially for large or noisy datasets where vector similarity alone may not produce the optimal ordering.
+
+## Configuration
+
+You can configure reranking through the API or in the dashboard.
+
+### Configure via API
+
+When you make a `/search` or `/ai-search` request using the [Workers Binding](/ai-search/usage/workers-binding/) or [REST API](/ai-search/usage/rest-api/), you can:
+
+- Enable or disable reranking per request
+- Specify the reranking model
+
+For example:
+
+```javascript
+const answer = await env.AI.autorag("my-autorag").aiSearch({
+  query: "How do I train a llama to deliver coffee?",
+  model: "@cf/meta/llama-3.3-70b-instruct-fp8-fast",
+  reranking: {
+    enabled: true,
+    model: "@cf/baai/bge-reranker-base"
+  }
+});
+```
+
+### Configure in dashboard for new AI Search
+
+When creating a new AI Search instance in the dashboard:
+
+1. In the Retrieval configuration step, open the Reranking dropdown.
+2. Toggle Reranking on.
+3. Select the reranking model.
+
+### Configure in dashboard for existing AI Search
+
+To update reranking for an existing instance:
+
+1. Go to your AI Search instance.
+2. Open the Settings tab.
+3. Enable or disable reranking, and select the reranking model.
+
diff --git a/src/content/docs/ai-search/usage/rest-api.mdx b/src/content/docs/ai-search/usage/rest-api.mdx
@@ -52,7 +52,9 @@ curl https://api.cloudflare.com/client/v4/accounts/{ACCOUNT_ID}/ai-search/rags/{
 	"rewrite_query": false,
 	"max_num_results": 10,
 	"ranking_options": {
-		"score_threshold": 0.3
+		"score_threshold": 0.3,
+		"enabled": true,
+		"model": "@cf/baai/bge-reranker-base"
 	},
 	"stream": true,
 }'
@@ -89,7 +91,9 @@ curl https://api.cloudflare.com/client/v4/accounts/{ACCOUNT_ID}/ai-search/rags/{
 	"rewrite_query": true,
 	"max_num_results": 10,
 	"ranking_options": {
-		"score_threshold": 0.3
+		"score_threshold": 0.3,
+		"enabled": true,
+		"model": "@cf/baai/bge-reranker-base"
 	},
 }'
 
diff --git a/src/content/docs/ai-search/usage/workers-binding.mdx b/src/content/docs/ai-search/usage/workers-binding.mdx
@@ -47,6 +47,8 @@ const answer = await env.AI.autorag("my-autorag").aiSearch({
 	max_num_results: 2,
 	ranking_options: {
 		score_threshold: 0.3,
+		enabled: true,
+		model: "@cf/baai/bge-reranker-base"
 	},
 	stream: true,
 });
@@ -116,6 +118,8 @@ const answer = await env.AI.autorag("my-autorag").search({
 	max_num_results: 2,
 	ranking_options: {
 		score_threshold: 0.3,
+		enabled: true,
+		model: "@cf/baai/bge-reranker-base"
 	},
 });
 ```
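
Review note: the three-step flow described in the new `reranking.mdx` page (retrieve results constrained by the maximum number of results and the score threshold, pass them through a reranking model, return the reordered results) can be sketched in plain JavaScript. This is only an illustration under stated assumptions. `retrieveThenRerank`, the shape of `candidates`, and the `rerankScore` callback are hypothetical names, and the token-overlap scorer is a stand-in for a real reranking model such as `@cf/baai/bge-reranker-base`. It does not describe how AI Search is implemented internally.

```javascript
// Hypothetical sketch of a retrieve-then-rerank flow; not the AI Search implementation.
function retrieveThenRerank(query, candidates, options) {
  const { maxNumResults, scoreThreshold, rerankingEnabled, rerankScore } = options;

  // Step 1: keep vector matches at or above the threshold, best first, capped at maxNumResults.
  const retrieved = candidates
    .filter((c) => c.score >= scoreThreshold)
    .sort((a, b) => b.score - a.score)
    .slice(0, maxNumResults);

  // With reranking disabled, the vector-similarity order is returned unchanged.
  if (!rerankingEnabled) return retrieved;

  // Step 2: score each retrieved result against the query with the reranker.
  // Step 3: return the results ordered by reranker score.
  return retrieved
    .map((c) => ({ ...c, rerankScore: rerankScore(query, c.text) }))
    .sort((a, b) => b.rerankScore - a.rerankScore);
}

// Stand-in scorer (assumption): fraction of query tokens that appear in the text.
function tokenOverlap(query, text) {
  const tokens = query.toLowerCase().split(/\W+/).filter(Boolean);
  const words = new Set(text.toLowerCase().split(/\W+/).filter(Boolean));
  const hits = tokens.filter((t) => words.has(t)).length;
  return tokens.length === 0 ? 0 : hits / tokens.length;
}

// Made-up candidates with made-up vector similarity scores.
const candidates = [
  { id: "a", text: "Llamas are domesticated animals from South America", score: 0.9 },
  { id: "b", text: "How to train a llama to deliver coffee", score: 0.6 },
  { id: "c", text: "Coffee brewing guide", score: 0.2 },
];

const reranked = retrieveThenRerank("train llama deliver coffee", candidates, {
  maxNumResults: 10,
  scoreThreshold: 0.3,
  rerankingEnabled: true,
  rerankScore: tokenOverlap,
});

// "c" is dropped by the threshold; "b" moves ahead of "a" after reranking.
console.log(reranked.map((r) => r.id));
```

In this toy data, candidate `a` has the highest vector score but shares no exact tokens with the query, so the stand-in reranker places `b` first. That is the kind of reordering the page describes for cases where vector similarity alone may not produce the best ordering.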