Skip to content

Commit e53e653

Browse files
committed
Refresh on staging
1 parent 20a9315 commit e53e653

File tree

1 file changed

+12
-4
lines changed

1 file changed

+12
-4
lines changed

articles/search/search-agentic-retrieval-concept.md

Lines changed: 12 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -18,13 +18,15 @@ ms.custom:
1818

1919
[!INCLUDE [Feature preview](./includes/previews/preview-generic.md)]
2020

21-
In Azure AI Search, *agentic retrieval* is a new multi-query pipeline designed for complex questions posed by users or agents in chat and copilot apps. It's characterized by:
21+
In Azure AI Search, *agentic retrieval* is a new multi-query pipeline designed for complex questions posed by users or agents in chat and copilot apps. It works by:
2222

23-
+ Using a large language model (LLM) to break down a complex query into smaller subqueries. You can specify whether to include contextual chat history as a subquery.
23+
+ Using a large language model (LLM) to break down complex queries into smaller, focused subqueries. You can include chat history for additional context.
2424

25-
+ Running the subqueries run in parallel, each one searching for the most relevant content in your index.
25+
+ Running multiple subqueries simultaneously to search your index. Each subquery is semantically reranked to find the most relevant matches.
2626

27-
+ Combining the results into a long unified string that's sent back to your LLM for response generation, using your proprietary content to formulate the answer.
27+
+ Combining the best results into a unified response that your LLM can use to generate answers with your proprietary content.
28+
29+
This high-performance pipeline delivers fast, comprehensive answers to complex questions.
2830

2931
Programmatically, agentic retrieval is supported through a new Knowledge Agents object in the 2025-05-01-preview data plane REST API and in Azure SDK prerelease packages that provide the feature. A knowledge agent's retrieval response is designed for downstream consumption by other agents and chat apps.
3032

@@ -60,6 +62,12 @@ Agentic retrieval is designed for a conversational search experience that includ
6062

6163
:::image type="content" source="media/agentic-retrieval/agentic-retrieval-architecture.png" alt-text="Diagram of agentic retrieval workflow using an example query." lightbox="media/agentic-retrieval/agentic-retrieval-architecture.png" :::
6264

65+
The workflow includes:
66+
67+
*Query planning* where the search engine calls an LLM (a chat completion model) that you provide. The output is one or more subqueries. This step is mostly internal. You can review the subqueries that are generated, but query planning isn't intended to be customizable or configurable.
68+
69+
*
70+
6371
Agentic retrieval has these components:
6472

6573
| Component | Resource | Usage |

0 commit comments

Comments
 (0)