You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In Azure AI Search, *agentic retrieval* is a new multi-query pipeline designed for complex questions posed by users or agents in chat and copilot apps. It's characterized by:
21
+
In Azure AI Search, *agentic retrieval* is a new multi-query pipeline designed for complex questions posed by users or agents in chat and copilot apps. It works by:
22
22
23
-
+ Using a large language model (LLM) to break down a complex query into smallersubqueries. You can specify whether to include contextual chat history as a subquery.
23
+
+ Using a large language model (LLM) to break down complex queries into smaller, focused subqueries. You can include chat history for additional context.
24
24
25
-
+ Running the subqueries run in parallel, each one searching for the most relevant content in your index.
25
+
+ Running multiple subqueries simultaneously to search your index. Each subquery is semantically reranked to find the most relevant matches.
26
26
27
-
+ Combining the results into a long unified string that's sent back to your LLM for response generation, using your proprietary content to formulate the answer.
27
+
+ Combining the best results into a unified response that your LLM can use to generate answers with your proprietary content.
28
+
29
+
This high-performance pipeline delivers fast, comprehensive answers to complex questions.
28
30
29
31
Programmatically, agentic retrieval is supported through a new Knowledge Agents object in the 2025-05-01-preview data plane REST API and in Azure SDK prerelease packages that provide the feature. A knowledge agent's retrieval response is designed for downstream consumption by other agents and chat apps.
30
32
@@ -60,6 +62,12 @@ Agentic retrieval is designed for a conversational search experience that includ
60
62
61
63
:::image type="content" source="media/agentic-retrieval/agentic-retrieval-architecture.png" alt-text="Diagram of agentic retrieval workflow using an example query." lightbox="media/agentic-retrieval/agentic-retrieval-architecture.png" :::
62
64
65
+
The workflow includes:
66
+
67
+
*Query planning* where the search engine calls an LLM (a chat completion model) that you provide. The output is one or more subqueries. This step is mostly internal. You can review the subqueries that are generated, but query planning isn't intended to be customizable or configurable.
0 commit comments