You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/retrieval-augmented-generation-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -58,7 +58,7 @@ A high-level summary of the pattern looks like this:
58
58
+ Send it to Azure AI Search to find relevant information.
59
59
+ Return the top ranked search results to an LLM.
60
60
+ Use the natural language understanding and reasoning capabilities of the LLM to generate a response to the initial prompt.
61
-
+ Optionally, use agentic RAG where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.
61
+
+ Optionally, use agentic retrieval where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.
62
62
63
63
Azure AI Search provides inputs to the LLM prompt, but doesn't train the model. In RAG architecture, there's no extra training. The LLM is pretrained using public data, but it generates responses that are augmented by information from the retriever, in this case, Azure AI Search.
In Azure AI Search, *agentic retrieval* is a new parallel query processing architecture that uses conversational language models to generate multiple subqueries for a single retrieval request, incorporating conversation history and semantic ranking to produce high-quality grounding data for custom chat and generative AI solutions that include agents.
19
+
In Azure AI Search, *agentic retrieval* is a new parallel query processing architecture that incorporates user conversation history and Azure OpenAI models to plan, retrieve and synthesize queries for improved results. It produces high-quality grounding data for custom chat and generative AI solutions that include agents.
20
20
21
21
Programmatically, agentic retrieval is supported through a new Knowledge Agents object (also known as a search agent) in the 2025-05-01-preview data plane REST API and in Azure SDK prerelease packages that provide the feature. An agent's retrieval response is designed for downstream consumption by other agents and chat apps.
22
22
23
23
## Why use agentic retrieval
24
24
25
-
You should use agentic retrieval when you want to send data to an agent or customize a chat experience with high quality inputs that include your proprietary data.
25
+
You should use agentic retrieval when you want to send high quality data to an agent or to ground a chat experience with inputs that include your proprietary content.
26
26
27
-
The *agentic* aspect is a reasoning step in query planning processing that's performed by a supported large language model (LLM) that you provide. The LLM is tasked with designing multiple subqueries based on: user questions, chat history, and parameters on the request. The subqueries target your indexed documents (plain text and vectors) in Azure AI Search.
27
+
The *agentic* aspect is a reasoning step in query planning processing that's performed by a supported large language model (LLM) that you provide. The LLM analyzes the entire chat thread to identify the underlying information need. Instead of a single, catch-all query, the model breaks down compound questions into focused subqueries based on: user questions, chat history, and parameters on the request. The subqueries target your indexed documents (plain text and vectors) in Azure AI Search.This hybrid approach ensures you surface both keyword matches and semantic similarities at once, dramatically improving recall.
28
28
29
29
The *retrieval* component is the ability to run subqueries simultaneously, merge results, semantically rank results, and return a three-part response that includes grounding data for the next conversation turn, reference data so that you can inspect the source content, and an activity plan that shows query execution steps.
30
30
31
31
Query expansion and parallel execution, plus the retrieval response, are the key capabilities of agentic retrieval that make it the best choice for generative AI (RAG) applications.
32
-
<!-- Queries target a new retrieval pipeline in AI Search supports parallel processing, expanding the scope of single request to include subqueries Query execution runs on your search service, utilizing the best and most effective relevance enhancements in Azure AI Search, including semantic ranker. Output is intended for integration into custom chat solutions, particularly those patterned after an agent-to-agent approach. -->
33
32
34
33
Agentic retrieval adds latency to query processing, but it makes up for it by adding these capabilities:
35
34
@@ -83,7 +82,7 @@ Billing for agentic retrieval has two parts:
83
82
84
83
+ Billing for query planning is pay-as-you-go in Azure OpenAI. It's token based for both input and output tokens. The model you assign to the agent is the one charged for token usage. For example, if you use gpt-4o, the token charge appears in the bill for gpt-4o.
85
84
86
-
+ Billing for semantic ranking during query execution. Billing is suspended during the initial roll-out phase, from May 19 through June 30, 2025. On July 1, billing will be token based and is pay-as-you-go on the Azure AI Search side through the semantic ranker. Semantic ranker, which is a premium billable feature, is an integral part of agentic retrieval. You're charged on the Azure AI Search side for token inputs to the semantic ranking models.
85
+
+ Billing for semantic ranking during query execution. Billing is suspended during the initial roll-out phase but then transitions to pay-as-you-go on the Azure AI Search side through the semantic ranker. Semantic ranker, which is a premium billable feature, is an integral part of agentic retrieval. You're charged on the Azure AI Search side for token inputs to the semantic ranking models.
87
86
88
87
Semantic ranking is performed for every subquery in the plan. Semantic ranking charges are based on the number of tokens returned by each subquery.
89
88
@@ -144,17 +143,22 @@ You must use the preview REST APIs or a prerelease Azure SDK package that provid
144
143
Choose any of these options for your next step.
145
144
146
145
<!-- + Watch this demo. -->
147
-
+[Quickstart](search-get-started-agentic-retrieval.md). Learn the basic workflow using sample data and a prepared index and queries.
146
+
+[Quickstart article: Run agentic retrieval in Azure AI Search](search-get-started-agentic-retrieval.md). Learn the basic workflow using sample data and a prepared index and queries.
148
147
149
-
+[(Sample code) Build an agentic retrieval pipeline using Azure AI Search and Azure AI Agent in the Foundry portal](https://github.com/Azure-Samples/azure-search-python-samples/agent-example)
148
+
+ Sample code:
150
149
151
-
+ How-to guides for a closer look at building an agentic retrieval pipeline:
+[End-to-end with Azure AI Search and Azure AI Agent Service](https://github.com/Azure-Samples/azure-search-python-samples/tree/main/agentic-retrieval-pipeline-example)
154
+
155
+
+ How-to guides for a focused look at development tasks:
152
156
153
157
+[Create an agent](search-agentic-retrieval-how-to-create.md)
154
158
+[Use an agent to retrieve data](search-agentic-retrieval-how-to-retrieve.md)
155
159
+[Build an agent-to-agent retrieval solution](search-agentic-retrieval-how-to-pipeline.md).
156
160
157
-
+ REST API reference, [Agents](/rest/api/searchservice/knowledge-agents/create-or-update?view=rest-searchservice-2025-05-01-preview&preserve-view=true) and [retrieve](/rest/api/searchservice/knowledge-retrieval/retrieve?view=rest-searchservice-2025-05-01-preview&preserve-view=true).
161
+
+ REST API reference, [Agents](/rest/api/searchservice/knowledge-agents?view=rest-searchservice-2025-05-01-preview&preserve-view=true) and [retrieve](/rest/api/searchservice/knowledge-retrieval/retrieve?view=rest-searchservice-2025-05-01-preview&preserve-view=true).
158
162
159
163
+[Azure OpenAI Demo](https://github.com/Azure-Samples/azure-search-openai-demo), updated to use agentic retrieval.
Copy file name to clipboardExpand all lines: articles/search/search-agentic-retrieval-how-to-pipeline.md
+1-2Lines changed: 1 addition & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,8 +17,7 @@ ms.date: 05/10/2025
17
17
18
18
This article describes an approach or pattern for building a solution that uses Azure AI Search for data retrieval and how to integrate the retrieval into a custom solution that includes Azure AI Agent.
19
19
20
-
This article supports the [agent-example](https://github.com/Azure-Samples/azure-search-python-samples/agent-example
21
-
) Python sample on GitHub.
20
+
This article supports the [agentic-retrieval-pipeline-example](https://github.com/Azure-Samples/azure-search-python-samples/tree/main/agentic-retrieval-pipeline-example) Python sample on GitHub.
22
21
23
22
This exercise differs from the [Agentic Retrieval Quickstart](search-get-started-agentic-retrieval.md) in how it uses Azure AI Agent to determine whether to retrieve data from the index, and how it uses an agent tool for orchestration.
Copy file name to clipboardExpand all lines: articles/search/search-document-level-access-overview.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -51,15 +51,15 @@ Document-level permissions in Azure AI Search provide a structured framework for
51
51
52
52
## Reference documents
53
53
54
-
To help you dive deeper into document-level access control in Azure AI Search, here’s a table of key resources:
54
+
To help you dive deeper into document-level access control in Azure AI Search, here are more articles and samples:
55
55
56
56
| Functionality | Reference | Sample |
57
-
|---|---|
58
-
|**Index permissions using Push APIs**|[Index permissions using REST API](search-index-access-control-lists-and-rbac-push-api.md)|[azure-search-rest-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-rest-samples-pr/tree/main/Quickstart-ACL) or [azure-search-python-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-python-samples-pr/blob/main/Quickstart-ACL)|
59
-
|**Index ADLS Gen2 permissions metadata using built-in indexers**|[Index permissions using ADLS Gen2 indexer](search-indexer-access-control-lists-and-role-based-access.md)|[azure-search-python-samples/Quickstart-Document-Permissions](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions)|
60
-
|**Query using Microsoft Entra token-based permissions**|[Query using Microsoft Entra token-based permissions](https://aka.ms/azs-query-preserving-permissions)| See previous samples. |
57
+
|---|---|---|
58
+
|**Index permissions using Push APIs**|[How to index permissions using REST API](search-index-access-control-lists-and-rbac-push-api.md)|[azure-search-rest-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-rest-samples/tree/main/Quickstart-ACL) or [azure-search-python-samples/Quickstart-Document-Permissions-Push-API](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions-Push-API)|
59
+
|**Index ADLS Gen2 permissions metadata using built-in indexers**|[How to index permissions using ADLS Gen2 indexer](search-indexer-access-control-lists-and-role-based-access.md)and [Tutorial: Index ADLS Gen2 permissions metadata using an indexer](tutorial-adls-gen2-indexer-acls.md)|[azure-search-python-samples/Quickstart-Document-Permissions-Pull-API](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions-Pull-API)|
60
+
|**Query using Microsoft Entra token-based permissions**|[How to query using Microsoft Entra token-based permissions](https://aka.ms/azs-query-preserving-permissions)| See previous samples. |
61
61
|**Security trimming via filters**|[Security trimming via filters](search-security-trimming-for-azure-search.md)| Not available. |
62
-
62
+
63
63
## Next steps
64
64
65
-
-[Tutorial: Index ADLS Gen2 permissions metadata](tutorial-adls-gen2-indexer-acls.md)
65
+
-[Tutorial: Index ADLS Gen2 permissions metadata](tutorial-adls-gen2-indexer-acls.md)
0 commit comments