Commit a66dfca

Merge pull request #4917 from HeidiSteen/heidist-build-3
[Monday automerge] corrections to a feature name
2 parents c359d95 + 3f4911f commit a66dfca

7 files changed

+54 -50 lines changed

articles/search/index.yml

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,10 @@
 ### YamlMime:Landing

 title: Azure AI Search documentation # < 60 chars
-summary: Information retrieval at scale for agentic RAG, with vector and text content in traditional or generative search scenarios.
+summary: Information retrieval at scale for agentic retrieval, with vector and text content in traditional or generative search scenarios.
 metadata:
 title: Azure AI Search documentation
-description: Information retrieval at scale for agentic RAG, with vector and text content in traditional or generative search scenarios.
+description: Information retrieval at scale for agentic retrieval, with vector and text content in traditional or generative search scenarios.
 ms.service: azure-ai-search
 ms.custom:
 - ignite-2023
@@ -43,13 +43,13 @@ landingContent:
 url: vector-search-how-to-query.md

 # Card
-- title: Agentic RAG
+- title: Agentic retrieval and vectors
 linkLists:
 - linkListType: concept
 links:
 - text: Retrieval Augmented Generation (RAG)
 url: retrieval-augmented-generation-overview.md
-- text: Agentic RAG
+- text: Agentic retrieval
 url: search-agentic-retrieval-concept.md
 - text: Vector search
 url: vector-search-overview.md

articles/search/retrieval-augmented-generation-overview.md

Lines changed: 1 addition & 1 deletion
@@ -58,7 +58,7 @@ A high-level summary of the pattern looks like this:
 + Send it to Azure AI Search to find relevant information.
 + Return the top ranked search results to an LLM.
 + Use the natural language understanding and reasoning capabilities of the LLM to generate a response to the initial prompt.
-+ Optionally, use agentic RAG where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.
++ Optionally, use agentic retrieval where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.

 Azure AI Search provides inputs to the LLM prompt, but doesn't train the model. In RAG architecture, there's no extra training. The LLM is pretrained using public data, but it generates responses that are augmented by information from the retriever, in this case, Azure AI Search.

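Reviewer note: the RAG pattern summarized in the hunk above (send the query to a retriever, return the top-ranked results, pass them to an LLM as grounding) can be sketched roughly as follows. This is a minimal illustration, not the Azure AI Search API: `search` and `build_prompt` are hypothetical names, the in-memory dictionary stands in for a search index, and term overlap stands in for real relevance ranking. The LLM call itself is omitted.

```python
from dataclasses import dataclass


@dataclass
class Doc:
    id: str
    text: str
    score: float


def search(query, index, top=3):
    """Hypothetical retriever: rank documents by shared terms with the query."""
    terms = set(query.lower().split())
    scored = []
    for doc_id, text in index.items():
        overlap = len(terms & set(text.lower().split()))
        if overlap:
            scored.append(Doc(doc_id, text, float(overlap)))
    # Highest score first; ties broken by document id for stable output.
    scored.sort(key=lambda d: (-d.score, d.id))
    return scored[:top]


def build_prompt(question, docs):
    """Assemble the grounded prompt that would be sent to the LLM."""
    sources = "\n".join(f"[{d.id}] {d.text}" for d in docs)
    return f"Answer using only these sources:\n{sources}\n\nQuestion: {question}"


if __name__ == "__main__":
    index = {
        "a": "hotel with pool",
        "b": "airport shuttle hotel",
        "c": "city museum",
    }
    hits = search("hotel pool", index)
    print(build_prompt("hotel pool", hits))
```

The model is never retrained in this flow; the retrieved text only augments the prompt, which matches the description in the changed article.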
articles/search/search-agentic-retrieval-concept.md

Lines changed: 14 additions & 10 deletions
@@ -9,27 +9,26 @@ ms.author: heidist
 ms.service: azure-ai-search
 ms.topic: concept-article
 ms.custom: references_regions
-ms.date: 05/15/2025
+ms.date: 05/19/2025
 ---

 # Agentic retrieval in Azure AI Search

 [!INCLUDE [Feature preview](./includes/previews/preview-generic.md)]

-In Azure AI Search, *agentic retrieval* is a new parallel query processing architecture that uses conversational language models to generate multiple subqueries for a single retrieval request, incorporating conversation history and semantic ranking to produce high-quality grounding data for custom chat and generative AI solutions that include agents.
+In Azure AI Search, *agentic retrieval* is a new parallel query processing architecture that incorporates user conversation history and Azure OpenAI models to plan, retrieve and synthesize queries for improved results. It produces high-quality grounding data for custom chat and generative AI solutions that include agents.

 Programmatically, agentic retrieval is supported through a new Knowledge Agents object (also known as a search agent) in the 2025-05-01-preview data plane REST API and in Azure SDK prerelease packages that provide the feature. An agent's retrieval response is designed for downstream consumption by other agents and chat apps.

 ## Why use agentic retrieval

-You should use agentic retrieval when you want to send data to an agent or customize a chat experience with high quality inputs that include your proprietary data.
+You should use agentic retrieval when you want to send high quality data to an agent or to ground a chat experience with inputs that include your proprietary content.

-The *agentic* aspect is a reasoning step in query planning processing that's performed by a supported large language model (LLM) that you provide. The LLM is tasked with designing multiple subqueries based on: user questions, chat history, and parameters on the request. The subqueries target your indexed documents (plain text and vectors) in Azure AI Search.
+The *agentic* aspect is a reasoning step in query planning processing that's performed by a supported large language model (LLM) that you provide. The LLM analyzes the entire chat thread to identify the underlying information need. Instead of a single, catch-all query, the model breaks down compound questions into focused subqueries based on: user questions, chat history, and parameters on the request. The subqueries target your indexed documents (plain text and vectors) in Azure AI Search.This hybrid approach ensures you surface both keyword matches and semantic similarities at once, dramatically improving recall.

 The *retrieval* component is the ability to run subqueries simultaneously, merge results, semantically rank results, and return a three-part response that includes grounding data for the next conversation turn, reference data so that you can inspect the source content, and an activity plan that shows query execution steps.

 Query expansion and parallel execution, plus the retrieval response, are the key capabilities of agentic retrieval that make it the best choice for generative AI (RAG) applications.
-<!-- Queries target a new retrieval pipeline in AI Search supports parallel processing, expanding the scope of single request to include subqueries Query execution runs on your search service, utilizing the best and most effective relevance enhancements in Azure AI Search, including semantic ranker. Output is intended for integration into custom chat solutions, particularly those patterned after an agent-to-agent approach. -->

 Agentic retrieval adds latency to query processing, but it makes up for it by adding these capabilities:

@@ -83,7 +82,7 @@ Billing for agentic retrieval has two parts:

 + Billing for query planning is pay-as-you-go in Azure OpenAI. It's token based for both input and output tokens. The model you assign to the agent is the one charged for token usage. For example, if you use gpt-4o, the token charge appears in the bill for gpt-4o.

-+ Billing for semantic ranking during query execution. Billing is suspended during the initial roll-out phase, from May 19 through June 30, 2025. On July 1, billing will be token based and is pay-as-you-go on the Azure AI Search side through the semantic ranker. Semantic ranker, which is a premium billable feature, is an integral part of agentic retrieval. You're charged on the Azure AI Search side for token inputs to the semantic ranking models.
++ Billing for semantic ranking during query execution. Billing is suspended during the initial roll-out phase but then transitions to pay-as-you-go on the Azure AI Search side through the semantic ranker. Semantic ranker, which is a premium billable feature, is an integral part of agentic retrieval. You're charged on the Azure AI Search side for token inputs to the semantic ranking models.

 Semantic ranking is performed for every subquery in the plan. Semantic ranking charges are based on the number of tokens returned by each subquery.

@@ -144,17 +143,22 @@ You must use the preview REST APIs or a prerelease Azure SDK package that provid
 Choose any of these options for your next step.

 <!-- + Watch this demo. -->
-+ [Quickstart](search-get-started-agentic-retrieval.md). Learn the basic workflow using sample data and a prepared index and queries.
++ [Quickstart article: Run agentic retrieval in Azure AI Search](search-get-started-agentic-retrieval.md). Learn the basic workflow using sample data and a prepared index and queries.

-+ [(Sample code) Build an agentic retrieval pipeline using Azure AI Search and Azure AI Agent in the Foundry portal](https://github.com/Azure-Samples/azure-search-python-samples/agent-example)
++ Sample code:

-+ How-to guides for a closer look at building an agentic retrieval pipeline:
++ [Quickstart-Agentic-Retrieval: Python](https://github.com/Azure-Samples/azure-search-python-samples/tree/main/Quickstart-Agentic-Retrieval)
++ [Quickstart-Agentic-Retrieval: .NET](https://github.com/Azure-Samples/azure-search-dotnet-samples/blob/main/quickstart-agentic-retrieval)
++ [Quickstart-Agentic-Retrieval: REST](https://github.com/Azure-Samples/azure-search-rest-samples/tree/main/agentic-retrieval)
++ [End-to-end with Azure AI Search and Azure AI Agent Service](https://github.com/Azure-Samples/azure-search-python-samples/tree/main/agentic-retrieval-pipeline-example)
+
++ How-to guides for a focused look at development tasks:

 + [Create an agent](search-agentic-retrieval-how-to-create.md)
 + [Use an agent to retrieve data](search-agentic-retrieval-how-to-retrieve.md)
 + [Build an agent-to-agent retrieval solution](search-agentic-retrieval-how-to-pipeline.md).

-+ REST API reference, [Agents](/rest/api/searchservice/knowledge-agents/create-or-update?view=rest-searchservice-2025-05-01-preview&preserve-view=true) and [retrieve](/rest/api/searchservice/knowledge-retrieval/retrieve?view=rest-searchservice-2025-05-01-preview&preserve-view=true).
++ REST API reference, [Agents](/rest/api/searchservice/knowledge-agents?view=rest-searchservice-2025-05-01-preview&preserve-view=true) and [retrieve](/rest/api/searchservice/knowledge-retrieval/retrieve?view=rest-searchservice-2025-05-01-preview&preserve-view=true).

 + [Azure OpenAI Demo](https://github.com/Azure-Samples/azure-search-openai-demo), updated to use agentic retrieval.

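Reviewer note: the flow described in search-agentic-retrieval-concept.md (plan subqueries from the chat thread, run them simultaneously, merge and rank the results, return a three-part response) could be sketched along these lines. Everything here is an assumption made for illustration: `plan_subqueries`, `run_subquery`, `retrieve`, and the `grounding`/`references`/`activity` keys are hypothetical names, not the Knowledge Agents API. Splitting on " and " stands in for LLM query planning, and a max-score merge over term overlap stands in for semantic ranking.

```python
from concurrent.futures import ThreadPoolExecutor


def plan_subqueries(chat_history):
    """Stand-in for LLM query planning: split the last user turn on ' and '."""
    last_turn = chat_history[-1]
    return [part.strip() for part in last_turn.split(" and ") if part.strip()]


def run_subquery(subquery, index):
    """Hypothetical search call: score each document by shared terms."""
    terms = set(subquery.lower().split())
    hits = []
    for doc_id, text in index.items():
        score = len(terms & set(text.lower().split()))
        if score:
            hits.append((doc_id, score))
    return hits


def retrieve(chat_history, index, top=3):
    subqueries = plan_subqueries(chat_history)
    # Run all subqueries concurrently; map preserves subquery order.
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(lambda q: run_subquery(q, index), subqueries))
    # Merge: keep each document once, with its best score across subqueries.
    merged = {}
    for hits in results:
        for doc_id, score in hits:
            merged[doc_id] = max(score, merged.get(doc_id, 0))
    ranked = sorted(merged.items(), key=lambda kv: (-kv[1], kv[0]))[:top]
    return {
        "grounding": [index[doc_id] for doc_id, _ in ranked],
        "references": [doc_id for doc_id, _ in ranked],
        "activity": [
            {"subquery": q, "hits": len(h)} for q, h in zip(subqueries, results)
        ],
    }


if __name__ == "__main__":
    index = {
        "a": "hotel with pool",
        "b": "airport shuttle hotel",
        "c": "city museum",
    }
    print(retrieve(["find a hotel pool and airport shuttle"], index))
```

The activity list makes the latency trade-off visible: each extra subquery is one more search call, which is also why the article bills semantic ranking per subquery.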
articles/search/search-agentic-retrieval-how-to-pipeline.md

Lines changed: 1 addition & 2 deletions
@@ -17,8 +17,7 @@ ms.date: 05/10/2025

 This article describes an approach or pattern for building a solution that uses Azure AI Search for data retrieval and how to integrate the retrieval into a custom solution that includes Azure AI Agent.

-This article supports the [agent-example](https://github.com/Azure-Samples/azure-search-python-samples/agent-example
-) Python sample on GitHub.
+This article supports the [agentic-retrieval-pipeline-example](https://github.com/Azure-Samples/azure-search-python-samples/tree/main/agentic-retrieval-pipeline-example) Python sample on GitHub.

 This exercise differs from the [Agentic Retrieval Quickstart](search-get-started-agentic-retrieval.md) in how it uses Azure AI Agent to determine whether to retrieve data from the index, and how it uses an agent tool for orchestration.

articles/search/search-document-level-access-overview.md

Lines changed: 7 additions & 7 deletions
@@ -51,15 +51,15 @@ Document-level permissions in Azure AI Search provide a structured framework for

 ## Reference documents

-To help you dive deeper into document-level access control in Azure AI Search, here’s a table of key resources:
+To help you dive deeper into document-level access control in Azure AI Search, here are more articles and samples:

 | Functionality | Reference | Sample |
-|---|---|
-| **Index permissions using Push APIs** | [Index permissions using REST API](search-index-access-control-lists-and-rbac-push-api.md) | [azure-search-rest-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-rest-samples-pr/tree/main/Quickstart-ACL) or [azure-search-python-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-python-samples-pr/blob/main/Quickstart-ACL) |
-| **Index ADLS Gen2 permissions metadata using built-in indexers** | [Index permissions using ADLS Gen2 indexer](search-indexer-access-control-lists-and-role-based-access.md) | [azure-search-python-samples/Quickstart-Document-Permissions](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions) |
-| **Query using Microsoft Entra token-based permissions** | [Query using Microsoft Entra token-based permissions](https://aka.ms/azs-query-preserving-permissions) | See previous samples. |
+|---|---|---|
+| **Index permissions using Push APIs** | [How to index permissions using REST API](search-index-access-control-lists-and-rbac-push-api.md) | [azure-search-rest-samples/Quickstart-ACL](https://github.com/Azure-Samples/azure-search-rest-samples/tree/main/Quickstart-ACL) or [azure-search-python-samples/Quickstart-Document-Permissions-Push-API](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions-Push-API) |
+| **Index ADLS Gen2 permissions metadata using built-in indexers** | [How to index permissions using ADLS Gen2 indexer](search-indexer-access-control-lists-and-role-based-access.md) and [Tutorial: Index ADLS Gen2 permissions metadata using an indexer](tutorial-adls-gen2-indexer-acls.md) | [azure-search-python-samples/Quickstart-Document-Permissions-Pull-API](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Quickstart-Document-Permissions-Pull-API) |
+| **Query using Microsoft Entra token-based permissions** | [How to query using Microsoft Entra token-based permissions](https://aka.ms/azs-query-preserving-permissions) | See previous samples. |
 | **Security trimming via filters** | [Security trimming via filters](search-security-trimming-for-azure-search.md) | Not available. |
-
+
 ## Next steps

-- [Tutorial: Index ADLS Gen2 permissions metadata](tutorial-adls-gen2-indexer-acls.md)
+- [Tutorial: Index ADLS Gen2 permissions metadata](tutorial-adls-gen2-indexer-acls.md)
