Skip to content

Commit c7dae8e

Browse files
committed
Add note, link eval docs in more places, link to videos
1 parent f4a7334 commit c7dae8e

File tree

8 files changed

+27
-5
lines changed

8 files changed

+27
-5
lines changed

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -253,13 +253,15 @@ You can find extensive documentation in the [docs](docs/README.md) folder:
253253
- [Deploying with existing Azure resources](docs/deploy_existing.md)
254254
- [Deploying from a free account](docs/deploy_lowcost.md)
255255
- [Enabling optional features](docs/deploy_features.md)
256+
- [All features](docs/deploy_features.md)
256257
- [Login and access control](docs/login_and_acl.md)
257258
- [GPT-4 Turbo with Vision](docs/gpt4v.md)
258259
- [Private endpoints](docs/deploy_private.md)
259260
- [Sharing deployment environments](docs/sharing_environments.md)
260261
- [Local development](docs/localdev.md)
261262
- [Customizing the app](docs/customization.md)
262263
- [Data ingestion](docs/data_ingestion.md)
264+
- [Evaluation](docs/evaluation.md)
263265
- [Monitoring with Application Insights](docs/monitoring.md)
264266
- [Productionizing](docs/productionizing.md)
265267
- [Alternative RAG chat samples](docs/other_samples.md)

docs/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -10,12 +10,14 @@ These are advanced topics that are not necessary for a basic deployment.
1010
- [Deploying with existing Azure resources](deploy_existing.md)
1111
- [Deploying from a free account](deploy_lowcost.md)
1212
- [Enabling optional features](deploy_features.md)
13+
- [All features](docs/deploy_features.md)
1314
- [Login and access control](login_and_acl.md)
1415
- [GPT-4 Turbo with Vision](gpt4v.md)
1516
- [Private endpoints](deploy_private.md)
1617
- [Sharing deployment environments](sharing_environments.md)
1718
- [Local development](localdev.md)
1819
- [Customizing the app](customization.md)
20+
- [Evaluation](docs/evaluation.md)
1921
- [Data ingestion](data_ingestion.md)
2022
- [Monitoring with Application Insights](monitoring.md)
2123
- [Productionizing](productionizing.md)

docs/customization.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,8 @@
11
# RAG chat: Customizing the chat app
22

3-
This guide provides more details for customizing the Chat App.
3+
[📺 Watch: (RAG Deep Dive series) Customizing the app](https://www.youtube.com/watch?v=D3slfMqydHc)
4+
5+
This guide provides more details for customizing the RAG chat app.
46

57
- [Using your own data](#using-your-own-data)
68
- [Customizing the UI](#customizing-the-ui)
@@ -124,7 +126,7 @@ You can also try changing the ChatCompletion parameters, like temperature, to se
124126

125127
### Improving Azure AI Search results
126128

127-
If the problem is with Azure AI Search (step 2 above), the first step is to check what search parameters you're using. Generally, the best results are found with hybrid search (text + vectors) plus the additional semantic re-ranking step, and that's what we've enabled by default. There may be some domains where that combination isn't optimal, however. Check out this blog post which [evaluates AI search strategies](https://techcommunity.microsoft.com/blog/azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid-retrieval-and-ranking-ca/3929167) for a better understanding of the differences.
129+
If the problem is with Azure AI Search (step 2 above), the first step is to check what search parameters you're using. Generally, the best results are found with hybrid search (text + vectors) plus the additional semantic re-ranking step, and that's what we've enabled by default. There may be some domains where that combination isn't optimal, however. Check out this blog post which [evaluates AI search strategies](https://techcommunity.microsoft.com/blog/azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid-retrieval-and-ranking-ca/3929167) for a better understanding of the differences, or watch this [RAG Deep Dive video on AI Search](https://www.youtube.com/watch?v=ugJy9QkgLYg).
128130

129131
#### Configuring parameters in the app
130132

@@ -175,4 +177,4 @@ Here are additional ways for improving the search results:
175177

176178
### Evaluating answer quality
177179

178-
Once you've made changes to the prompts or settings, you'll want to rigorously evaluate the results to see if they've improved. You can use tools in [the AI RAG Chat evaluator](https://github.com/Azure-Samples/ai-rag-chat-evaluator) repository to run evaluations, review results, and compare answers across runs.
180+
Once you've made changes to the prompts or settings, you'll want to rigorously evaluate the results to see if they've improved. Follow the [evaluation guide](./evaluation.md) to learn how to run evaluations, review results, and compare answers across runs.

docs/deploy_features.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -179,6 +179,8 @@ Convert them first to PDF or image formats to enable media description.
179179

180180
## Enabling client-side chat history
181181

182+
[📺 Watch: (RAG Deep Dive series) Storing chat history](https://www.youtube.com/watch?v=1YiTFnnLVIA)
183+
182184
This feature allows users to view the chat history of their conversation, stored in the browser using [IndexedDB](https://developer.mozilla.org/docs/Web/API/IndexedDB_API). That means the chat history will be available only on the device where the chat was initiated. To enable browser-stored chat history, run:
183185

184186
```shell
@@ -187,6 +189,8 @@ azd env set USE_CHAT_HISTORY_BROWSER true
187189

188190
## Enabling persistent chat history with Azure Cosmos DB
189191

192+
[📺 Watch: (RAG Deep Dive series) Storing chat history](https://www.youtube.com/watch?v=1YiTFnnLVIA)
193+
190194
This feature allows authenticated users to view the chat history of their conversations, stored in the server-side storage using [Azure Cosmos DB](https://learn.microsoft.com/azure/cosmos-db/).This option requires that authentication be enabled. The chat history will be persistent and accessible from any device where the user logs in with the same account. To enable server-stored chat history, run:
191195

192196
```shell

docs/deploy_private.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@ urlFragment: azure-search-openai-demo-private-access
1919

2020
# RAG chat: Deploying with private access
2121

22+
[📺 Watch: (RAG Deep Dive series) Private network deployment](https://www.youtube.com/watch?v=08wtL1eB15g)
23+
2224
The [azure-search-openai-demo](/) project can set up a full RAG chat app on Azure AI Search and OpenAI so that you can chat on custom data, like internal enterprise data or domain-specific knowledge sets. For full instructions on setting up the project, consult the [main README](/README.md), and then return here for detailed instructions on configuring private endpoints.
2325

2426
⚠️ This feature is not yet compatible with Azure Container Apps, so you will need to [deploy to Azure App Service](./azure_app_service.md) instead.

docs/evaluation.md

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
# Evaluating the RAG answer quality
22

3+
[📺 Watch: (RAG Deep Dive series) Evaluating RAG answer quality](https://www.youtube.com/watch?v=lyCLu53fb3g)
4+
35
Follow these steps to evaluate the quality of the answers generated by the RAG flow.
46

57
* [Deploy an evaluation model](#deploy-an-evaluation-model)
@@ -87,7 +89,7 @@ The options are:
8789
* `resultsdir`: The directory to write the evaluation results. By default, this is a timestamped folder in `evals/results`. This option can also be specified in `eval_config.json`.
8890
* `targeturl`: The URL of the running application to evaluate. By default, this is `http://localhost:50505`. This option can also be specified in `eval_config.json`.
8991
90-
🕰️ This may take a long time, possibly several hours, depending on the number of ground truth questions.
92+
🕰️ This may take a long time, possibly several hours, depending on the number of ground truth questions, and the TPM capacity of the evaluation model, and the number of GPT metrics requested.
9193
9294
## Review the evaluation results
9395
@@ -113,4 +115,8 @@ python -m evaltools diff evals/results/baseline/ evals/results/SECONDRUNHERE
113115
114116
## Run bulk evaluation on a PR
115117
116-
To run the evaluation on the changes in a PR, you can add a `/evaluate` comment to the PR. This will trigger the evaluation workflow to run the evaluation on the PR changes and will post the results to the PR.
118+
This repository includes a GitHub Action workflow `evaluate.yaml` that can be used to run the evaluation on the changes in a PR.
119+
120+
In order for the workflow to run successfully, you must first set up [continuous integration](./azd.md#github-actions) for the repository.
121+
122+
To run the evaluation on the changes in a PR, a repository member can post a `/evaluate` comment to the PR. This will trigger the evaluation workflow to run the evaluation on the PR changes and will post the results to the PR.

docs/gpt4v.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
# RAG chat: Using GPT vision model with RAG approach
22

3+
[📺 Watch: (RAG Deep Dive series) Multimedia data ingestion](https://www.youtube.com/watch?v=5FfIy7G2WW0)
4+
35
This repository includes an optional feature that uses the GPT vision model to generate responses based on retrieved content. This feature is useful for answering questions based on the visual content of documents, such as photos and charts.
46

57
## How it works

docs/login_and_acl.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,8 @@ urlFragment: azure-search-openai-demo-document-security
1919

2020
# RAG chat: Setting up optional login and document level access control
2121

22+
[📺 Watch: (RAG Deep Dive series) Login and access control](https://www.youtube.com/watch?v=GwEiYJgM8Vw)
23+
2224
The [azure-search-openai-demo](/) project can set up a full RAG chat app on Azure AI Search and OpenAI so that you can chat on custom data, like internal enterprise data or domain-specific knowledge sets. For full instructions on setting up the project, consult the [main README](/README.md), and then return here for detailed instructions on configuring login and access control.
2325

2426
## Table of Contents

0 commit comments

Comments
 (0)