You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* update some broken links
* rename `development` section to `configure your environment`, add a description
* update alt text and add a caption to the architecture diagram
Copy file name to clipboardExpand all lines: README.md
+12-9Lines changed: 12 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# Getting Started with Agents Using Azure AI Foundry
2
2
3
-
MENU: [**FEATURES**](#features)\|[**QUICK DEPLOY**](#quick-deploy)\|[**GETTING STARTED**](#getting-started)\|[**DEVELOPMENT**](#development)\|[**DEPLOYMENT**](#deployment)\|[**TRACING AND MONITORING**](#tracing-and-monitoring)\|[**DEVELOPMENT OPTIONS**](#development-options)\|[**GUIDANCE**](#guidance)\|[**TROUBLESHOOTING**](#troubleshooting)
3
+
MENU: [**FEATURES**](#features)\|[**GETTING STARTED**](#getting-started)\|[**CONFIGURE YOUR ENVIRONMENT**](#configure-your-environment)\|[**DEPLOYMENT**](#deployment)\|[**RESOURCE CLEAN-UP**](#resource-clean-up)\|[**TRACING AND MONITORING**](#tracing-and-monitoring)\|[**GUIDANCE**](#guidance)\|[**TROUBLESHOOTING**](#troubleshooting)
4
4
5
5
## Important Security Notice
6
6
@@ -18,7 +18,8 @@ Instructions are provided for deployment through GitHub Codespaces, VS Code Dev
18
18
19
19
#### Architecture diagram
20
20
21
-

21
+

22
+
The app code runs in Azure Container apps to process the user input and generate a response to the user. It leverages Azure AI projects and Azure AI services, including the model and agent.
22
23
23
24
## Getting Started
24
25
@@ -50,7 +51,9 @@ Make sure the following tools are installed:
This section details the customizable options for this solution, including agent model, knowledge retrieval, logging, tracing, and quota recommendations. If you want to proceed with the default settings, continue to the [deployment section](#deployment).
54
57
55
58
#### Code
56
59
If you are using one of the [Quick Deploy options](#quick-deploy), open the codespace now.
@@ -118,7 +121,7 @@ To enable message contents to be included in the traces, set the following envir
The default for the model capacity in deployment is 30k tokens. For optimal performance, it is recommended to increase to 100k tokens. You can change the capacity by following the steps in [setting capacity and deployment SKU](docs/deploy_customization.md#customizing-model-deployments).
124
127
@@ -235,13 +238,13 @@ Once you've opened the project in [Codespaces](#github-codespaces) or in [Dev Co
235
238
azd auth login
236
239
```
237
240
238
-
2. (Optional) If you would like to customize the deployment to [disable resources](docs/deploy_customization.md#disabling-resources), [customize resource names](docs/deploy_customization.md#customizing-resource-names), [customize the models](docs/deploy_customization.md#customizing-model-deployments) or [increase quota](docs/deploy_customization.md#customizing-model-deployments), you can follow those steps now.
241
+
2. (Optional) If you would like to customize the deployment to [disable resources](docs/deploy_customization.md#disabling-resources), [customize resource names](docs/deploy_customization.md#customizing-resource-names), [customize the models](docs/deploy_customization.md#customizing-model-deployments) or [increase quota](docs/deploy_customization.md#setting-capacity-and-deployment-sku), you can follow those steps now.
239
242
240
243
⚠️ **NOTE!** For optimal performance, the recommended quota is 100k tokens per minute. If you have the capacity, we recommend increasing the quota by running the following command:
241
244
```shell
242
245
azd env set AZURE_AI_AGENT_DEPLOYMENT_CAPACITY 100
243
246
```
244
-
If you do not increase your quota, you may encounter rate limit issues. If needed, you can increase the quota after deployment by editing your model in the Models and Endpoints tab of the [Azure AI Foundry Portal](https://ai.azure.com/).
247
+
⚠️ If you do not increase your quota, you may encounter rate limit issues. If needed, you can increase the quota after deployment by editing your model in the Models and Endpoints tab of the [Azure AI Foundry Portal](https://ai.azure.com/).
245
248
246
249
3. Provision and deploy all the resources by running the following in get-started-with-ai-agents directory:
247
250
@@ -252,7 +255,7 @@ Once you've opened the project in [Codespaces](#github-codespaces) or in [Dev Co
252
255
4. You will be prompted to provide an `azd` environment name (like "azureaiapp"), select a subscription from your Azure account, and select a location which has quota for all the resources. Then, it will provision the resources in your account and deploy the latest code.
253
256
254
257
* For guidance on selecting a region with quota and model availability, follow the instructions in the [quota recommendations](#quota-recommendations-optional) section and ensure that your model is available in your selected region by checking the [list of models supported by Azure AI Agent Service](https://learn.microsoft.com/azure/ai-services/agents/concepts/model-region-support)
255
-
* This deployment will take 8-12 minutes to provision the resources in your account and set up the solution with sample data.
258
+
* This deployment will take 7-10 minutes to provision the resources in your account and set up the solution with sample data.
256
259
* If you get an error or timeout with deployment, changing the location can help, as there may be availability constraints for the resources. You can do this by running `azd down` and deleting the `.azure` folder from your code, and then running `azd up` again and selecting a new region.
257
260
258
261
**NOTE!** If you get authorization failed and/or permission related errors during the deployment, please refer to the Azure account requirements in the [Prerequisites](#prerequisites) section. If you were recently granted these permissions, it may take a few minutes for the authorization to apply.
@@ -355,7 +358,7 @@ To ensure continued best practices in your own repository, we recommend that any
355
358
356
359
You may want to consider additional security measures, such as:
357
360
358
-
* Enabling Microsoft Defender for Cloud to [secure your Azure resources](https://learn.microsoft.com/azure/security-center/defender-for-cloud).
361
+
* Enabling Microsoft Defender for Cloud to [secure your Azure resources](https://learn.microsoft.com/azure/defender-for-cloud/).
359
362
* Protecting the Azure Container Apps instance with a [firewall](https://learn.microsoft.com/azure/container-apps/waf-app-gateway) and/or [Virtual Network](https://learn.microsoft.com/azure/container-apps/networking?tabs=workload-profiles-env%2Cazure-cli).
360
363
361
364
#### Resources
@@ -391,7 +394,7 @@ The template also includes dependent resources required by all AI Hub resources:
391
394
* If your agent is occasionally unresponsive, your model may have reached its rate limit. You can increase its quota by adjusting the bicep configuration or by editing the model in the Azure AI Foundry page for your project's model deployments.
392
395
* If your agent is crashing, confirm that you are using a model that you have deployed to your project.
393
396
* For easier agents configuration and streamlined integration with an existing assistants library, export the agent from Azure AI Foundry and implement a yaml-based configuration.
394
-
* This application is designed to serve multiple users on multiple browsers. This application uses cookies to ensure that the same thread is reused forconversations across multiple tabsin the same browser. If the browser is restarted, the old thread will continue to serve the user. However, if the application has a new agent after a server restart or a thread is deleted, a new thread will be created without requiring a browser refresh or signaling to the users.When users submit a message to the web server, the web server will create an agent, thread, and stream back a reply. The response contains `agent_id` and `thread_id`in cookies. As a result, each subsequent message sent to the web server will also contain these IDs. As long as the same agent is being used in the system and the thread can be retrieved in the cookie, the same thread will be used to serve the users.
397
+
* This application is designed to serve multiple users on multiple browsers. This application uses cookies to ensure that the same thread is reused forconversations across multiple tabsin the same browser. If the browser is restarted, the old thread will continue to serve the user. However, if the application has a new agent after a server restart or a thread is deleted, a new thread will be created without requiring a browser refresh or signaling to the users.When users submit a message to the web server, the web server will create an agent, thread, and stream back a reply. The response contains `agent_id` and `thread_id`in cookies. As a result, each subsequent message sent to the web server will also contain these IDs. As long as the same agent is being used in the system and the thread can be retrieved in the cookie, the same thread will be used to serve the users.
395
398
* For document handling, use filename-based downloads to avoid storing files in dictionaries.
396
399
* Intermittent errors may arise when retrieving filenames for file IDs, which may be mitigated by using a single worker and fresh threads for each new assistant.
397
400
* File citation can be enhanced by automatically including filenames to reduce manual steps.
Copy file name to clipboardExpand all lines: docs/deploy_customization.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
2
2
# Getting Started with Agents Using Azure AI Foundry: Deployment customization
3
3
4
-
This document describes how to customize the deployment of the Agents Chat with Azure AI Foundry. Once you follow the steps here, you can run `azd up` as described in the [Deploying](./README.md#deploying) steps.
4
+
This document describes how to customize the deployment of the Agents Chat with Azure AI Foundry. Once you follow the steps here, you can run `azd up` as described in the [Deploying](./../README.md#deploying-steps) steps.
0 commit comments