You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# If you encounter a blocking error during a DefaultAzureCredential resolution, you can exclude the problematic credential by using a parameter (ex. exclude_shared_token_cache_credential=True)
help="Optional. Use this Azure AI Search account key instead of the current user identity to login (use az login to set current user for Azure)",
293
281
)
294
-
parser.add_argument(
295
-
"--searchsecretname",
296
-
required=False,
297
-
help="Required if searchkey is not provided and search service is free sku. Fetch the Azure AI Vision key from this keyvault instead of the instead of the current user identity to login (use az login to set current user for Azure)",
Copy file name to clipboardExpand all lines: docs/deploy_lowcost.md
+8-11Lines changed: 8 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
3
3
This AI RAG chat application is designed to be easily deployed using the Azure Developer CLI, which provisions the infrastructure according to the Bicep files in the `infra` folder. Those files describe each of the Azure resources needed, and configures their SKU (pricing tier) and other parameters. Many Azure services offer a free tier, but the infrastructure files in this project do *not* default to the free tier as there are often limitations in that tier.
4
4
5
-
However, if your goal is to minimize costs while prototyping your application, follow the steps below _before_ running `azd up`. Once you've gone through these steps, return to the [deployment steps](../README.md#deploying).
5
+
However, if your goal is to minimize costs while prototyping your application, follow the steps below *before* running `azd up`. Once you've gone through these steps, return to the [deployment steps](../README.md#deploying).
6
6
7
7
[📺 Live stream: Deploying from a free account](https://www.youtube.com/watch?v=nlIyos0RXHw)
8
8
@@ -21,15 +21,15 @@ However, if your goal is to minimize costs while prototyping your application, f
21
21
Enter a name that will be used for the resource group.
22
22
This will create a new folder in the `.azure` folder, and set it as the active environment for any calls to `azd` going forward.
23
23
24
-
2. Use the free tier of App Service:
24
+
1. Use the free tier of App Service:
25
25
26
26
```shell
27
27
azd env set AZURE_APP_SERVICE_SKU F1
28
28
```
29
29
30
30
Limitation: You are only allowed a certain number of free App Service instances per region. If you have exceeded your limit in a region, you will get an error during the provisioning stage. If that happens, you can run `azd down`, then`azd env new` to create a new environment with a new region.
31
31
32
-
3. Use the free tier of Azure AI Search:
32
+
1. Use the free tier of Azure AI Search:
33
33
34
34
```shell
35
35
azd env set AZURE_SEARCH_SERVICE_SKU free
@@ -41,11 +41,8 @@ However, if your goal is to minimize costs while prototyping your application, f
41
41
reuse your [existing search service](../README.md#existing-azure-ai-search-resource).
42
42
2. The free tier does not support semantic ranker, so the app UI will no longer display
43
43
the option to use the semantic ranker. Note that will generally result in [decreased search relevance](https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-ai-search-outperforming-vector-search-with-hybrid/ba-p/3929167).
44
-
3. The free tier does not support Managed Identity (keyless API access),
45
-
so the Bicep will use Azure Key Vault to securely store the key instead.
46
-
47
-
4. Use the free tier of Azure Document Intelligence (used in analyzing files):
48
44
45
+
1. Use the free tier of Azure Document Intelligence (used in analyzing files):
49
46
50
47
```shell
51
48
azd env set AZURE_DOCUMENTINTELLIGENCE_SKU F0
@@ -74,7 +71,7 @@ However, if your goal is to minimize costs while prototyping your application, f
74
71
azd env set USE_LOCAL_HTML_PARSER true
75
72
```
76
73
77
-
5. Turn off Azure Monitor (Application Insights):
74
+
1. Turn off Azure Monitor (Application Insights):
78
75
79
76
```shell
80
77
azd env set AZURE_USE_APPLICATION_INSIGHTS false
@@ -83,7 +80,7 @@ However, if your goal is to minimize costs while prototyping your application, f
83
80
Application Insights is quite inexpensive already, so turning this off may not be worth the costs saved,
84
81
but it is an option for those who want to minimize costs.
85
82
86
-
6. Use OpenAI.com instead of Azure OpenAI: This is only a necessary step for Azure free/student accounts, as they do not currently have access to Azure OpenAI.
83
+
1. Use OpenAI.com instead of Azure OpenAI: This is only a necessary step for Azure free/student accounts, as they do not currently have access to Azure OpenAI.
87
84
88
85
```shell
89
86
azd env set OPENAI_HOST openai
@@ -94,7 +91,7 @@ However, if your goal is to minimize costs while prototyping your application, f
94
91
Both Azure OpenAI and openai.com OpenAI accounts will incur costs, based on tokens used,
95
92
but the costs are fairly low for the amount of sample data (less than $10).
96
93
97
-
6. Disable vector search:
94
+
1. Disable vector search:
98
95
99
96
```shell
100
97
azd env set USE_VECTORS false
@@ -106,7 +103,7 @@ However, if your goal is to minimize costs while prototyping your application, f
106
103
so the benefits of vector search would typically outweigh the costs, but it is possible to disable vector support.
107
104
If you do so, the application will fall back to a keyword search, which is less accurate.
108
105
109
-
7. Once you've made the desired customizations, follow the steps in the README [to run `azd up`](../README.md#deploying-from-scratch). We recommend using "eastus" as the region, for availability reasons.
106
+
1. Once you've made the desired customizations, follow the steps in the README [to run `azd up`](../README.md#deploying-from-scratch). We recommend using "eastus" as the region, for availability reasons.
0 commit comments