Commit 1cc6510

Author: Harmanpreet Kaur
Commit message: edit 2
1 parent d53ad79 commit 1cc6510

6 files changed: +9 -9 lines changed

docs/CustomizingAzdParameters.md

Lines changed: 1 addition & 1 deletion
@@ -18,7 +18,7 @@ Change the Model Deployment Type (allowed values: Standard, GlobalStandard)
 azd env set AZURE_ENV_MODEL_DEPLOYMENT_TYPE Standard
 ```
 
-Set the Model Name (allowed values: gpt-4, gpt-4o,gpt-4.1)
+Set the Model Name
 
 ```shell
 azd env set AZURE_ENV_MODEL_NAME gpt-4.1

docs/DeploymentGuide.md

Lines changed: 1 addition & 1 deletion
@@ -103,7 +103,7 @@ When you start the deployment, most parameters will have **default values**, but
 | **Environment Name** | A **3-20 character alphanumeric value** used to generate a unique ID to prefix the resources. | byctemplate |
 | **Secondary Location** | A **less busy** region for **CosmosDB**, useful in case of availability constraints. | eastus2 |
 | **Deployment Type** | Select from a drop-down list. | Global Standard |
-| **GPT Model** | Choose from **gpt-4, gpt-4o , gpt-4.1** | gpt-4.1 |
+| **GPT Model** | Choose from **gpt-4, gpt-4.1** | gpt-4.1 |
 | **GPT Model Deployment Capacity** | Configure capacity for **GPT models**. | 30k |
 | **Embedding Model** | Default: **text-embedding-ada-002**. | text-embedding-ada-002 |
 | **Embedding Model Capacity** | Set the capacity for **embedding models**. | 80k |

docs/QuotaCheck.md

Lines changed: 4 additions & 4 deletions
@@ -2,7 +2,7 @@
 
 Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
 
-> **For Global Standard | GPT-4o |GPT-4.1- the capacity to at least 150k tokens post-deployment for optimal performance.**
+> **For Global Standard |GPT-4.1- the capacity to at least 150k tokens post-deployment for optimal performance.**
 
 > **For Standard | GPT-4 - ensure a minimum of 30k–40k tokens for best results.**
 
@@ -14,7 +14,7 @@ azd auth login
 
 ### 📌 Default Models & Capacities:
 ```
-gpt-4.1:30, text-embedding-ada-002:80, gpt-4:30, gpt-4o:30
+gpt-4.1:30, text-embedding-ada-002:80, gpt-4:30
 ```
 ### 📌 Default Regions:
 ```
@@ -40,15 +40,15 @@ eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southc
 ```
 ✔️ Check specific model(s) in default regions:
 ```
-./quota_check_params.sh --models gpt-4o:30,text-embedding-ada-002:80
+./quota_check_params.sh --models gpt-4.1:30,text-embedding-ada-002:80
 ```
 ✔️ Check default models in specific region(s):
 ```
 ./quota_check_params.sh --regions eastus,westus
 ```
 ✔️ Passing Both models and regions:
 ```
-./quota_check_params.sh --models gpt-4o:30 --regions eastus,westus2
+./quota_check_params.sh --models gpt-4.1:30 --regions eastus,westus2
 ```
 ✔️ All parameters combined:
 ```

infra/main.bicep

Lines changed: 1 addition & 1 deletion
@@ -385,7 +385,7 @@ module appserviceModule 'deploy_app_service.bicep' = {
 aiSearchService: aifoundry.outputs.aiSearchService
 AzureSearchKey: keyVault.getSecret('AZURE-SEARCH-KEY')
 AzureOpenAIEndpoint:aifoundry.outputs.aiServicesTarget
-AzureOpenAIModel: gptModelName //'gpt-4o-mini'
+AzureOpenAIModel: gptModelName
 AzureOpenAIKey:keyVault.getSecret('AZURE-OPENAI-KEY')
 azureOpenAIApiVersion: azureOpenaiAPIVersion //'2024-02-15-preview'
 AZURE_OPENAI_RESOURCE:aifoundry.outputs.aiServicesName

infra/scripts/index_scripts/02_process_data.py

Lines changed: 1 addition & 1 deletion
@@ -32,7 +32,7 @@ def get_secrets_from_kv(kv_name, secret_name):
 openai_api_key = get_secrets_from_kv(key_vault_name, "AZURE-OPENAI-KEY")
 openai_api_base = get_secrets_from_kv(key_vault_name, "AZURE-OPENAI-ENDPOINT")
 openai_api_version = get_secrets_from_kv(key_vault_name, "AZURE-OPENAI-PREVIEW-API-VERSION")
-deployment = get_secrets_from_kv(key_vault_name, "AZURE-OPEN-AI-DEPLOYMENT-MODEL") # "gpt-4o-mini"
+deployment = get_secrets_from_kv(key_vault_name, "AZURE-OPEN-AI-DEPLOYMENT-MODEL")
 
 
 # Function: Get Embeddings

scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
@@ -47,7 +47,7 @@ log_verbose() {
 }
 
 # Default Models and Capacities (Comma-separated in "model:capacity" format)
-DEFAULT_MODEL_CAPACITY="gpt-4.1:30,text-embedding-ada-002:80,gpt-4:30,gpt-4o:30"
+DEFAULT_MODEL_CAPACITY="gpt-4.1:30,text-embedding-ada-002:80,gpt-4:30"
 
 # Convert the comma-separated string into an array
 IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"
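
The hunk above only shows the default list being split on commas; how the script then consumes each `model:capacity` pair is not visible in this commit. As a rough illustration of that format, here is a minimal sketch that separates each pair into its model name and capacity. The helper name `parse_model_capacity` is hypothetical and is not part of `quota_check_params.sh`; it also uses portable parameter expansion rather than the Bash array used in the script.

```shell
# Hypothetical helper (an assumption, not taken from quota_check_params.sh):
# print one "model capacity" line per comma-separated "model:capacity" pair.
# The body runs in a subshell, so the IFS and globbing changes stay local.
parse_model_capacity() (
    IFS=','
    set -f                       # disable globbing during the unquoted split
    for pair in $1; do
        model=${pair%%:*}        # text before the first colon
        capacity=${pair##*:}     # text after the last colon
        printf '%s %s\n' "$model" "$capacity"
    done
)

# The default list as it stands after this commit (gpt-4o removed).
DEFAULT_MODEL_CAPACITY="gpt-4.1:30,text-embedding-ada-002:80,gpt-4:30"
parse_model_capacity "$DEFAULT_MODEL_CAPACITY"
```

With the post-commit default, this yields three model/capacity pairs, one fewer than before the `gpt-4o:30` entry was dropped.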
