Skip to content

Commit f666ab1

Browse files
updated gpt model capacity and fixed aideployment location issue
1 parent f126101 commit f666ab1

File tree

3 files changed

+8
-7
lines changed

3 files changed

+8
-7
lines changed

.github/workflows/deploy.yml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,8 +14,8 @@ on:
1414
- cron: '0 9,21 * * *' # Runs at 9:00 AM and 9:00 PM GMT
1515

1616
env:
17-
GPT_MIN_CAPACITY: 250
18-
TEXT_EMBEDDING_MIN_CAPACITY: 40
17+
GPT_MIN_CAPACITY: 150
18+
TEXT_EMBEDDING_MIN_CAPACITY: 80
1919
BRANCH_NAME: ${{ github.head_ref || github.ref_name }}
2020

2121
jobs:
@@ -148,6 +148,7 @@ jobs:
148148
gptDeploymentCapacity=${{ env.GPT_MIN_CAPACITY }} \
149149
embeddingModel="text-embedding-ada-002" \
150150
embeddingDeploymentCapacity=${{ env.TEXT_EMBEDDING_MIN_CAPACITY }} \
151+
aiDeploymentsLocation=${{ env.AZURE_LOCATION }} \
151152
imageTag="${IMAGE_TAG}"
152153
153154
- name: Get Deployment Output and extract Values

docs/QuotaCheck.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ azd auth login
1414

1515
### 📌 Default Models & Capacities:
1616
```
17-
gpt4.1:30, text-embedding-ada-002:80, gpt-4:30
17+
gpt4.1:150, text-embedding-ada-002:80, gpt-4:150
1818
```
1919
### 📌 Default Regions:
2020
```
@@ -40,19 +40,19 @@ francecentral, australiaeast, uksouth, eastus2, northcentralus, swedencentral, w
4040
```
4141
✔️ Check specific model(s) in default regions:
4242
```
43-
./quota_check_params.sh --models gpt4.1:30,text-embedding-ada-002:80
43+
./quota_check_params.sh --models gpt4.1:150,text-embedding-ada-002:80
4444
```
4545
✔️ Check default models in specific region(s):
4646
```
4747
./quota_check_params.sh --regions eastus2,westus
4848
```
4949
✔️ Passing Both models and regions:
5050
```
51-
./quota_check_params.sh --models gpt4.1:30 --regions eastus2,westus2
51+
./quota_check_params.sh --models gpt4.1:150 --regions eastus2,westus2
5252
```
5353
✔️ All parameters combined:
5454
```
55-
./quota_check_params.sh --models gpt-4:30,text-embedding-ada-002:80 --regions eastus2,westus --verbose
55+
./quota_check_params.sh --models gpt-4:150,text-embedding-ada-002:80 --regions eastus2,westus --verbose
5656
```
5757

5858
### **Sample Output**

scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ log_verbose() {
4747
}
4848

4949
# Default Models and Capacities (Comma-separated in "model:capacity" format)
50-
DEFAULT_MODEL_CAPACITY="gpt4.1:30,text-embedding-ada-002:80"
50+
DEFAULT_MODEL_CAPACITY="gpt4.1:150,text-embedding-ada-002:80"
5151

5252
# Convert the comma-separated string into an array
5353
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"

0 commit comments

Comments
 (0)