Skip to content

Commit 36ab95a

Browse files
Merge pull request #529 from microsoft/update-model-capacity-similar-to-bicep
fix: update model capacity similar to bicep
2 parents 12f619d + 3b14454 commit 36ab95a

File tree

3 files changed

+7
-7
lines changed

3 files changed

+7
-7
lines changed

.github/workflows/deploy-KMGeneric.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -40,8 +40,8 @@ jobs:
4040
export AZURE_TENANT_ID=${{ secrets.AZURE_TENANT_ID }}
4141
export AZURE_CLIENT_SECRET=${{ secrets.AZURE_CLIENT_SECRET }}
4242
export AZURE_SUBSCRIPTION_ID="${{ secrets.AZURE_SUBSCRIPTION_ID }}"
43-
export GPT_MIN_CAPACITY="150"
44-
export TEXT_EMBEDDING_MIN_CAPACITY="80"
43+
export GPT_MIN_CAPACITY=${{ env.GPT_MIN_CAPACITY }}
44+
export TEXT_EMBEDDING_MIN_CAPACITY=${{ env.TEXT_EMBEDDING_MIN_CAPACITY }}
4545
export AZURE_REGIONS="${{ vars.AZURE_REGIONS_KM }}"
4646
chmod +x infra/scripts/checkquota_km.sh
4747
if ! infra/scripts/checkquota_km.sh; then

documents/QuotaCheck.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ azd auth login
1010

1111
### 📌 Default Models & Capacities:
1212
```
13-
gpt-4o:30, gpt-4o-mini:30, gpt-4:30, text-embedding-ada-002:80
13+
gpt-4o:150, gpt-4o-mini:150, gpt-4:150, text-embedding-ada-002:80
1414
```
1515
### 📌 Default Regions:
1616
```
@@ -36,19 +36,19 @@ eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southc
3636
```
3737
✔️ Check specific model(s) in default regions:
3838
```
39-
./quota_check_params.sh --models gpt-4o:30,text-embedding-ada-002:80
39+
./quota_check_params.sh --models gpt-4o:150,text-embedding-ada-002:80
4040
```
4141
✔️ Check default models in specific region(s):
4242
```
4343
./quota_check_params.sh --regions eastus,westus
4444
```
4545
✔️ Passing Both models and regions:
4646
```
47-
./quota_check_params.sh --models gpt-4o:30 --regions eastus,westus2
47+
./quota_check_params.sh --models gpt-4o:150 --regions eastus,westus2
4848
```
4949
✔️ All parameters combined:
5050
```
51-
./quota_check_params.sh --models gpt-4:30,text-embedding-ada-002:80 --regions eastus,westus --verbose
51+
./quota_check_params.sh --models gpt-4:150,text-embedding-ada-002:80 --regions eastus,westus --verbose
5252
```
5353

5454
### **Sample Output**

infra/scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ log_verbose() {
4747
}
4848

4949
# Default Models and Capacities (Comma-separated in "model:capacity" format)
50-
DEFAULT_MODEL_CAPACITY="gpt-4o:30,gpt-4o-mini:30,gpt-4:30,text-embedding-ada-002:80"
50+
DEFAULT_MODEL_CAPACITY="gpt-4o:150,gpt-4o-mini:150,gpt-4:150,text-embedding-ada-002:80"
5151

5252
# Convert the comma-separated string into an array
5353
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"

0 commit comments

Comments
 (0)