Skip to content

Commit ce5c9f3

Browse files
Merge pull request microsoft#329 from microsoft/main
chore: down merge main to dev
2 parents 8494ea9 + aa5eb59 commit ce5c9f3

File tree

4 files changed

+7
-8
lines changed

4 files changed

+7
-8
lines changed

.github/workflows/deploy-waf.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ jobs:
2121
export AZURE_TENANT_ID=${{ secrets.AZURE_TENANT_ID }}
2222
export AZURE_CLIENT_SECRET=${{ secrets.AZURE_CLIENT_SECRET }}
2323
export AZURE_SUBSCRIPTION_ID="${{ secrets.AZURE_SUBSCRIPTION_ID }}"
24-
export GPT_MIN_CAPACITY="5"
24+
export GPT_MIN_CAPACITY="150"
2525
export AZURE_REGIONS="${{ vars.AZURE_REGIONS }}"
2626
2727
chmod +x infra/scripts/checkquota.sh

.github/workflows/deploy.yml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,6 @@
11
name: Validate Deployment
22

33
on:
4-
push:
54
workflow_run:
65
workflows: ["Build Docker and Optional Push"]
76
types:

docs/quota_check.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
## Check Quota Availability Before Deployment
22

33
Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
4-
> **For Global Standard | GPT-4o - the capacity to at least 140k tokens for optimal performance.**
4+
> **For Global Standard | GPT-4o - the capacity to at least 150k tokens for optimal performance.**
55
66
### Login if you have not done so already
77
```
@@ -11,7 +11,7 @@ azd auth login
1111

1212
### 📌 Default Models & Capacities:
1313
```
14-
gpt-4o:140
14+
gpt-4o:150
1515
```
1616
### 📌 Default Regions:
1717
```
@@ -37,19 +37,19 @@ eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southc
3737
```
3838
✔️ Check specific model(s) in default regions:
3939
```
40-
./quota_check_params.sh --models gpt-4o:140
40+
./quota_check_params.sh --models gpt-4o:150
4141
```
4242
✔️ Check default models in specific region(s):
4343
```
4444
./quota_check_params.sh --regions eastus,westus
4545
```
4646
✔️ Passing Both models and regions:
4747
```
48-
./quota_check_params.sh --models gpt-4o:140 --regions eastus,westus2
48+
./quota_check_params.sh --models gpt-4o:150 --regions eastus,westus2
4949
```
5050
✔️ All parameters combined:
5151
```
52-
./quota_check_params.sh --models gpt-4o:140 --regions eastus,westus --verbose
52+
./quota_check_params.sh --models gpt-4o:150 --regions eastus,westus --verbose
5353
```
5454

5555
### **Sample Output**

infra/scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ log_verbose() {
4747
}
4848

4949
# Default Models and Capacities (Comma-separated in "model:capacity" format)
50-
DEFAULT_MODEL_CAPACITY="gpt-4o:50"
50+
DEFAULT_MODEL_CAPACITY="gpt-4o:150"
5151
# Convert the comma-separated string into an array
5252
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"
5353

0 commit comments

Comments
 (0)