Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions .github/workflows/deploy.yml
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@ on:
- cron: '0 9,21 * * *' # Runs at 9:00 AM and 9:00 PM GMT

env:
GPT_MIN_CAPACITY: 250
TEXT_EMBEDDING_MIN_CAPACITY: 40
GPT_MIN_CAPACITY: 150
TEXT_EMBEDDING_MIN_CAPACITY: 80
BRANCH_NAME: ${{ github.head_ref || github.ref_name }}

jobs:
Expand Down Expand Up @@ -148,6 +148,7 @@ jobs:
gptDeploymentCapacity=${{ env.GPT_MIN_CAPACITY }} \
embeddingModel="text-embedding-ada-002" \
embeddingDeploymentCapacity=${{ env.TEXT_EMBEDDING_MIN_CAPACITY }} \
aiDeploymentsLocation=${{ env.AZURE_LOCATION }} \
imageTag="${IMAGE_TAG}"

- name: Get Deployment Output and extract Values
Expand Down
8 changes: 4 additions & 4 deletions docs/QuotaCheck.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ azd auth login

### 📌 Default Models & Capacities:
```
gpt4.1:30, text-embedding-ada-002:80, gpt-4:30
gpt4.1:150, text-embedding-ada-002:80, gpt-4:150
```
### 📌 Default Regions:
```
Expand All @@ -40,19 +40,19 @@ francecentral, australiaeast, uksouth, eastus2, northcentralus, swedencentral, w
```
✔️ Check specific model(s) in default regions:
```
./quota_check_params.sh --models gpt4.1:30,text-embedding-ada-002:80
./quota_check_params.sh --models gpt4.1:150,text-embedding-ada-002:80
```
✔️ Check default models in specific region(s):
```
./quota_check_params.sh --regions eastus2,westus
```
✔️ Passing Both models and regions:
```
./quota_check_params.sh --models gpt4.1:30 --regions eastus2,westus2
./quota_check_params.sh --models gpt4.1:150 --regions eastus2,westus2
```
✔️ All parameters combined:
```
./quota_check_params.sh --models gpt-4:30,text-embedding-ada-002:80 --regions eastus2,westus --verbose
./quota_check_params.sh --models gpt-4:150,text-embedding-ada-002:80 --regions eastus2,westus --verbose
```

### **Sample Output**
Expand Down
2 changes: 1 addition & 1 deletion scripts/quota_check_params.sh
Original file line number Diff line number Diff line change
Expand Up @@ -47,7 +47,7 @@ log_verbose() {
}

# Default Models and Capacities (Comma-separated in "model:capacity" format)
DEFAULT_MODEL_CAPACITY="gpt4.1:30,text-embedding-ada-002:80"
DEFAULT_MODEL_CAPACITY="gpt4.1:150,text-embedding-ada-002:80"

# Convert the comma-separated string into an array
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"
Expand Down
Loading