|
1 | 1 | ## Check Quota Availability Before Deployment |
2 | 2 |
|
3 | | -Before deploying the accelerator, **ensure sufficient quota availability** for the required model. |
4 | | -Use one of the following scripts based on your needs: |
| 3 | +Before deploying the accelerator, **ensure sufficient quota availability** for the required model. |
5 | 4 |
|
6 | | -- **`quota_check_params.sh`** → If you **know the model and capacity** required. |
7 | | -- **`quota_check_all_regions.sh`** → If you **want to check available capacity across all regions** for supported models. |
| 5 | +### 📌 Default Models & Capacities: |
| 6 | +``` |
| 7 | +gpt-4o:30, gpt-4o-mini:30, gpt-4:30, text-embedding-ada-002:80 |
| 8 | +``` |
| 9 | +### 📌 Default Regions: |
| 10 | +``` |
| 11 | +eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southcentralus, canadacentral |
| 12 | +``` |
| 13 | +### Usage Scenarios: |
| 14 | +- No parameters passed → Default models and capacities will be checked in default regions. |
| 15 | +- Only model(s) provided → The script will check for those models in the default regions. |
| 16 | +- Only region(s) provided → The script will check default models in the specified regions. |
| 17 | +- Both models and regions provided → The script will check those models in the specified regions. |
| 18 | + |
| 19 | +### **Input Formats** |
| 20 | +✔️ Run without parameters to check default models & regions: |
| 21 | + ``` |
| 22 | + ./quota_check_params.sh |
| 23 | + ``` |
| 24 | +✔️ Model name and required capacity in the format: |
| 25 | + ``` |
| 26 | + ./quota_check_params.sh gpt-4o:30 |
| 27 | + ``` |
| 28 | +✔️ Multiple models can be passed, separated by commas: |
| 29 | + ``` |
| 30 | + ./quota_check_params.sh gpt-4o:30,text-embedding-ada-002:80 |
| 31 | + ``` |
| 32 | +✔️ Passing Both models and regions: |
| 33 | + ``` |
| 34 | + ./quota_check_params.sh gpt-4o:30 eastus,westus2 |
| 35 | + ``` |
| 36 | +✔️ Check default models in specific regions: |
| 37 | + ``` |
| 38 | + ./quota_check_params.sh "" eastus,westus2 |
| 39 | + ``` |
| 40 | + |
| 41 | +### **Sample Output** |
| 42 | +The final table lists regions with available quota. You can select any of these regions for deployment. |
| 43 | + |
| 44 | + |
8 | 45 |
|
9 | 46 | --- |
10 | | -## **If using Azure Portal and Cloud Shell** |
| 47 | +### **If using Azure Portal and Cloud Shell** |
11 | 48 |
|
12 | 49 | 1. Navigate to the [Azure Portal](https://portal.azure.com). |
13 | 50 | 2. Click on **Azure Cloud Shell** in the top right navigation menu. |
14 | 51 | 3. Run the appropriate command based on your requirement: |
15 | 52 |
|
16 | | - **To check quota for a specific model and capacity:** |
| 53 | + **To check quota for the deployment** |
17 | 54 |
|
18 | 55 | ```sh |
19 | 56 | curl -L -o quota_check_params.sh "https://raw.githubusercontent.com/microsoft/Conversation-Knowledge-Mining-Solution-Accelerator/main/infra/scripts/quota_check_params.sh" |
20 | 57 | chmod +x quota_check_params.sh |
21 | | - ./quota_check_params.sh <model_name:capacity> [<model_region>] (e.g., gpt-4o-mini:30,text-embedding-ada-002:20 eastus) |
| 58 | + ./quota_check_params.sh |
22 | 59 | ``` |
23 | | - |
24 | | - **To check available quota across all regions for supported models:** |
25 | | - |
26 | | - ```sh |
27 | | - curl -L -o quota_check_all_regions.sh "https://raw.githubusercontent.com/microsoft/Conversation-Knowledge-Mining-Solution-Accelerator/main/infra/scripts/quota_check_all_regions.sh" |
28 | | - chmod +x quota_check_all_regions.sh |
29 | | - ./quota_check_all_regions.sh |
| 60 | + - Refer to [Input Formats](#input-formats) for detailed commands. |
| 61 | + |
| 62 | +### **If using VS Code or Codespaces** |
| 63 | +1. Open the terminal in VS Code or Codespaces. |
| 64 | +2. If you're using VS Code, click the dropdown on the right side of the terminal window, and select `Git Bash`. |
| 65 | +  |
| 66 | +3. Navigate to the `scripts` folder where the script files are located and make the script as executable: |
| 67 | + ```sh |
| 68 | + cd infra/scripts |
| 69 | + chmod +x quota_check_params.sh |
30 | 70 | ``` |
31 | | - |
32 | | -## **If using VS Code or Codespaces** |
33 | | - |
34 | | -1. Run the appropriate script based on your requirement: |
| 71 | +4. Run the appropriate script based on your requirement: |
35 | 72 |
|
36 | | - **To check quota for a specific model and capacity:** |
| 73 | + **To check quota for the deployment** |
37 | 74 |
|
38 | 75 | ```sh |
39 | | - ./quota_check_params.sh <model_name:capacity> [<model_region>] (e.g., gpt-4o-mini:30,text-embedding-ada-002:20 eastus) |
| 76 | + ./quota_check_params.sh |
40 | 77 | ``` |
| 78 | + - Refer to [Input Formats](#input-formats) for detailed commands. |
41 | 79 |
|
42 | | - **To check available quota across all regions for supported models:** |
43 | | - |
44 | | - ```sh |
45 | | - ./quota_check_all_regions.sh |
46 | | - ``` |
47 | | -2. If you see the error `_bash: az: command not found_`, install Azure CLI: |
| 80 | +5. If you see the error `_bash: az: command not found_`, install Azure CLI: |
48 | 81 |
|
49 | 82 | ```sh |
50 | 83 | curl -sL https://aka.ms/InstallAzureCLIDeb | sudo bash |
51 | 84 | az login |
52 | 85 | ``` |
53 | | -3. Rerun the script after installing Azure CLI. |
54 | | - |
55 | | - **Parameters** |
56 | | - - `<model_name:capacity>`: The name and required capacity for each model, in the format model_name:capacity (**e.g., gpt-4o-mini:30,text-embedding-ada-002:20**). |
57 | | - - `[<model_region>] (optional)`: The Azure region to check first. If not provided, all supported regions will be checked (**e.g., eastus**). |
| 86 | +6. Rerun the script after installing Azure CLI. |
0 commit comments