Before deploying the accelerator, ensure sufficient quota availability for the required model.
We recommend increasing the capacity to 100k tokens for optimal performance.
Log in to Azure if you have not done so already:
az login
Default models and capacities:
gpt-4o:150, gpt-4o-mini:150, gpt-4:150, text-embedding-3-small:100
Default regions:
eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southcentralus, canadacentral, australiaeast, japaneast, norwayeast
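The model list above is a comma-separated series of model:capacity pairs. As a minimal sketch of how such a value could be split into its parts (the function name `split_models` is hypothetical and is not taken from quota_check.sh):

```shell
# Hypothetical helper, not part of quota_check.sh.
# Prints one "model capacity" pair per line for a comma-separated
# list of model:capacity entries.
split_models() {
  # Turn commas into newlines; read trims the spaces around each entry.
  printf '%s\n' "$1" | tr ',' '\n' | while read -r entry; do
    [ -n "$entry" ] || continue
    case "$entry" in
      *:*) ;;  # well-formed entry, handled below
      *) echo "Skipping malformed entry: $entry" >&2; continue ;;
    esac
    model=${entry%:*}      # everything before the last colon
    capacity=${entry##*:}  # everything after the last colon
    echo "$model $capacity"
  done
}

split_models "gpt-4o:150,text-embedding-3-small:100"
# prints:
# gpt-4o 150
# text-embedding-3-small 100
```

Splitting at the last colon keeps the sketch independent of the characters that appear in the model name itself.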
- No parameters passed → Default models and capacities will be checked in default regions.
- Only model(s) provided → The script will check for those models in the default regions.
- Only region(s) provided → The script will check default models in the specified regions.
- Both models and regions provided → The script will check those models in the specified regions.
- --verbose passed → Enables detailed logging output for debugging and traceability.
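The fallback behavior described above can be sketched as a small option parser. This is an illustrative sketch only, not the actual contents of quota_check.sh; the function name `parse_args` and the variable names are assumptions, and the defaults are copied from the lists in this guide.

```shell
# Hypothetical sketch of the parameter handling; not the real quota_check.sh.
DEFAULT_MODELS="gpt-4o:150,gpt-4o-mini:150,gpt-4:150,text-embedding-3-small:100"
DEFAULT_REGIONS="eastus,uksouth,eastus2,northcentralus,swedencentral,westus,westus2,southcentralus,canadacentral,australiaeast,japaneast,norwayeast"

parse_args() {
  # Start from the defaults; each option overrides only its own setting.
  MODELS="$DEFAULT_MODELS"
  REGIONS="$DEFAULT_REGIONS"
  VERBOSE=false
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --models)
        [ "$#" -ge 2 ] || { echo "--models needs a value" >&2; return 1; }
        MODELS="$2"; shift 2 ;;
      --regions)
        [ "$#" -ge 2 ] || { echo "--regions needs a value" >&2; return 1; }
        REGIONS="$2"; shift 2 ;;
      --verbose)
        VERBOSE=true; shift ;;
      *)
        echo "Unknown option: $1" >&2; return 1 ;;
    esac
  done
}

# Only models provided: regions fall back to the defaults.
parse_args --models gpt-4o:150 --verbose
echo "models=$MODELS"
echo "verbose=$VERBOSE"
```

Resetting all three variables at the top of the function is what makes each of the four combinations in the list fall out of the same loop.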
Use the --models, --regions, and --verbose options for parameter handling:
✔️ Run without parameters to check default models & regions without verbose logging:
./quota_check.sh
✔️ Enable verbose logging:
./quota_check.sh --verbose
✔️ Check specific model(s) in default regions:
./quota_check.sh --models gpt-4o:150,text-embedding-3-small:100
✔️ Check default models in specific region(s):
./quota_check.sh --regions eastus,westus
✔️ Check specific model(s) in specific region(s):
./quota_check.sh --models gpt-4o:150 --regions eastus,westus2
✔️ All parameters combined:
./quota_check.sh --models gpt-4:150,text-embedding-3-small:100 --regions eastus,westus --verbose
✔️ Multiple models with single region:
./quota_check.sh --models gpt-4:150,text-embedding-3-small:100 --regions eastus2 --verbose
The final table lists regions with available quota. You can select any of these regions for deployment.
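To illustrate what "available quota" means here, the sketch below filters regions by comparing remaining capacity (limit minus current usage) against the requested capacity. It assumes the limit and usage figures have already been collected, for example with `az cognitiveservices usage list --location <region>`; that data source, the input layout, and the function names `has_quota` and `filter_regions` are assumptions for illustration and are not confirmed by this guide.

```shell
# Hypothetical helpers, not part of quota_check.sh.

# has_quota LIMIT USED REQUIRED
# Succeeds when LIMIT - USED is at least REQUIRED.
has_quota() {
  # Drop any fractional part (e.g. "150.0" becomes "150").
  hq_limit=${1%.*}
  hq_used=${2%.*}
  hq_required=${3%.*}
  [ $(( hq_limit - hq_used )) -ge "$hq_required" ]
}

# filter_regions REQUIRED
# Reads "region limit used" lines on stdin and prints the regions
# that still have enough quota.
filter_regions() {
  fr_required="$1"
  while read -r fr_region fr_limit fr_used; do
    if has_quota "$fr_limit" "$fr_used" "$fr_required"; then
      echo "$fr_region"
    fi
  done
}

# Sample figures, invented for this example only.
printf '%s\n' "eastus 200 100" "westus 150 0" | filter_regions 150
# prints: westus
```

In this sample, eastus has only 100 units remaining and is filtered out, while westus has the full 150 available.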
- Navigate to the Azure Portal.
- Click on Azure Cloud Shell in the top right navigation menu.
- Run the appropriate command based on your requirement:
  To check quota for the deployment:
    curl -L -o quota_check.sh "https://raw.githubusercontent.com/microsoft/Deploy-Your-AI-Application-In-Production/main/scripts/quota_check.sh"
    chmod +x quota_check.sh
    ./quota_check.sh
  - Refer to Input Formats for detailed commands.

- Open the terminal in VS Code or Codespaces.
- If you're using VS Code, click the dropdown on the right side of the terminal window, and select Git Bash.
- Navigate to the scripts folder where the script files are located and make the script executable:
    cd scripts
    chmod +x quota_check.sh
- Run the appropriate script based on your requirement:
  To check quota for the deployment:
    ./quota_check.sh
  - Refer to Input Formats for detailed commands.
