fix: updated model capacity minimum to 200 #602

Priyanka-Microsoft · 2025-07-14T07:17:14Z

Purpose

...
This pull request updates default model capacities and environment variables to align with new requirements for resource allocation. The most significant change raises the default gpt-4o-mini capacity in the quota check script to 200; the CI workflow variables and the documentation are adjusted to match.

Updates to model capacities:

.github/workflows/CAdeploy.yml: Reduced GPT_MIN_CAPACITY from 250 to 200 and increased TEXT_EMBEDDING_MIN_CAPACITY from 40 to 80 in the workflow environment variables.
infra/scripts/quota_check_params.sh: Updated DEFAULT_MODEL_CAPACITY to set gpt-4o-mini capacity to 200 (previously 30) while keeping text-embedding-ada-002 at 80.

Documentation updates:

docs/QuotaCheck.md: Updated examples and default values to reflect the new gpt-4o-mini capacity of 200 in various usage scenarios. [1] [2]

Does this introduce a breaking change?

Yes
No

Golden Path Validation

I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

...

Other Information

Copilot

Pull Request Overview

This PR updates the minimum resource allocations for models to meet new capacity requirements.

Bump gpt-4o-mini default capacity from 30 to 200 and text-embedding-ada-002 from 40 to 80 in the quota check script.
Align documentation examples in docs/QuotaCheck.md with the new defaults.
Adjust CI workflow environment variables (GPT_MIN_CAPACITY, TEXT_EMBEDDING_MIN_CAPACITY) in .github/workflows/CAdeploy.yml.

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
infra/scripts/quota_check_params.sh	Updated DEFAULT_MODEL_CAPACITY string to new thresholds
docs/QuotaCheck.md	Revised example commands and default values for capacities
.github/workflows/CAdeploy.yml	Lowered GPT_MIN_CAPACITY and increased TEXT_EMBEDDING_MIN_CAPACITY

Comments suppressed due to low confidence (2)

docs/QuotaCheck.md:55

The sample output section still shows the previous capacity values. Please update any example output to reflect the new default gpt-4o-mini capacity of 200.

### **Sample Output**

infra/scripts/quota_check_params.sh:50

Consider adding or updating tests for quota_check_params.sh to verify that the script correctly parses and applies the new default capacities.

DEFAULT_MODEL_CAPACITY="gpt-4o-mini:200,text-embedding-ada-002:80"
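
The test suggested above could start from something like the following. This is a minimal sketch, assuming the capacity string is a comma-separated list of `model:capacity` pairs, as the value above suggests. The function name `parse_model_capacity` is hypothetical and is not taken from quota_check_params.sh, whose actual parsing logic is not shown in this PR.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not from the repository): split a
# "model:capacity,model:capacity" string into "model capacity" lines.

DEFAULT_MODEL_CAPACITY="gpt-4o-mini:200,text-embedding-ada-002:80"

parse_model_capacity() {
  local spec="$1" pair model capacity
  local -a pairs
  # Split the spec on commas into an array of "model:capacity" entries.
  IFS=',' read -r -a pairs <<< "$spec"
  for pair in "${pairs[@]}"; do
    model="${pair%%:*}"     # text before the first colon
    capacity="${pair##*:}"  # text after the last colon
    # Reject entries with no colon, an empty model name, or a non-numeric capacity.
    if [[ "$pair" != *:* || -z "$model" || ! "$capacity" =~ ^[0-9]+$ ]]; then
      echo "invalid entry: $pair" >&2
      return 1
    fi
    printf '%s %s\n' "$model" "$capacity"
  done
}

parse_model_capacity "$DEFAULT_MODEL_CAPACITY"
```

A check like this would catch a malformed default (for example a missing colon or a non-numeric capacity) before the quota check runs against a subscription.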

github-actions · 2025-07-16T14:40:53Z

🎉 This PR is included in version 1.6.1 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

* updated model capacity minimum to 200 (#602)
* docs: Added file and images (#577)
* Create exp.md
* Add files via upload
* Delete docs/images/re_use_log/exp.md
* Add files via upload
* Update CustomizingAzdParameters.md
* Update DeploymentGuide.md add the section Reusing an Existing Log Analytics Workspace
* Update re-use-log-analytics.md update the back link url and remove extra space
---------
Co-authored-by: Roopan-Microsoft
Co-authored-by: Thanusree-Microsoft
* docs: Add and update links for CustomizingAzdParameters.md and re-use-log-analytics.md (#606)
* Update CustomizingAzdParameters.md Added link
* Update re-use-log-analytics.md Updated link
* fix: Increase retry attempts and improve error messaging for Azure Blob Storage uploads (#607)
---------
Co-authored-by: Priyanka-Microsoft
Co-authored-by: Atulku-Microsoft
Co-authored-by: Roopan-Microsoft
Co-authored-by: Thanusree-Microsoft

updated model capacity minimum to 200

42d2a53

Copilot AI review requested due to automatic review settings July 14, 2025 07:17

Priyanka-Microsoft requested review from Avijit-Microsoft, Prajwal-Microsoft, Roopan-Microsoft, Vinay-Microsoft and aniaroramsft as code owners July 14, 2025 07:17

Copilot AI reviewed Jul 14, 2025

Roopan-Microsoft approved these changes Jul 14, 2025

Roopan-Microsoft merged commit ccbe67f into main Jul 14, 2025
9 checks passed

Roopan-Microsoft deleted the update-model-capacity-similar-to-bicep branch July 14, 2025 13:04
