fix: update GPT model capacity and fix aiDeploymentsLocation issue by Priyanka-Microsoft · Pull Request #499 · microsoft/content-generation-solution-accelerator

Priyanka-Microsoft · 2025-07-11T12:44:00Z

Purpose

...
This pull request updates the default capacities for AI models, adjusts environment variables in the deployment workflow, and ensures documentation reflects these changes. The most important changes include modifying model capacities, adding a new environment variable for deployment location, and updating scripts and documentation to align with the new capacities.

Deployment Workflow Updates:

.github/workflows/deploy.yml: Reduced GPT_MIN_CAPACITY from 250 to 150 and increased TEXT_EMBEDDING_MIN_CAPACITY from 40 to 80. Added AZURE_LOCATION as a new environment variable for AI deployments. [1] [2]

Documentation Updates:

docs/QuotaCheck.md: Updated default capacities for gpt4.1 and gpt-4 to 150, ensuring consistency with the new deployment settings. Adjusted example commands to reflect the updated capacities. [1] [2]

Script Updates:

scripts/quota_check_params.sh: Updated DEFAULT_MODEL_CAPACITY to use the new capacities (gpt4.1:150 and text-embedding-ada-002:80).

Does this introduce a breaking change?

Yes
No

Golden Path Validation

I have tested the primary workflows (the "golden path") to ensure they function correctly without errors.

Deployment Validation

I have validated the deployment process successfully and all services are running as expected with this change.

What to Check

Verify that the following are valid

I have built and tested the code locally and in a deployed app
For frontend changes, I have pulled the latest code from main, built the frontend, and committed all static files.
This is a change for all users of this app. No code or asset is specific to my use case or my organization.

Other Information

github-actions · 2025-07-18T10:50:00Z

🎉 This PR is included in version 1.6.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

updated gpt model capacity and fixed aideployment location issue

f666ab1

Priyanka-Microsoft requested review from Avijit-Microsoft, Prajwal-Microsoft, Roopan-Microsoft, Vinay-Microsoft, aniaroramsft, malrose07 and toherman-msft as code owners July 11, 2025 12:44

Roopan-Microsoft approved these changes Jul 11, 2025

View reviewed changes

Roopan-Microsoft merged commit 2e3f6ac into main Jul 11, 2025
11 checks passed

Roopan-Microsoft deleted the fix-aiDeploymentsLocation-parameter-error branch July 11, 2025 12:49

github-actions bot added the released label Jul 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: update GPT model capacity and fix aiDeploymentsLocation issue#499

fix: update GPT model capacity and fix aiDeploymentsLocation issue#499
Roopan-Microsoft merged 1 commit intomainfrom
fix-aiDeploymentsLocation-parameter-error

Priyanka-Microsoft commented Jul 11, 2025

Uh oh!

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Priyanka-Microsoft commented Jul 11, 2025

Purpose

Deployment Workflow Updates:

Documentation Updates:

Script Updates:

Does this introduce a breaking change?

Golden Path Validation

Deployment Validation

What to Check

Other Information

Uh oh!

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants