Increase embedding TPM capacity and add note in cloud ingestion guide #2846
Conversation
Pull request overview
This PR increases the default embedding model capacity to address rate limit issues encountered during cloud ingestion. The default capacity changes from 30K TPM to 200K TPM, which aligns with typical Azure OpenAI quota defaults for embedding models.
Key Changes:
- Increased default embedding deployment capacity from 30 to 200 in infrastructure configuration
- Added optional configuration step in cloud ingestion guide recommending capacity increase to 400
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| infra/main.bicep | Updated default embedding deployment capacity from 30 to 200 |
| docs/data_ingestion.md | Added recommended step to increase embedding capacity to 400 for cloud ingestion, renumbered subsequent steps accordingly |
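For context, the infra change amounts to bumping a single default value. Here is a minimal sketch of what that looks like, assuming the capacity is exposed as a top-level Bicep parameter (the actual parameter name and surrounding structure in infra/main.bicep may differ):

```bicep
// Azure OpenAI deployment capacity is expressed in units of 1,000 tokens per minute (TPM),
// so 200 requests 200K TPM; the previous default of 30 requested 30K TPM.
param embeddingDeploymentCapacity int = 200
```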
```diff
-      - name: Run markdownlint
-        uses: articulate/actions-markdownlint@v1
+      - name: Run markdownlint-cli2
+        uses: DavidAnson/markdownlint-cli2-action@v21
```
We suddenly started getting markdownlint errors, and while debugging, I realized that our markdownlint action was deprecated in favor of this one. David Anson also authors the VS Code extension that we recommend in the repo configuration, so this makes CI consistent with VS Code errors, in theory.
sounds good
```yaml
globs: |
  **/*.md
  !data/**
  !.github/**
```
;)
The default markdown files like SECURITY.md are riddled with issues, and we seemed to ignore them before, so I ignored them again here. We could fix them up in the future.
Purpose
Some developers have already tried out the cloud ingestion approach and ran into errors due to rate limits with the embedding model. This PR increases the embedding model's default capacity to 200 (200K TPM) and adds a note to the ingestion guide about how to increase it even further. This appears to be a safe capacity to request, as Azure OpenAI quotas default quite high for embedding models, at least in my account.
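For illustration, here is roughly how a raised capacity flows into the embedding model deployment, and what requesting 400K TPM would look like. This is a hedged sketch using the standard Microsoft.CognitiveServices deployment schema, not the exact contents of infra/main.bicep; the parameter name, deployment name, and model shown are assumptions:

```bicep
// Assumed parameter: 400 requests 400K TPM for heavy cloud ingestion,
// subject to the quota available in your subscription and region.
param embeddingDeploymentCapacity int = 400
param openAiServiceName string

// Reference the existing Azure OpenAI account (name supplied at deployment time).
resource openAiAccount 'Microsoft.CognitiveServices/accounts@2023-05-01' existing = {
  name: openAiServiceName
}

// The deployment's SKU capacity is what the embedding rate limit is derived from.
resource embeddingDeployment 'Microsoft.CognitiveServices/accounts/deployments@2023-05-01' = {
  parent: openAiAccount
  name: 'embedding'
  sku: {
    name: 'Standard'
    capacity: embeddingDeploymentCapacity
  }
  properties: {
    model: {
      format: 'OpenAI'
      name: 'text-embedding-3-large' // illustrative; the repo's default embedding model may differ
      version: '1'
    }
  }
}
```

In practice a developer would likely override the capacity through their azd environment or the Azure portal rather than editing the template directly, which is what the new note in docs/data_ingestion.md is for.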
Does this introduce a breaking change?
When developers merge from main and run the server, azd up, or azd deploy, will this produce an error?
If you're not sure, try it out on an old environment.
Does this require changes to learn.microsoft.com docs?
This repository is referenced by this tutorial, which includes deployment, settings, and usage instructions. If text or screenshots need to change in the tutorial, check the box below and notify the tutorial author. A Microsoft employee can do this for you if you're an external contributor.
Type of change
Code quality checklist
See CONTRIBUTING.md for more details.
- The current tests all pass (`python -m pytest`).
- I ran `python -m pytest --cov` to verify 100% coverage of added lines
- I ran `python -m mypy` to check for type errors
- I ran `ruff` and `black` manually on my code.