Skip to content

Commit 12e4eca

Browse files
committed
Increase embedding TPM capacity and add note in cloud ingestion guide
1 parent 17320f7 commit 12e4eca

File tree

2 files changed

+10
-4
lines changed

2 files changed

+10
-4
lines changed

docs/data_ingestion.md

Lines changed: 9 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -153,15 +153,21 @@ This project includes an optional feature to perform data ingestion in the cloud
153153

154154
3. Open `azure.yaml` and un-comment the document-extractor, figure-processor, and text-processor sections. Those are the Azure Functions apps that will be deployed and serve as Azure AI Search skills.
155155

156-
4. Provision the new Azure Functions resources, deploy the function apps, and update the search indexer with:
156+
4. (Recommended) Increase the capacity for the embedding model to the maximum quota allowed for your region/subscription, so that the Azure Functions can generate embeddings without hitting rate limits:
157+
158+
```shell
159+
azd env set AZURE_OPENAI_EMB_DEPLOYMENT_CAPACITY 400
160+
```
161+
162+
5. Provision the new Azure Functions resources, deploy the function apps, and update the search indexer with:
157163

158164
```shell
159165
azd up
160166
```
161167

162-
5. That will upload the documents in the `data/` folder to the Blob storage container, create the indexer and skillset, and run the indexer to ingest the data. You can monitor the indexer status from the portal.
168+
6. That will upload the documents in the `data/` folder to the Blob storage container, create the indexer and skillset, and run the indexer to ingest the data. You can monitor the indexer status from the portal.
163169

164-
6. When you have new documents to ingest, you can upload documents to the Blob storage container and run the indexer from the Azure Portal to ingest new documents.
170+
7. When you have new documents to ingest, you can upload documents to the Blob storage container and run the indexer from the Azure Portal to ingest new documents.
165171

166172
### Indexer architecture
167173

infra/main.bicep

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -231,7 +231,7 @@ var embedding = {
231231
deploymentName: !empty(embeddingDeploymentName) ? embeddingDeploymentName : 'text-embedding-3-large'
232232
deploymentVersion: !empty(embeddingDeploymentVersion) ? embeddingDeploymentVersion : (embeddingModelName == 'text-embedding-ada-002' ? '2' : '1')
233233
deploymentSkuName: !empty(embeddingDeploymentSkuName) ? embeddingDeploymentSkuName : (embeddingModelName == 'text-embedding-ada-002' ? 'Standard' : 'GlobalStandard')
234-
deploymentCapacity: embeddingDeploymentCapacity != 0 ? embeddingDeploymentCapacity : 30
234+
deploymentCapacity: embeddingDeploymentCapacity != 0 ? embeddingDeploymentCapacity : 200
235235
dimensions: embeddingDimensions != 0 ? embeddingDimensions : 3072
236236
}
237237

0 commit comments

Comments
 (0)