Skip to content

Commit 14f9dbd

Browse files
committed
Learn Editor: Update working-with-models.md
1 parent 181aabe commit 14f9dbd

File tree

1 file changed

+6
-3
lines changed

1 file changed

+6
-3
lines changed

articles/ai-services/openai/how-to/working-with-models.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -314,7 +314,7 @@ curl -X PUT https://management.azure.com/subscriptions/00000000-0000-0000-0000-0
314314

315315
```json
316316
{
317-
"id": "/subscriptions/{subscription-id}/resourceGroups/resource-group-temp/providers/Microsoft.CognitiveServices/accounts/docs-openai-test-001/deployments/gpt-35-turbo",
317+
"id": "/subscriptions/{subscription-id}/resourceGroups/resource-group-temp/providers/Microsoft.CognitiveServices/accounts/docs-openai-test-001/deployments/gpt-4o-ptu-deployment",
318318
"type": "Microsoft.CognitiveServices/accounts/deployments",
319319
"name": "gpt-4o-ptu-deployment",
320320
"sku": {
@@ -342,7 +342,7 @@ curl -X PUT https://management.azure.com/subscriptions/00000000-0000-0000-0000-0
342342
{
343343
"key": "request",
344344
"renewalPeriod": 10,
345-
"count": 120
345+
"count": 300
346346
}
347347
]
348348
},
@@ -363,7 +363,10 @@ curl -X PUT https://management.azure.com/subscriptions/00000000-0000-0000-0000-0
363363
364364
### Multi-deployment migrations for provisioned deployments
365365
Multi-deployment migrations allow you to have greater control over the model migration process. With multi-deployment migrations, you can dictate how quickly you would like to migrate your existing traffic to the target model version or model family on a new provisioned deployment. The process to migrate to a new model version or model family using the multi-deployment migration approach is as follows:
366-
- Create a new provisioned deployment. For this new deployment, you can choose to maintain the same provisioned deployment type as your existing provisioned deployment or select a new deployment type if de
366+
- Create a new provisioned deployment. For this new deployment, you can choose to maintain the same provisioned deployment type as your existing deployment or select a new deployment type if desired.
367+
- Transition traffic from the existing provisioned deployment to the newly created provisioned deployment with your target model version or model family until all traffic is offloaded from the original deployment.
368+
- Once traffic is migrated over to the new deployment, validate that there are no inference requests being processed on the previous provisioned deployment by ensuring the Azure OpenAI Requests metric does not show any API calls made within 5-10 minutes of the inference traffic being migrated over to the new deployment. For more information on this metric, [see the Monitor Azure OpenAI documentation](https://aka.ms/aoai/docs/monitor-azure-openai).
369+
- Once you confirm that no inference calls have been made, delete the original provisioned deployment.
367370

368371
## Next steps
369372

0 commit comments

Comments
 (0)