articles/ai-foundry/agents/concepts/model-region-support.md (+2 −2)
@@ -19,7 +19,7 @@ Agents are powered by a diverse set of Azure OpenAI models with different capabi
 - **Standard** is offered with a global deployment option, routing traffic globally to provide higher throughput.
 - **Provisioned** is also offered with a global deployment option, allowing customers to purchase and deploy provisioned throughput units across Azure global infrastructure.

-All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see [deployment types guide](../../openai/how-to/deployment-types.md).
+All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see [deployment types guide](../../foundry-models/concepts/deployment-types.md).

 ## Available models

@@ -130,4 +130,4 @@ Azure AI Foundry Agent Service supports the following Azure OpenAI models in the
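The paragraph changed above notes that every deployment type exposes the same inference surface. A minimal sketch with the `openai` Python package, assuming placeholder endpoint, key, API version, and deployment name (none of these values come from the articles in this PR):

```python
from openai import AzureOpenAI

# Placeholder resource values -- substitute your own endpoint, key, and deployment name.
client = AzureOpenAI(
    azure_endpoint="https://<your-resource>.openai.azure.com/",
    api_key="<your-api-key>",
    api_version="2024-10-21",
)

# The identical call works against Standard, Global Standard, or Provisioned deployments;
# billing, scale, and latency characteristics differ, but the request shape does not.
response = client.chat.completions.create(
    model="<your-deployment-name>",  # the deployment name, not the underlying model name
    messages=[{"role": "user", "content": "In one sentence, what is a global deployment?"}],
)
print(response.choices[0].message.content)
```

Switching a workload between deployment types is therefore a deployment-name change rather than a code change.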
articles/ai-foundry/openai/concepts/provisioned-migration.md (+2 −2)
@@ -34,7 +34,7 @@ This article is intended for existing users of the provisioned throughput offeri
 |Self-service quota requests | Request quota increases without engaging the sales team – many can be autoapproved. |
 |Default provisioned-managed quota in many regions | Get started quickly without having to first request quota. |
 |Transparent information on real-time capacity availability + New deployment flow | Reduced negotiation around availability accelerates time-to-market. |
-| Data zone provisioned deployments | Allows you to leverage Azure's global infrastructure to dynamically route traffic to the data center within the Microsoft defined data zone with the best availability for each request. For more information, see the [deployment types](../how-to/deployment-types.md#data-zone-provisioned) article. |
+| Data zone provisioned deployments | Allows you to leverage Azure's global infrastructure to dynamically route traffic to the data center within the Microsoft defined data zone with the best availability for each request. For more information, see the [deployment types](../../foundry-models/concepts/deployment-types.md#data-zone-provisioned) article. |

 ### New hourly/reservation commercial model

@@ -45,7 +45,7 @@ This article is intended for existing users of the provisioned throughput offeri
 | Default provisioned-managed quota in many regions | Get started quickly in new regions without having to first request quota. |
 | Flexible choice of payment model for existing provisioned customers | Customers with commitments can stay on the commitment model until the end of life of the currently supported models, and can choose to migrate existing commitments to hourly/reservations via managed process. We recommend migrating to hourly/ reservations to take advantage of term discounts and to work with the latest models. |
 | Supports latest model generations | The latest models are available only on hourly/ reservations in provisioned offering. |
-| Differentiated pricing | Greater flexibility and control of pricing and performance. In December 2024, we introduced differentiated hourly pricing across [global provisioned](../how-to/deployment-types.md#global-provisioned), [data zone provisioned](../how-to/deployment-types.md#data-zone-provisioned), and [regional provisioned](../how-to/deployment-types.md#regional-provisioned) deployment types with the option to purchase [Azure Reservations](#new-azure-reservations-for-global-and-data-zone-provisioned-deployments) to support additional discounts. For more information on the hourly price for each provisioned deployment type, see the [Pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) page. |
+| Differentiated pricing | Greater flexibility and control of pricing and performance. In December 2024, we introduced differentiated hourly pricing across [global provisioned](../../foundry-models/concepts/deployment-types.md#global-provisioned), [data zone provisioned](../../foundry-models/concepts/deployment-types.md#data-zone-provisioned), and [regional provisioned](../../foundry-models/concepts/deployment-types.md#regional-provisioned) deployment types with the option to purchase [Azure Reservations](#new-azure-reservations-for-global-and-data-zone-provisioned-deployments) to support additional discounts. For more information on the hourly price for each provisioned deployment type, see the [Pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) page. |
 > * You can take advantage of more cost savings when you buy [Microsoft Azure AI Foundry Provisioned Throughput reservations](/azure/cost-management-billing/reservations/azure-openai#buy-a-microsoft-azure-openai-service-reservation).
-> * Provisioned throughput is available as the following deployment types: [global provisioned](../how-to/deployment-types.md#global-provisioned), [data zone provisioned](../how-to/deployment-types.md#data-zone-provisioned) and [regional provisioned](../how-to/deployment-types.md#regional-provisioned).
+> * Provisioned throughput is available as the following deployment types: [global provisioned](../../foundry-models/concepts/deployment-types.md#global-provisioned), [data zone provisioned](../../foundry-models/concepts/deployment-types.md#data-zone-provisioned) and [regional provisioned](../../foundry-models/concepts/deployment-types.md#regional-provisioned).
articles/ai-foundry/openai/how-to/batch.md (+2 −2)
@@ -92,7 +92,7 @@ The following aren't currently supported:
 ### Batch deployment

 > [!NOTE]
-> In the [Azure AI Foundry portal](https://ai.azure.com/?cid=learnDocs) the batch deployment types will appear as `Global-Batch` and `Data Zone Batch`. To learn more about Azure OpenAI deployment types, see our [deployment types guide](../how-to/deployment-types.md).
+> In the [Azure AI Foundry portal](https://ai.azure.com/?cid=learnDocs) the batch deployment types will appear as `Global-Batch` and `Data Zone Batch`. To learn more about Azure OpenAI deployment types, see our [deployment types guide](../../foundry-models/concepts/deployment-types.md).

 :::image type="content" source="../media/how-to/global-batch/global-batch.png" alt-text="Screenshot that shows the model deployment dialog in Azure AI Foundry portal with Global-Batch deployment type highlighted." lightbox="../media/how-to/global-batch/global-batch.png":::

@@ -246,5 +246,5 @@ When a job failure occurs, you'll find details about the failure in the `errors`

 ## See also

-* Learn more about Azure OpenAI [deployment types](./deployment-types.md)
+* Learn more about Azure OpenAI [deployment types](../../foundry-models/concepts/deployment-types.md)
 * Learn more about Azure OpenAI [quotas and limits](../quotas-limits.md)
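For readers landing on the note above, a sketch of how a `Global-Batch` (or `Data Zone Batch`) deployment is typically exercised with the `openai` Python package; the file name, endpoint, key, and API version are assumptions for illustration, not values from this change:

```python
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint="https://<your-resource>.openai.azure.com/",
    api_key="<your-api-key>",
    api_version="2024-10-21",
)

# Upload a JSONL file of requests; each line names the batch deployment in its "body.model" field.
batch_file = client.files.create(
    file=open("requests.jsonl", "rb"),
    purpose="batch",
)

# Submit the file as a batch job; results are produced asynchronously within the completion window.
batch_job = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/chat/completions",
    completion_window="24h",
)
print(batch_job.id, batch_job.status)
```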
articles/ai-foundry/openai/how-to/deployment-types.md (+2 −2)
@@ -12,7 +12,7 @@ ms.custom:
   - build-2025
 ---

-# Deployment types for Azure AI Foundry Models
+# Understanding Deployment types for Azure AI Foundry Models

 Azure AI Foundry makes models available by using the model deployment concept in Azure AI Foundry Services (formerly known as Azure AI Services). Model deployments are also Azure resources and, when created, give access to a given model under certain configurations. Such a configuration includes the infrastructure required to process the requests.

@@ -33,7 +33,7 @@ For standard deployments, there are three deployment-type options to choose from

 ### Global deployments

-Global deployments use the global infrastructure of Azure to dynamically route customer traffic to the datacenter with the best availability for the customer's inference requests. This means that global offers the highest initial throughput limits and best model availability, but still provides our uptime SLA and low latency. For high-volume workloads above the specified usage tiers on Standard and Global Standard, you might experience increased latency variation. For customers that require the lower latency variance at large workload usage, we recommend using our provisioned deployment types.
+Global deployments use the global infrastructure of Azure to dynamically route customer traffic to the datacenter with the best availability for the customer's inference requests. This means that global offers the highest initial throughput limits and best model availability, but still provides our uptime [SLA](https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services) and low latency. For high-volume workloads above the specified usage tiers on Standard and Global Standard, you might experience increased latency variation. For customers that require the lower latency variance at large workload usage, we recommend using our provisioned deployment types.

 Our global deployments are the first location for all new models and features. Depending on call volume, customers with large volume and low latency variance requirements should consider our provisioned deployment types.
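To make the global-versus-regional distinction in the hunk above concrete, here is a sketch of creating a deployment through the control plane with the `azure-mgmt-cognitiveservices` Python SDK; the SKU names, model version, capacity, and resource names are assumptions chosen to illustrate the shape of the call, not values taken from this PR:

```python
from azure.identity import DefaultAzureCredential
from azure.mgmt.cognitiveservices import CognitiveServicesManagementClient
from azure.mgmt.cognitiveservices.models import Deployment, DeploymentModel, DeploymentProperties, Sku

client = CognitiveServicesManagementClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
)

# The SKU name selects the deployment type -- e.g. "GlobalStandard" for globally routed traffic,
# "Standard" for a regional deployment (SKU strings assumed here; check the current SDK reference).
deployment = Deployment(
    sku=Sku(name="GlobalStandard", capacity=50),
    properties=DeploymentProperties(
        model=DeploymentModel(format="OpenAI", name="gpt-4o", version="2024-08-06"),
    ),
)

poller = client.deployments.begin_create_or_update(
    resource_group_name="<resource-group>",
    account_name="<foundry-resource-name>",
    deployment_name="gpt-4o-global",
    deployment=deployment,
)
print(poller.result().properties.provisioning_state)
```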
articles/ai-foundry/openai/how-to/fine-tune-test.md (+2 −2)
@@ -20,7 +20,7 @@ After you've fine-tuned a model, you may want to test its quality via the Chat C
 A Developer Tier deployment allows you to deploy your new model without the hourly hosting fee incurred by Standard or Global deployments. The only charges incurred are per-token. Consult the [pricing page](https://aka.ms/aoaipricing) for the most up-to-date pricing.

 > [!IMPORTANT]
-> Developer Tier offers no availability SLA and no [data residency](https://aka.ms/data-residency) guarantees. If you require an SLA or data residency, choose an alternative [deployment type](./deployment-types.md) for testing your model.
+> Developer Tier offers no availability SLA and no [data residency](https://aka.ms/data-residency) guarantees. If you require an SLA or data residency, choose an alternative [deployment type](../../foundry-models/concepts/deployment-types.md) for testing your model.
 >
 > Developer Tier deployments have a fixed lifetime of **24 hours**. Learn more [below](#clean-up-your-deployment) about the deployment lifecycle.
articles/ai-foundry/openai/how-to/fine-tuning-deploy.md (+6 −6)
@@ -18,7 +18,7 @@ Once your model is fine-tuned, you can deploy the model and can use it in your o

 When you deploy the model, you make the model available for inferencing, and that incurs an hourly hosting charge. Fine-tuned models, however, can be stored in Azure AI Foundry at no cost until you're ready to use them.

-Azure OpenAI provides choices of deployment types for fine-tuned models on the hosting structure that fits different business and usage patterns: **Standard**, **Global Standard** (preview) and **Provisioned Throughput** (preview). Learn more about [deployment types for fine-tuned models](#deployment-types) and the [concepts of all deployment types](./deployment-types.md).
+Azure OpenAI provides choices of deployment types for fine-tuned models on the hosting structure that fits different business and usage patterns: **Standard**, **Global Standard** (preview) and **Provisioned Throughput** (preview). Learn more about [deployment types for fine-tuned models](#deployment-types) and the [concepts of all deployment types](../../foundry-models/concepts/deployment-types.md).

 ## Deploy your fine-tuned model

@@ -362,7 +362,7 @@ Azure OpenAI fine-tuning supports the following deployment types.

 ### Standard

-[Standard deployments](./deployment-types.md#standard) provide a pay-per-token billing model with data residency confined to the deployed region.
+[Standard deployments](../../foundry-models/concepts/deployment-types.md) provide a pay-per-token billing model with data residency confined to the deployed region.

 | Models | East US2 | North Central US | Sweden Central | Switzerland West |
@@ -377,7 +377,7 @@ Azure OpenAI fine-tuning supports the following deployment types.

 ### Global Standard

-[Global standard](./deployment-types.md#global-standard) fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.
+[Global standard](../../foundry-models/concepts/deployment-types.md) fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.

 Global standard deployments are available from all Azure OpenAI regions for the following models:

@@ -392,7 +392,7 @@ Global standard deployments are available from all Azure OpenAI regions for the

 ### Developer Tier

-[Developer](./deployment-types.md#developer-for-fine-tuned-models) fine-tuned deployments offer a similar experience as [Global Standard](#global-standard) without an hourly hosting fee, but do not offer an availability SLA. Developer deployments are designed for model candidate evaluation and not for production use.
+[Developer](../../foundry-models/concepts/deployment-types.md) fine-tuned deployments offer a similar experience as [Global Standard](#global-standard) without an hourly hosting fee, but do not offer an availability SLA. Developer deployments are designed for model candidate evaluation and not for production use.

 Developer deployments are available from all Azure OpenAI regions for the following models:

@@ -409,7 +409,7 @@ Developer deployments are available from all Azure OpenAI regions for the follow
 | GPT-4o | ✅ | ✅ |
 | GPT-4o-mini | ✅ | ✅ |

-[Provisioned throughput](./deployment-types.md#regional-provisioned) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md) for latency-sensitive agents and applications. They use the same regional provisioned throughput (PTU) capacity as base models, so if you already have regional PTU quota you can deploy your fine-tuned model in support regions.
+[Provisioned throughput](../../foundry-models/concepts/deployment-types.md) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md) for latency-sensitive agents and applications. They use the same regional provisioned throughput (PTU) capacity as base models, so if you already have regional PTU quota you can deploy your fine-tuned model in support regions.

 ## Clean up your deployment

@@ -433,4 +433,4 @@ You can also delete a deployment in Azure AI Foundry portal, or use [Azure CLI](
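As a companion to the fine-tuned deployment types and the clean-up hunk above, a sketch using the same `azure-mgmt-cognitiveservices` SDK; the fine-tuned model ID format, SKU string, model version, and resource names are placeholders and assumptions, not values from this PR:

```python
from azure.identity import DefaultAzureCredential
from azure.mgmt.cognitiveservices import CognitiveServicesManagementClient
from azure.mgmt.cognitiveservices.models import Deployment, DeploymentModel, DeploymentProperties, Sku

client = CognitiveServicesManagementClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
)

# Deploy a fine-tuned model: the model name is the ID returned by the fine-tuning job,
# and the SKU name picks the hosting structure (a regional "Standard" SKU shown here;
# a global or developer SKU would trade data residency for cost savings or fee-free evaluation).
client.deployments.begin_create_or_update(
    resource_group_name="<resource-group>",
    account_name="<azure-openai-resource>",
    deployment_name="my-fine-tuned-model",
    deployment=Deployment(
        sku=Sku(name="Standard", capacity=1),
        properties=DeploymentProperties(
            model=DeploymentModel(
                format="OpenAI",
                name="<fine-tuned-model-id>",  # e.g. gpt-4o-mini-2024-07-18.ft-<job-id>; format assumed
                version="1",
            ),
        ),
    ),
).result()

# Clean up when evaluation is done to stop the hourly hosting charge.
client.deployments.begin_delete(
    resource_group_name="<resource-group>",
    account_name="<azure-openai-resource>",
    deployment_name="my-fine-tuned-model",
).result()
```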