Skip to content

Commit 71a2e68

Browse files
committed
freshness updates deployment overview
AB#482225 AB#475803
1 parent 9154b4c commit 71a2e68

File tree

2 files changed

+13
-18
lines changed

2 files changed

+13
-18
lines changed

articles/ai-foundry/concepts/deployments-overview.md

Lines changed: 13 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -4,16 +4,16 @@ titleSuffix: Azure AI Foundry
44
description: Learn about deployment options for Azure AI Foundry Models.
55
ms.service: azure-ai-foundry
66
ms.topic: concept-article
7-
ms.date: 06/30/2025
8-
ms.reviewer: fasantia
7+
ms.date: 09/22/2025
98
ms.author: mopeakande
9+
author: msakande
1010
manager: nitinme
1111
author: msakande
1212
---
1313

1414
# Deployment overview for Azure AI Foundry Models
1515

16-
The model catalog in Azure AI Foundry is the hub to discover and use a wide range of Foundry Models for building generative AI applications. Models need to be deployed to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for Foundry Models, depending on your needs and model requirements.
16+
The model catalog in Azure AI Foundry is the hub to discover and use a wide range of Foundry Models for building generative AI applications. You need to deploy models to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for Foundry Models, depending on your needs and model requirements.
1717

1818
## Deployment options
1919

@@ -31,23 +31,21 @@ This deployment option is available in:
3131

3232
* Azure AI Foundry resources
3333
* Azure OpenAI resources<sup>1</sup>
34-
* Azure AI hub, when connected to an Azure AI Foundry resource (requires the [Deploy models to Azure AI Foundry resources](#configure-azure-ai-foundry-portal-for-deployment-options) feature to be turned on).
34+
* Azure AI hub, when connected to an Azure AI Foundry resource
3535

36-
<sup>1</sup>If you're using Azure OpenAI resources, the model catalog shows only Azure OpenAI in Foundry Models for deployment. You can get the full list of Foundry Models by upgrading to an Azure AI Foundry resource.
36+
<sup>1</sup>If you use Azure OpenAI resources, the model catalog shows only Azure OpenAI in Foundry Models for deployment. You can get the full list of Foundry Models by upgrading to an Azure AI Foundry resource.
3737

3838
To get started with standard deployment in Azure AI Foundry resources, see [How-to: Deploy models to Azure AI Foundry Models](../foundry-models/how-to/create-model-deployments.md).
3939

4040
### Serverless API endpoint
4141

42-
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of dedicated endpoints to host the model, accessible via API. Azure AI Foundry Models support serverless API endpoints with pay-as-you-go billing.
43-
44-
Only regional deployments can be created for serverless API endpoints, and to use it, you _must_ **turn off** the "Deploy models to Azure AI Foundry resources" option.
42+
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md). It allows you to create dedicated endpoints to host the model, accessible through an API. Azure AI Foundry Models support serverless API endpoints with pay-as-you-go billing, and you can create only regional deployments for serverless API endpoints.
4543

4644
To get started with deployment to a serverless API endpoint, see [Deploy models as serverless API deployments](../how-to/deploy-models-serverless.md).
4745

4846
### Managed compute
4947

50-
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of a dedicated endpoint to host the model in a **dedicated compute**. You need to have compute quota in your subscription to host the model, and you're billed per compute uptime.
48+
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md). It allows you to create a dedicated endpoint to host the model in a **dedicated compute**. You need to have compute quota in your subscription to host the model, and you're billed per compute uptime.
5149

5250
Managed compute deployment is required for model collections that include:
5351

@@ -61,7 +59,7 @@ To get started, see [How to deploy and inference a managed compute deployment](.
6159

6260
## Capabilities for the deployment options
6361

64-
We recommend using [Standard deployments in Azure AI Foundry resources](#standard-deployment-in-azure-ai-foundry-resources) whenever possible, as it offers the largest set of capabilities among the available deployment options. The following table lists details about specific capabilities available for each deployment option:
62+
Use [Standard deployments in Azure AI Foundry resources](#standard-deployment-in-azure-ai-foundry-resources) whenever possible. This deployment option provides the most capabilities among the available deployment options. The following table lists details about specific capabilities for each deployment option:
6563

6664
| Capability | Standard deployment in Azure AI Foundry resources | Serverless API Endpoint | Managed compute |
6765
|-------------------------------|--------------------------------------------------|------------------------|-----------------|
@@ -73,19 +71,16 @@ We recommend using [Standard deployments in Azure AI Foundry resources](#standar
7371
| Content filtering | Yes | Yes | No |
7472
| Custom content filtering | Yes | No | No |
7573
| Key-less authentication | Yes | No | No |
76-
| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage<sup>1</sup> | Compute core hours<sup>2</sup> |
74+
| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage<sup>2</sup> | Compute core hours<sup>3</sup> |
7775

78-
<sup>1</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in standard deployment. After you delete the endpoint, no further charges accrue.
76+
<sup>2</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in serverless deployment. After you delete the endpoint, no further charges accrue.
7977

80-
<sup>2</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.
78+
<sup>3</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.
8179

8280
## Configure Azure AI Foundry portal for deployment options
8381

84-
Azure AI Foundry portal might automatically pick up a deployment option based on your environment and configuration. We recommend using Azure AI Foundry resources for deployment whenever possible. To do that, ensure that the **Deploy models to Azure AI Foundry resources** feature is **turned on**.
85-
86-
:::image type="content" source="../media/concepts/deployments-overview/docs-flag-enable-foundry.png" alt-text="A screenshot showing the steps to enable deployment to Azure AI Foundry resources in the Azure AI Foundry portal." lightbox="../media/concepts/deployments-overview/docs-flag-enable-foundry.png":::
87-
88-
Once the **Deploy models to Azure AI Foundry resources** feature is enabled, models that support multiple deployment options default to deploy to Azure AI Foundry resources for deployment. To access other deployment options, either disable the feature or use the Azure CLI or Azure Machine Learning SDK for deployment. You can disable and enable the feature as many times as needed without affecting existing deployments.
82+
Azure AI Foundry portal might automatically pick a deployment option based on your environment and configuration. Use Azure AI Foundry resources for deployment whenever possible.
83+
Models that support multiple deployment options default to Azure AI Foundry resources for deployment. To access other deployment options, use the Azure CLI or Azure Machine Learning SDK for deployment.
8984

9085
## Related content
9186

-135 KB
Binary file not shown.

0 commit comments

Comments
 (0)