Skip to content

Commit 3a8933d

Browse files
Merge pull request #7204 from MicrosoftDocs/main
Auto Publish – main to live - 2025-09-22 22:03 UTC
2 parents 0d89fdb + d7ca363 commit 3a8933d

20 files changed

+234
-248
lines changed
Lines changed: 17 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -1,19 +1,19 @@
11
---
22
title: Deployment options for Azure AI Foundry Models
33
titleSuffix: Azure AI Foundry
4-
description: Learn about deployment options for Azure AI Foundry Models.
4+
description: Learn about deployment options for Azure AI Foundry Models including standard, serverless API, and managed compute deployments.
55
ms.service: azure-ai-foundry
66
ms.topic: concept-article
7-
ms.date: 06/30/2025
8-
ms.reviewer: fasantia
7+
ms.date: 09/22/2025
98
ms.author: mopeakande
10-
manager: nitinme
119
author: msakande
10+
manager: nitinme
11+
#CustomerIntent: As a developer or AI practitioner, I want to understand the different deployment options available for Azure AI Foundry Models so that I can choose the most appropriate deployment method for my specific use case, requirements, and infrastructure needs.
1212
---
1313

1414
# Deployment overview for Azure AI Foundry Models
1515

16-
The model catalog in Azure AI Foundry is the hub to discover and use a wide range of Foundry Models for building generative AI applications. Models need to be deployed to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for Foundry Models, depending on your needs and model requirements.
16+
The model catalog in Azure AI Foundry is the hub to discover and use a wide range of Foundry Models for building generative AI applications. You need to deploy models to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for Foundry Models, depending on your needs and model requirements.
1717

1818
## Deployment options
1919

@@ -23,6 +23,9 @@ Azure AI Foundry provides several deployment options depending on the type of mo
2323
- Deployment to serverless API endpoints
2424
- Deployment to managed computes
2525

26+
Azure AI Foundry portal might automatically pick a deployment option based on your environment and configuration. Use Azure AI Foundry resources for deployment whenever possible.
27+
Models that support multiple deployment options default to Azure AI Foundry resources for deployment. To access other deployment options, use the Azure CLI or Azure Machine Learning SDK for deployment.
28+
2629
### Standard deployment in Azure AI Foundry resources
2730

2831
Azure AI Foundry resources (formerly referred to as Azure AI Services resources), is **the preferred deployment option** in Azure AI Foundry. It offers the widest range of capabilities, including regional, data zone, or global processing, and it offers standard and [provisioned throughput (PTU)](../../ai-services/openai/concepts/provisioned-throughput.md) options. Flagship models in Azure AI Foundry Models support this deployment option.
@@ -31,23 +34,21 @@ This deployment option is available in:
3134

3235
* Azure AI Foundry resources
3336
* Azure OpenAI resources<sup>1</sup>
34-
* Azure AI hub, when connected to an Azure AI Foundry resource (requires the [Deploy models to Azure AI Foundry resources](#configure-azure-ai-foundry-portal-for-deployment-options) feature to be turned on).
37+
* Azure AI hub, when connected to an Azure AI Foundry resource
3538

36-
<sup>1</sup>If you're using Azure OpenAI resources, the model catalog shows only Azure OpenAI in Foundry Models for deployment. You can get the full list of Foundry Models by upgrading to an Azure AI Foundry resource.
39+
<sup>1</sup>If you use Azure OpenAI resources, the model catalog shows only Azure OpenAI in Foundry Models for deployment. You can get the full list of Foundry Models by upgrading to an Azure AI Foundry resource.
3740

3841
To get started with standard deployment in Azure AI Foundry resources, see [How-to: Deploy models to Azure AI Foundry Models](../foundry-models/how-to/create-model-deployments.md).
3942

4043
### Serverless API endpoint
4144

42-
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of dedicated endpoints to host the model, accessible via API. Azure AI Foundry Models support serverless API endpoints with pay-as-you-go billing.
43-
44-
Only regional deployments can be created for serverless API endpoints, and to use it, you _must_ **turn off** the "Deploy models to Azure AI Foundry resources" option.
45+
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md). It allows you to create dedicated endpoints to host the model, accessible through an API. Azure AI Foundry Models support serverless API endpoints with pay-as-you-go billing, and you can create only regional deployments for serverless API endpoints.
4546

4647
To get started with deployment to a serverless API endpoint, see [Deploy models as serverless API deployments](../how-to/deploy-models-serverless.md).
4748

4849
### Managed compute
4950

50-
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of a dedicated endpoint to host the model in a **dedicated compute**. You need to have compute quota in your subscription to host the model, and you're billed per compute uptime.
51+
This deployment option is available **only in** [Azure AI hub resources](ai-resources.md). It allows you to create a dedicated endpoint to host the model in a **dedicated compute**. You need to have compute quota in your subscription to host the model, and you're billed per compute uptime.
5152

5253
Managed compute deployment is required for model collections that include:
5354

@@ -61,7 +62,7 @@ To get started, see [How to deploy and inference a managed compute deployment](.
6162

6263
## Capabilities for the deployment options
6364

64-
We recommend using [Standard deployments in Azure AI Foundry resources](#standard-deployment-in-azure-ai-foundry-resources) whenever possible, as it offers the largest set of capabilities among the available deployment options. The following table lists details about specific capabilities available for each deployment option:
65+
Use [Standard deployments in Azure AI Foundry resources](#standard-deployment-in-azure-ai-foundry-resources) whenever possible. This deployment option provides the most capabilities among the available deployment options. The following table lists details about specific capabilities for each deployment option:
6566

6667
| Capability | Standard deployment in Azure AI Foundry resources | Serverless API Endpoint | Managed compute |
6768
|-------------------------------|--------------------------------------------------|------------------------|-----------------|
@@ -73,24 +74,17 @@ We recommend using [Standard deployments in Azure AI Foundry resources](#standar
7374
| Content filtering | Yes | Yes | No |
7475
| Custom content filtering | Yes | No | No |
7576
| Key-less authentication | Yes | No | No |
76-
| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage<sup>1</sup> | Compute core hours<sup>2</sup> |
77-
78-
<sup>1</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in standard deployment. After you delete the endpoint, no further charges accrue.
79-
80-
<sup>2</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.
81-
82-
## Configure Azure AI Foundry portal for deployment options
77+
| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage<sup>2</sup> | Compute core hours<sup>3</sup> |
8378

84-
Azure AI Foundry portal might automatically pick up a deployment option based on your environment and configuration. We recommend using Azure AI Foundry resources for deployment whenever possible. To do that, ensure that the **Deploy models to Azure AI Foundry resources** feature is **turned on**.
79+
<sup>2</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in serverless deployment. After you delete the endpoint, no further charges accrue.
8580

86-
:::image type="content" source="../media/concepts/deployments-overview/docs-flag-enable-foundry.png" alt-text="A screenshot showing the steps to enable deployment to Azure AI Foundry resources in the Azure AI Foundry portal." lightbox="../media/concepts/deployments-overview/docs-flag-enable-foundry.png":::
81+
<sup>3</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.
8782

88-
Once the **Deploy models to Azure AI Foundry resources** feature is enabled, models that support multiple deployment options default to deploy to Azure AI Foundry resources for deployment. To access other deployment options, either disable the feature or use the Azure CLI or Azure Machine Learning SDK for deployment. You can disable and enable the feature as many times as needed without affecting existing deployments.
8983

9084
## Related content
9185

9286
* [Configure your AI project to use Foundry Models](../../ai-foundry/foundry-models/how-to/quickstart-ai-project.md)
93-
* [Add and configure models to Foundry Models](../foundry-models/how-to/create-model-deployments.md)
87+
* [Deployment types in Azure AI Foundry Models](../foundry-models/concepts/deployment-types.md)
9488
* [Deploy Azure OpenAI models with Azure AI Foundry](../how-to/deploy-models-openai.md)
9589
* [Deploy open models with Azure AI Foundry](../how-to/deploy-models-managed.md)
9690
* [Explore Azure AI Foundry Models](../how-to/model-catalog-overview.md)

articles/ai-foundry/concepts/encryption-keys-portal.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,14 +5,13 @@ description: Learn how to use customer-managed keys (CMK) for enhanced encryptio
55
ms.author: jburchel
66
author: jonburchel
77
ms.reviewer: deeikele
8-
ms.date: 09/15/2025
8+
ms.date: 09/22/2025
99
ms.service: azure-ai-services
1010
ms.topic: concept-article
1111
ms.custom:
1212
- ignite-2023
1313
- build-aifnd
1414
- build-2025
15-
zone_pivot_groups: project-type
1615
ai-usage: ai-assisted
1716
# Customer intent: As an admin, I want to understand how I can use my own encryption keys with Azure AI Foundry.
1817
---

articles/ai-foundry/foundry-models/includes/models-azure-direct-others.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -14,11 +14,14 @@ author: msakande
1414
1515
## Black Forest Labs models sold directly by Azure
1616

17-
The Black Forest Labs collection of image generation models includes FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
17+
The Black Forest Labs collection of image generation models includes FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
1818

19-
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
19+
You can run these models via our service provider API and through the [images/generations and images/edits endpoints](../../openai/reference-preview.md).
20+
21+
22+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
2023
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
21-
| [FLUX.1-Kontext-pro](https://ai.azure.com/explore/models/FLUX.1-Kontext-pro/version/1/registry/azureml-blackforestlabs/?cid=learnDocs) | Image generation | - **Input:** text and image (5,000 tokens and 1 image) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats**: Image (PNG and JPG) | - Global standard (all regions) | Foundry, Hub-based |
24+
| [FLUX.1-Kontext-pro](https://ai.azure.com/explore/models/FLUX.1-Kontext-pro/version/1/registry/azureml-blackforestlabs/?cid=learnDocs) | Image generation | - **Input:** text and image (5,000 tokens and 1 image) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats**: Image (PNG and JPG) <br /> - **Additional Parameters**: Seed, Aspect Ratio | - Global standard (all regions) | Foundry, Hub-based |
2225
| [FLUX-1.1-pro](https://ai.azure.com/explore/models/FLUX-1.1-pro/version/1/registry/azureml-blackforestlabs/?cid=learnDocs) | Image generation | - **Input:** text (5,000 tokens) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) | - Global standard (all regions) | Hub-based |
2326

2427

articles/ai-foundry/how-to/connections-add.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.custom:
99
- build-2024
1010
- ignite-2024
1111
ms.topic: how-to
12-
ms.date: 09/12/2025
12+
ms.date: 09/22/2025
1313
ms.reviewer: sgilley
1414
ms.author: sgilley
1515
author: sdgilley

articles/ai-foundry/how-to/create-projects.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: This article describes how to create an Azure AI Foundry project so
55
author: sdgilley
66
ms.author: sgilley
77
ms.reviewer: deeikele
8-
ms.date: 09/12/2025
8+
ms.date: 09/22/2025
99
ms.service: azure-ai-foundry
1010
ms.topic: how-to
1111
ms.custom:

articles/ai-foundry/how-to/develop/trace-agents-sdk.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -172,6 +172,14 @@ with tracer.start_as_current_span("example-tracing"):
172172
run = project_client.agents.runs.create_and_process(thread_id=thread.id, agent_id=agent.id)
173173
```
174174

175+
### Alternative: AI Toolkit for VS Code
176+
177+
AI Toolkit gives you a simple way to trace locally in VS Code. It uses a local OTLP-compatible collector, making it great for development and debugging.
178+
179+
The toolkit supports AI frameworks like Azure AI Foundry Agents Service, OpenAI, Anthropic, and LangChain through OpenTelemetry. You can see traces instantly in VS Code without needing cloud access.
180+
181+
For detailed setup instructions and SDK-specific code examples, see [Tracing in AI Toolkit](https://code.visualstudio.com/docs/intelligentapps/tracing).
182+
175183
## Trace custom functions
176184

177185
To trace your custom functions, use the OpenTelemetry SDK to instrument your code.
@@ -247,7 +255,7 @@ Once necessary packages are installed, you can easily begin to [Instrument traci
247255
The Agents playground in the Azure AI Foundry portal lets you view trace results for threads and runs that your agents produce. To see trace results, select **Thread logs** in an active thread. You can also optionally select **Metrics** to enable automatic evaluations of the model's performance across several dimensions of **AI quality** and **Risk and safety**.
248256

249257
> [!NOTE]
250-
> Evaluation results are available for 24 hours before expiring. To get evaluation results, select your desired metrics and chat with your agent.
258+
> Evaluation in the playground is billed as outlined under Trust and Azure AI Foundry Observability on [the pricing page](https://azure.microsoft.com/pricing/details/ai-foundry/?msockid=1f44c87dd9fa6d1e257fdd6dd8406c42). Results are available for 24 hours before expiring. To get evaluation results, select your desired metrics and chat with your agent.
251259
> - Evaluations aren't available in the following regions.
252260
> - `australiaeast`
253261
> - `japaneast`

articles/ai-foundry/how-to/develop/trace-application.md

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: View trace results for AI applications using OpenAI SDK with OpenTe
55
author: lgayhardt
66
ms.author: lagayhar
77
ms.reviewer: ychen
8-
ms.date: 09/15/2025
8+
ms.date: 09/22/2025
99
ms.service: azure-ai-foundry
1010
ms.topic: how-to
1111
ai-usage: ai-assisted
@@ -291,6 +291,15 @@ Configure tracing as follows:
291291
}
292292
```
293293
294+
## Trace locally with AI Toolkit
295+
296+
AI Toolkit offers a simple way to trace locally in VS Code. It uses a local OTLP-compatible collector, making it perfect for development and debugging without needing cloud access.
297+
298+
The toolkit supports the OpenAI SDK and other AI frameworks through OpenTelemetry. You can see traces instantly in your development environment.
299+
300+
For detailed setup instructions and SDK-specific code examples, see [Tracing in AI Toolkit](https://code.visualstudio.com/docs/intelligentapps/tracing).
301+
302+
294303
## Related content
295304
296305
* [Trace agents using Azure AI Foundry SDK](trace-agents-sdk.md)

articles/ai-foundry/how-to/hub-configure-private-link.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ ms.custom:
1212
- ignite-2024
1313
- hub-only
1414
ms.topic: how-to
15-
ms.date: 09/15/2025
15+
ms.date: 09/22/2025
1616
ms.reviewer: meerakurup
1717
ms.author: jburchel
1818
author: jonburchel
@@ -394,6 +394,7 @@ To find the private IP addresses for your A records, see the [Azure Machine Lear
394394
395395
## Next steps
396396
397+
- [Configure a private link for a Foundry project](configure-private-link.md)
397398
- [Create an Azure AI Foundry project](create-projects.md)
398399
- [Learn more about Azure AI Foundry](../what-is-azure-ai-foundry.md)
399400
- [Learn more about Azure AI Foundry hubs](../concepts/ai-resources.md)

articles/ai-foundry/how-to/hub-connections-add.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to use connections in Azure AI Foundry hubs.
55
manager: scottpolly
66
ms.service: azure-ai-foundry
77
ms.topic: how-to
8-
ms.date: 09/12/2025
8+
ms.date: 09/22/2025
99
ms.reviewer: scottpolly
1010
reviewer: scottpolly
1111
ms.author: jburchel
@@ -77,4 +77,5 @@ Prefer managed identity wherever possible for keyless and rotated credential man
7777

7878
## Next steps
7979

80-
Secure network traffic with [Private Link](./hub-configure-private-link.md).
80+
- [Create and managed connections in Azure AI Foundry projects](./connections-add.md).
81+
- [Secure network traffic with private link](./hub-configure-private-link.md).

articles/ai-foundry/how-to/hub-create-projects.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to create a hub-based project in Azure AI Foundry.
55
author: sdgilley
66
ms.author: sgilley
77
ms.reviewer: deeikele
8-
ms.date: 09/12/2025
8+
ms.date: 09/22/2025
99
ms.service: azure-ai-foundry
1010
ms.topic: how-to
1111
ms.custom:
@@ -111,5 +111,6 @@ Delete hub (with all projects): In Hub properties, select Delete hub to open Azu
111111

112112
## Related content
113113

114-
- [Quickstart: Get started with Azure AI Foundry (Hub projects)](../quickstarts/hub-get-started-code.md)
115-
- [Learn more about Azure AI Foundry](../what-is-azure-ai-foundry.md)
114+
- [Create an Azure AI Foundry project](create-projects.md).
115+
- [Quickstart: Get started with Azure AI Foundry (Hub projects)](../quickstarts/hub-get-started-code.md).
116+
- [Learn more about Azure AI Foundry](../what-is-azure-ai-foundry.md).

0 commit comments

Comments
 (0)