You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/foundry-models/how-to/create-model-deployments.md
+6-3Lines changed: 6 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,18 +1,21 @@
1
1
---
2
2
title: Add and configure models to Azure AI Foundry
3
3
titleSuffix: Azure AI Foundry
4
-
description: Learn how to add and configure new models to the Azure AI Foundry's inference endpoint.
4
+
description: Learn how to add and configure AI models to Azure AI Foundry Models inference endpoint for deployment and inference in your applications.
5
5
ms.service: azure-ai-foundry
6
6
ms.subservice: azure-ai-foundry-model-inference
7
7
ms.topic: how-to
8
-
ms.date: 05/19/2025
8
+
ms.date: 09/29/2025
9
9
ms.custom: ignite-2024, github-universe-2024
10
10
author: msakande
11
11
ms.author: mopeakande
12
12
recommendations: false
13
13
zone_pivot_groups: azure-ai-models-deployment
14
14
ms.reviewer: fasantia
15
15
reviewer: santiagxf
16
+
ai.usage: ai-assisted
17
+
18
+
#CustomerIntent: As a developer or AI practitioner, I want to deploy and configure AI models to Azure AI Foundry Models endpoint so that I can integrate these models into my applications and generate predictions through inference requests.
16
19
---
17
20
18
21
# Add and configure models to Azure AI Foundry Models
* Your Azure AI Foundry (formerly known Azure AI Services) resource name.
20
+
* Your Azure AI Foundry resource (formerly known as Azure AI Services resource) name.
21
21
22
22
* The resource group where the Azure AI Foundry resource is deployed.
23
23
24
-
* The model name, provider, version, and SKU you would like to deploy. You can use the Azure AI Foundry portal or the Azure CLI to identify it. In this example we deploy the following model:
24
+
* The model name, provider, version, and SKU you want to deploy. You can use the Azure AI Foundry portal or the Azure CLI to find this information. In this example, you deploy the following model:
25
25
26
26
***Model name:**: `Phi-3.5-vision-instruct`
27
27
***Provider**: `Microsoft`
@@ -52,7 +52,7 @@ cd azureai-model-inference-bicep/infra
@@ -70,4 +70,18 @@ cd azureai-model-inference-bicep/infra
70
70
71
71
## Use the model
72
72
73
-
Deployed models can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource. When constructing your request, indicate the parameter `model` and insert the model deployment name you have created.
73
+
Deployed models in can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource. When constructing your request, indicate the parameter `model` and insert the model deployment name you have created. You can programmatically get the URI for the inference endpoint using the following code:
74
+
75
+
__Inference endpoint__
76
+
77
+
```azurecli
78
+
az cognitiveservices account show -n $accountName -g $resourceGroupName | jq '.properties.endpoints["Azure AI Model Inference API"]'
79
+
```
80
+
81
+
To make requests to the Azure AI Foundry Models endpoint, append the route `models`, for example `https://<resource>.services.ai.azure.com/models`. You can see the API reference for the endpoint at [Azure AI Model Inference API reference page](https://aka.ms/azureai/modelinference).
82
+
83
+
__Inference keys__
84
+
85
+
```azurecli
86
+
az cognitiveservices account keys list -n $accountName -g $resourceGroupName
Copy file name to clipboardExpand all lines: articles/ai-foundry/foundry-models/includes/create-model-deployments/cli.md
+15-15Lines changed: 15 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,61 +4,61 @@ author: santiagxf
4
4
ms.author: fasantia
5
5
ms.service: azure-ai-foundry
6
6
ms.subservice: azure-ai-foundry-model-inference
7
-
ms.date: 1/21/2025
7
+
ms.date: 09/29/2025
8
8
ms.topic: include
9
9
zone_pivot_groups: azure-ai-models-deployment
10
10
---
11
11
12
12
[!INCLUDE [Header](intro.md)]
13
13
14
-
* Install the [Azure CLI](/cli/azure/) and the `cognitiveservices` extension for Azure AI Services:
14
+
* Install the [Azure CLI](/cli/azure/) and the `cognitiveservices` extension for Azure AI Services.
15
15
16
16
```azurecli
17
17
az extension add -n cognitiveservices
18
18
```
19
19
20
-
* Some of the commands in this tutorial use the `jq` tool, which might not be installed in your system. For installation instructions, see [Download `jq`](https://stedolan.github.io/jq/download/).
20
+
* Some of the commands in this tutorial use the `jq` tool, which might not be installed on your system. For installation instructions, see [Download `jq`](https://stedolan.github.io/jq/download/).
21
21
22
22
* Identify the following information:
23
23
24
24
* Your Azure subscription ID.
25
25
26
26
* Your Azure AI Services resource name.
27
27
28
-
* The resource group where the Azure AI Services resource is deployed.
28
+
* The resource group where you deployed the Azure AI Services resource.
29
29
30
30
31
31
## Add models
32
32
33
-
To add a model, you first need to identify the model that you want to deploy. You can query the available models as follows:
33
+
To add a model, first identify the model that you want to deploy. You can query the available models as follows:
34
34
35
-
1. Log in into your Azure subscription:
35
+
1. Sign in to your Azure subscription.
36
36
37
37
```azurecli
38
38
az login
39
39
```
40
40
41
-
2. If you have more than 1 subscription, select the subscription where your resource is located:
41
+
1. If you have more than one subscription, select the subscription where your resource is located.
42
42
43
43
```azurecli
44
44
az account set --subscription $subscriptionId
45
45
```
46
46
47
-
3. Set the following environment variables with the name of the Azure AI Services resource you plan to use and resource group.
47
+
1. Set the following environment variables with the name of the Azure AI Services resource you plan to use and resource group.
48
48
49
49
```azurecli
50
50
accountName="<ai-services-resource-name>"
51
51
resourceGroupName="<resource-group>"
52
52
location="eastus2"
53
53
```
54
54
55
-
3. If you don't have an Azure AI Services account create yet, you can create one as follows:
55
+
1. If you didn't create an Azure AI Services account yet, create one.
4. Let's see first which models are available to you and under which SKU. SKUs, also known as [deployment types](../../concepts/deployment-types.md), define how Azure infrastructure is used to process requests. Models may offer different deployment types. The following command list all the model definitions available:
61
+
1. Check which models are available to you and under which SKU. SKUs, also known as [deployment types](../../concepts/deployment-types.md), define how Azure infrastructure is used to process requests. Models might offer different deployment types. The following command lists all the model definitions available:
62
62
63
63
```azurecli
64
64
az cognitiveservices account list-models \
@@ -67,7 +67,7 @@ To add a model, you first need to identify the model that you want to deploy. Yo
@@ -79,9 +79,9 @@ To add a model, you first need to identify the model that you want to deploy. Yo
79
79
}
80
80
```
81
81
82
-
6. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. The property `format` indicates the provider offering the model. Capacity might also be needed depending on the type of deployment.
82
+
1. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. The property `format` indicates the provider offering the model. You might also need capacity depending on the type of deployment.
83
83
84
-
7. Add the model deployment to the resource. The following example adds `Phi-3.5-vision-instruct`:
84
+
1. Add the model deployment to the resource. The following example adds `Phi-3.5-vision-instruct`:
85
85
86
86
```azurecli
87
87
az cognitiveservices account deployment create \
@@ -95,9 +95,9 @@ To add a model, you first need to identify the model that you want to deploy. Yo
95
95
--sku-name GlobalStandard
96
96
```
97
97
98
-
8. The model is ready to be consumed.
98
+
1. The model is ready to use.
99
99
100
-
You can deploy the same model multiple times if needed as long as it's under a different deployment name. This capability might be useful in case you want to test different configurations for a given model, including content filters.
100
+
You can deploy the same model multiple times if needed as long as it's under a different deployment name. This capability might be useful if you want to test different configurations for a given model, including content filters.
Copy file name to clipboardExpand all lines: articles/ai-foundry/foundry-models/includes/create-model-deployments/intro.md
+5-4Lines changed: 5 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,20 +4,21 @@ author: santiagxf
4
4
ms.author: fasantia
5
5
ms.service: azure-ai-foundry
6
6
ms.subservice: azure-ai-foundry-model-inference
7
-
ms.date: 1/21/2025
7
+
ms.date: 09/29/2025
8
8
ms.topic: include
9
9
---
10
10
11
-
You can decide and configure which models are available for inference in your Azure AI Foundry resource. When a given model is configured, you can then generate predictions from it by indicating its model name or deployment name on your requests. No further changes are required in your code to use it.
11
+
You can decide and configure which models are available for inference in your Azure AI Foundry resource. When you configure a model, you can generate predictions from it by specifying its model name or deployment name in your requests. You don't need to make any other changes in your code to use the model.
12
12
13
-
In this article, you'll learn how to add a new model to Azure AI Foundry.
13
+
In this article, you learn how to add a new model to an Azure AI Foundry Models endpoint.
14
14
15
15
## Prerequisites
16
16
17
17
To complete this article, you need:
18
18
19
19
* An Azure subscription. If you're using [GitHub Models](https://docs.github.com/en/github-models/), you can upgrade your experience and create an Azure subscription in the process. Read [Upgrade from GitHub Models to Azure AI Foundry Models](../../how-to/quickstart-github-models.md) if that's your case.
20
20
21
-
* An Azure AI Foundry resource (formerly known as Azure AI Services). For more information, see [Create and configure all the resources for Azure AI Foundry Models](../../how-to/quickstart-create-resources.md).
21
+
* An Azure AI Foundry project. This kind of project is managed under an Azure AI Foundry resource (formerly known as Azure AI Services resource). If you don't have a Foundry project, see [Create a project for Azure AI Foundry (Foundry projects)](../../../how-to/create-projects.md).
22
+
22
23
23
24
*[Models from Partners and Community](../../concepts/models.md#models-from-partners-and-community) require access to **Azure Marketplace**. Ensure you have the [permissions required to subscribe to model offerings](../../how-to/configure-marketplace.md). [Models Sold Directly by Azure](../../concepts/models.md#models-sold-directly-by-azure) don't have this requirement.
0 commit comments