You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this article, you learn how to deploy a model from the model catalog as a standard deployment.
21
21
22
-
[Certain models in the model catalog](concept-endpoint-serverless-availability.md) can be deployed as a standard deployment with Standard billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
22
+
[Certain models in the model catalog](concept-endpoint-serverless-availability.md) can be deployed as a standard deployment with Standard billing. This deployment type provides a way to consume models as an API without hosting them on your subscription, while maintaining the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
23
23
24
24
This article uses a Meta Llama model deployment for illustration. However, you can use the same steps to deploy any of the [models in the model catalog that are available for standard deployment](concept-endpoint-serverless-availability.md).
25
25
@@ -96,20 +96,20 @@ This article uses a Meta Llama model deployment for illustration. However, you c
96
96
97
97
1. Go to your workspace. To use the standard deployment offering, your workspace must belong to one of the [regions that are supported for serverless deployment](concept-endpoint-serverless-availability.md) for the particular model you want to deploy.
98
98
99
-
1. Select **Model catalog** from the left sidebar and find the model card of the model you want to deploy. In this article, you select a **Meta-Llama-3-8B-Instruct** model.
99
+
1. Select **Model catalog** from the left sidebar and find the model card of the model you want to deploy. In this article, you select a **Bria-2.3-Fast** model.
100
100
101
101
1. If you're deploying the model using Azure CLI, Python SDK, or ARM, copy the **Model ID**.
102
102
103
103
> [!IMPORTANT]
104
-
> Do not include the version when copying the **Model ID**. standard deployments always deploy the model's latest version available. For example, for the model ID `azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct/versions/3`, copy `azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct`.
104
+
> Do not include the version when copying the **Model ID**. Standard deployments always deploy the model's latest version available. For example, for the model ID `azureml://registries/azureml-bria/models/Bria-2.3-Fast/versions/1`, copy `azureml://registries/azureml-bria/models/Bria-2.3-Fast`.
The next section covers the steps for subscribing your workspace to a model offering. You can skip this section and go to [Deploy the model to a standard deployment](#deploy-the-model-to-a-standard-deployment), if you're deploying a Microsoft model.
109
109
110
110
## Subscribe your workspace to the model offering
111
111
112
-
standard deployments can deploy both Microsoft and non-Microsoft offered models. For Microsoft models (such as Phi-3 models), you don't need to create an Azure Marketplace subscription and you can [deploy them to standard deployments directly](#deploy-the-model-to-a-standard-deployment) to consume their predictions. For non-Microsoft models, you need to create the subscription first. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
112
+
Standard deployments can deploy both Microsoft and non-Microsoft offered models. For Microsoft models (such as Phi-3 models), you don't need to create an Azure Marketplace subscription and you can [deploy them to standard deployments directly](#deploy-the-model-to-a-standard-deployment) to consume their predictions. For non-Microsoft models, you need to create the subscription first. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
113
113
114
114
> [!NOTE]
115
115
> Models offered through the Azure Marketplace are available for deployment to standard deployments in specific regions. Check [Region availability for models in standard deployments](concept-endpoint-serverless-availability.md) to verify which models and regions are available. If the one you need is not listed, you can deploy to a workspace in a supported region and then [consume standard deployments from a different workspace](how-to-connect-models-serverless.md).
@@ -118,13 +118,12 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
118
118
119
119
# [Studio](#tab/azure-studio)
120
120
121
-
1. On the model's **Details** page, select **Deploy**. A **Deployment options** window opens up, giving you the choice between standard deployment and deployment using a managed compute.
121
+
1. On the model's **Details** page, select **Use this model**. A **Deployment options** window opens up, giving you the choice between standard deployment and deployment using a managed compute.
122
122
123
123
> [!NOTE]
124
-
> For models that can be deployed only via standard deployment, the standard deployment wizard opens up right after you select **Deploy** from the model's details page.
124
+
> For models that can be deployed only via standard deployment, the standard deployment wizard opens up right after you select **Use this model** from the model's details page.
125
125
126
126
1. Select **standard deployment with Azure AI Content Safety (preview)** to open the standard deployment wizard.
127
-
1. Select the checkbox to acknowledge the Microsoft purchase policy.
128
127
129
128
:::image type="content" source="media/how-to-deploy-models-serverless/deploy-pay-as-you-go.png" alt-text="A screenshot showing how to deploy a model with the standard deployment option." lightbox="media/how-to-deploy-models-serverless/deploy-pay-as-you-go.png":::
130
129
@@ -142,8 +141,8 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
@@ -250,15 +249,15 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
250
249
251
250
Once you've created a subscription for a non-Microsoft model, you can deploy the associated model to a standard deployment. For Microsoft models (such as Phi-3 models), you don't need to create a subscription.
252
251
253
-
The standard deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance organizations need. This deployment option doesn't require quota from your subscription.
252
+
The standard deployment provides a way to consume models as an API without hosting them on your subscription, while maintaining the enterprise security and compliance organizations need. This deployment option doesn't require quota from your subscription.
254
253
255
-
In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
254
+
In this section, you create an endpoint with the name **Bria-2.3-Fast**.
256
255
257
256
1. Create the serverless endpoint
258
257
259
258
# [Studio](#tab/azure-studio)
260
259
261
-
1. To deploy a Microsoft model that doesn't require subscribing to a model offering, select **Deploy** and then select **standard deployment with Azure AI Content Safety (preview)** to open the deployment wizard.
260
+
1. To deploy a Microsoft model that doesn't require subscribing to a model offering, select **Use this model** and then select **standard deployment with Azure AI Content Safety (preview)** to open the deployment wizard.
262
261
263
262
1. Alternatively, for a non-Microsoft model that requires a model subscription, if you've just subscribed your workspace to the model offer in the previous section, continue to select **Deploy**. Alternatively, select **Continue to deploy** (if your deployment wizard had the note *You already have an Azure Marketplace subscription for this workspace*).
264
263
@@ -277,8 +276,8 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
0 commit comments