Skip to content

Commit f4a3fc5

Browse files
committed
freshness updates
1 parent a4f64e1 commit f4a3fc5

File tree

5 files changed

+21
-22
lines changed

5 files changed

+21
-22
lines changed

articles/machine-learning/how-to-deploy-models-serverless.md

Lines changed: 21 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ manager: scottpolly
66
ms.service: azure-machine-learning
77
ms.subservice: inferencing
88
ms.topic: how-to
9-
ms.date: 07/19/2024
9+
ms.date: 08/07/2025
1010
ms.reviewer: fasantia
1111
reviewer: santiagxf
1212
ms.author: scottpolly
@@ -19,7 +19,7 @@ ms.custom: build-2024, serverless, devx-track-azurecli
1919

2020
In this article, you learn how to deploy a model from the model catalog as a standard deployment.
2121

22-
[Certain models in the model catalog](concept-endpoint-serverless-availability.md) can be deployed as a standard deployment with Standard billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
22+
[Certain models in the model catalog](concept-endpoint-serverless-availability.md) can be deployed as a standard deployment with Standard billing. This deployment type provides a way to consume models as an API without hosting them on your subscription, while maintaining the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
2323

2424
This article uses a Meta Llama model deployment for illustration. However, you can use the same steps to deploy any of the [models in the model catalog that are available for standard deployment](concept-endpoint-serverless-availability.md).
2525

@@ -96,20 +96,20 @@ This article uses a Meta Llama model deployment for illustration. However, you c
9696
9797
1. Go to your workspace. To use the standard deployment offering, your workspace must belong to one of the [regions that are supported for serverless deployment](concept-endpoint-serverless-availability.md) for the particular model you want to deploy.
9898
99-
1. Select **Model catalog** from the left sidebar and find the model card of the model you want to deploy. In this article, you select a **Meta-Llama-3-8B-Instruct** model.
99+
1. Select **Model catalog** from the left sidebar and find the model card of the model you want to deploy. In this article, you select a **Bria-2.3-Fast** model.
100100
101101
1. If you're deploying the model using Azure CLI, Python SDK, or ARM, copy the **Model ID**.
102102
103103
> [!IMPORTANT]
104-
> Do not include the version when copying the **Model ID**. standard deployments always deploy the model's latest version available. For example, for the model ID `azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct/versions/3`, copy `azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct`.
104+
> Do not include the version when copying the **Model ID**. Standard deployments always deploy the model's latest version available. For example, for the model ID `azureml://registries/azureml-bria/models/Bria-2.3-Fast/versions/1`, copy `azureml://registries/azureml-bria/models/Bria-2.3-Fast`.
105105
106106
:::image type="content" source="media/how-to-deploy-models-serverless/model-card.png" alt-text="A screenshot showing a model's details page." lightbox="media/how-to-deploy-models-serverless/model-card.png":::
107107
108108
The next section covers the steps for subscribing your workspace to a model offering. You can skip this section and go to [Deploy the model to a standard deployment](#deploy-the-model-to-a-standard-deployment), if you're deploying a Microsoft model.
109109
110110
## Subscribe your workspace to the model offering
111111
112-
standard deployments can deploy both Microsoft and non-Microsoft offered models. For Microsoft models (such as Phi-3 models), you don't need to create an Azure Marketplace subscription and you can [deploy them to standard deployments directly](#deploy-the-model-to-a-standard-deployment) to consume their predictions. For non-Microsoft models, you need to create the subscription first. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
112+
Standard deployments can deploy both Microsoft and non-Microsoft offered models. For Microsoft models (such as Phi-3 models), you don't need to create an Azure Marketplace subscription and you can [deploy them to standard deployments directly](#deploy-the-model-to-a-standard-deployment) to consume their predictions. For non-Microsoft models, you need to create the subscription first. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
113113
114114
> [!NOTE]
115115
> Models offered through the Azure Marketplace are available for deployment to standard deployments in specific regions. Check [Region availability for models in standard deployments](concept-endpoint-serverless-availability.md) to verify which models and regions are available. If the one you need is not listed, you can deploy to a workspace in a supported region and then [consume standard deployments from a different workspace](how-to-connect-models-serverless.md).
@@ -118,13 +118,12 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
118118
119119
# [Studio](#tab/azure-studio)
120120
121-
1. On the model's **Details** page, select **Deploy**. A **Deployment options** window opens up, giving you the choice between standard deployment and deployment using a managed compute.
121+
1. On the model's **Details** page, select **Use this model**. A **Deployment options** window opens up, giving you the choice between standard deployment and deployment using a managed compute.
122122
123123
> [!NOTE]
124-
> For models that can be deployed only via standard deployment, the standard deployment wizard opens up right after you select **Deploy** from the model's details page.
124+
> For models that can be deployed only via standard deployment, the standard deployment wizard opens up right after you select **Use this model** from the model's details page.
125125
126126
1. Select **standard deployment with Azure AI Content Safety (preview)** to open the standard deployment wizard.
127-
1. Select the checkbox to acknowledge the Microsoft purchase policy.
128127
129128
:::image type="content" source="media/how-to-deploy-models-serverless/deploy-pay-as-you-go.png" alt-text="A screenshot showing how to deploy a model with the standard deployment option." lightbox="media/how-to-deploy-models-serverless/deploy-pay-as-you-go.png":::
130129
@@ -142,8 +141,8 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
142141
__subscription.yml__
143142
144143
```yml
145-
name: meta-llama3-8b-qwerty
146-
model_id: azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct
144+
name: bria-2.3-Fast
145+
model_id: azureml://registries/azureml-bria/models/Bria-2.3-Fast
147146
```
148147
149148
Use the _subscription.yml_ file to create the subscription:
@@ -155,7 +154,7 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
155154
# [Python SDK](#tab/python)
156155
157156
```python
158-
model_id="azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct"
157+
model_id="azureml://registries/azureml-bria/models/Bria-2.3-Fast"
159158
subscription_name="Meta-Llama-3-8B-Instruct"
160159
161160
marketplace_subscription = MarketplaceSubscription(
@@ -188,7 +187,7 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
188187
"type": "String"
189188
},
190189
"model_id": {
191-
"defaultValue": "azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct",
190+
"defaultValue": "azureml://registries/azureml-bria/models/Bria-2.3-Fast",
192191
"type": "String"
193192
}
194193
},
@@ -250,15 +249,15 @@ standard deployments can deploy both Microsoft and non-Microsoft offered models.
250249
251250
Once you've created a subscription for a non-Microsoft model, you can deploy the associated model to a standard deployment. For Microsoft models (such as Phi-3 models), you don't need to create a subscription.
252251
253-
The standard deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance organizations need. This deployment option doesn't require quota from your subscription.
252+
The standard deployment provides a way to consume models as an API without hosting them on your subscription, while maintaining the enterprise security and compliance organizations need. This deployment option doesn't require quota from your subscription.
254253
255-
In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
254+
In this section, you create an endpoint with the name **Bria-2.3-Fast**.
256255
257256
1. Create the serverless endpoint
258257
259258
# [Studio](#tab/azure-studio)
260259
261-
1. To deploy a Microsoft model that doesn't require subscribing to a model offering, select **Deploy** and then select **standard deployment with Azure AI Content Safety (preview)** to open the deployment wizard.
260+
1. To deploy a Microsoft model that doesn't require subscribing to a model offering, select **Use this model** and then select **standard deployment with Azure AI Content Safety (preview)** to open the deployment wizard.
262261
263262
1. Alternatively, for a non-Microsoft model that requires a model subscription, if you've just subscribed your workspace to the model offer in the previous section, continue to select **Deploy**. Alternatively, select **Continue to deploy** (if your deployment wizard had the note *You already have an Azure Marketplace subscription for this workspace*).
264263
@@ -277,8 +276,8 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
277276
__endpoint.yml__
278277
279278
```yml
280-
name: meta-llama3-8b-qwerty
281-
model_id: azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct
279+
name: bria-2.3-Fast
280+
model_id: azureml://registries/azureml-bria/models/Bria-2.3-Fast
282281
```
283282
284283
Use the _endpoint.yml_ file to create the endpoint:
@@ -290,7 +289,7 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
290289
# [Python SDK](#tab/python)
291290
292291
```python
293-
endpoint_name="meta-llama3-8b-qwerty"
292+
endpoint_name="bria-2.3-Fast"
294293
295294
serverless_endpoint = ServerlessEndpoint(
296295
name=endpoint_name,
@@ -318,15 +317,15 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
318317
"type": "String"
319318
},
320319
"endpoint_name": {
321-
"defaultValue": "meta-llama3-8b-qwerty",
320+
"defaultValue": "bria-2.3-Fast",
322321
"type": "String"
323322
},
324323
"location": {
325324
"defaultValue": "eastus2",
326325
"type": "String"
327326
},
328327
"model_id": {
329-
"defaultValue": "azureml://registries/azureml-meta/models/Meta-Llama-3-8B-Instruct",
328+
"defaultValue": "azureml://registries/azureml-bria/models/Bria-2.3-Fast",
330329
"type": "String"
331330
}
332331
},
@@ -411,7 +410,7 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
411410
# [Azure CLI](#tab/cli)
412411
413412
```azurecli
414-
az ml serverless-endpoint get-credentials -n meta-llama3-8b-qwerty
413+
az ml serverless-endpoint get-credentials -n bria-2.3-Fast
415414
```
416415
417416
# [Python SDK](#tab/python)
@@ -476,7 +475,7 @@ To delete a standard deployment:
476475
477476
```azurecli
478477
az ml serverless-endpoint delete \
479-
--name "meta-llama3-8b-qwerty"
478+
--name "bria-2.3-Fast"
480479
```
481480

482481
To delete the associated model subscription:
13.8 KB
Loading
-15.6 KB
Loading
5.44 KB
Loading
-70.7 KB
Loading

0 commit comments

Comments
 (0)