Skip to content

Commit 5c74880

Browse files
Merge pull request #283831 from santiagxf/santiagxf-patch-3
Update how-to-deploy-models-serverless.md
2 parents 1d15a47 + 03c88b7 commit 5c74880

File tree

1 file changed

+8
-9
lines changed

1 file changed

+8
-9
lines changed

articles/machine-learning/how-to-deploy-models-serverless.md

Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -106,12 +106,12 @@ The next section covers the steps for subscribing your workspace to a model offe
106106
107107
## Subscribe your workspace to the model offering
108108
109-
For non-Microsoft models offered through the Azure Marketplace, you can deploy them to serverless API endpoints to consume their predictions. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
109+
Serverless API endpoints can deploy both Microsoft and non-Microsoft offered models. For Microsoft models (such as Phi-3 models), you don't need to create an Azure Marketplace subscription and you can [deploy them to serverless API endpoints directly](#deploy-the-model-to-a-serverless-api-endpoint) to consume their predictions. For non-Microsoft models, you need to create the subscription first. If it's your first time deploying the model in the workspace, you have to subscribe your workspace for the particular model offering from the Azure Marketplace. Each workspace has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending.
110110
111111
> [!NOTE]
112112
> Models offered through the Azure Marketplace are available for deployment to serverless API endpoints in specific regions. Check [Region availability for models in serverless API endpoints](concept-endpoint-serverless-availability.md) to verify which models and regions are available. If the one you need is not listed, you can deploy to a workspace in a supported region and then [consume serverless API endpoints from a different workspace](how-to-connect-models-serverless.md).
113113
114-
1. Create the model's marketplace subscription. When you create a subscription, you accept the terms and conditions associated with the model offer.
114+
1. Create the model's marketplace subscription. When you create a subscription, you accept the terms and conditions associated with the model offer. Remember you don't need to perform this step for Microsoft offered models (like Phi-3).
115115
116116
# [Studio](#tab/azure-studio)
117117
@@ -172,8 +172,8 @@ For non-Microsoft models offered through the Azure Marketplace, you can deploy t
172172
"$schema": "https://schema.management.azure.com/schemas/2019-04-01/deploymentTemplate.json#",
173173
"contentVersion": "1.0.0.0",
174174
"parameters": {
175-
"workspace_name": {
176-
"defaultValue": "my-workspace",
175+
"project_name": {
176+
"defaultValue": "my-project",
177177
"type": "String"
178178
},
179179
"subscription_name": {
@@ -190,8 +190,7 @@ For non-Microsoft models offered through the Azure Marketplace, you can deploy t
190190
{
191191
"type": "Microsoft.MachineLearningServices/workspaces/marketplaceSubscriptions",
192192
"apiVersion": "2024-04-01",
193-
"name": "[concat(parameters('workspace_name'), '/', parameters('subscription_name'))]",
194-
"location": "[parameters('location')]",
193+
"name": "[concat(parameters('project_name'), '/', parameters('subscription_name'))]",
195194
"properties": {
196195
"modelId": "[parameters('model_id')]"
197196
}
@@ -305,8 +304,8 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
305304
"$schema": "https://schema.management.azure.com/schemas/2019-04-01/deploymentTemplate.json#",
306305
"contentVersion": "1.0.0.0",
307306
"parameters": {
308-
"workspace_name": {
309-
"defaultValue": "my-workspace",
307+
"project_name": {
308+
"defaultValue": "my-project",
310309
"type": "String"
311310
},
312311
"endpoint_name": {
@@ -327,7 +326,7 @@ In this section, you create an endpoint with the name **meta-llama3-8b-qwerty**.
327326
{
328327
"type": "Microsoft.MachineLearningServices/workspaces/serverlessEndpoints",
329328
"apiVersion": "2024-04-01",
330-
"name": "[concat(parameters('workspace_name'), '/', parameters('endpoint_name'))]",
329+
"name": "[concat(parameters('project_name'), '/', parameters('endpoint_name'))]",
331330
"location": "[parameters('location')]",
332331
"sku": {
333332
"name": "Consumption"

0 commit comments

Comments
 (0)