Commit 4a5a5c0

Merge pull request #273540 from ssalgadodev/patch-90
Update how-to-deploy-models-llama.md
2 parents fb62a8e + d666e12 commit 4a5a5c0

2 files changed: +67 -6 lines changed

articles/ai-studio/how-to/deploy-models-llama.md

Lines changed: 33 additions & 2 deletions
@@ -56,11 +56,41 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep

 ### Prerequisites

+# [Meta Llama 3](#tab/llama-three)
+
+- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
+- An [Azure AI hub resource](../how-to/create-azure-ai-resource.md).
+
+> [!IMPORTANT]
+> For Meta Llama 3 models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **Sweden Central** regions.
+
+- An [Azure AI project](../how-to/create-projects.md) in Azure AI Studio.
+- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
+
+- On the Azure subscription—to subscribe the Azure AI project to the Azure Marketplace offering, once for each project, per offering:
+- `Microsoft.MarketplaceOrdering/agreements/offers/plans/read`
+- `Microsoft.MarketplaceOrdering/agreements/offers/plans/sign/action`
+- `Microsoft.MarketplaceOrdering/offerTypes/publishers/offers/plans/agreements/read`
+- `Microsoft.Marketplace/offerTypes/publishers/offers/plans/agreements/read`
+- `Microsoft.SaaS/register/action`
+
+- On the resource group—to create and use the SaaS resource:
+- `Microsoft.SaaS/resources/read`
+- `Microsoft.SaaS/resources/write`
+
+- On the Azure AI project—to deploy endpoints (the Azure AI Developer role contains these permissions already):
+- `Microsoft.MachineLearningServices/workspaces/marketplaceModelSubscriptions/*`
+- `Microsoft.MachineLearningServices/workspaces/serverlessEndpoints/*`
+
+For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
+
+# [Meta Llama 2](#tab/llama-two)
+
 - An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
 - An [Azure AI hub resource](../how-to/create-azure-ai-resource.md).

 > [!IMPORTANT]
-> For Meta Llama models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **West US 3** regions.
+> For Meta Llama 2 models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **West US 3** regions.

 - An [Azure AI project](../how-to/create-projects.md) in Azure AI Studio.
 - Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
@@ -82,6 +112,7 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep

 For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).

+---

 ### Create a new deployment

@@ -96,7 +127,7 @@ To create a deployment:

 1. On the model's **Details** page, select **Deploy** and then select **Pay-as-you-go**.

-1. Select the project in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** region.
+1. Select the project in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** or **Sweden Central** region.
 1. On the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use. You can also select the **Marketplace offer details** tab to learn about pricing for the selected model.
 1. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering (for example, Meta-Llama-3-70B) from Azure Marketplace. This step requires that your account has the Azure subscription permissions and resource group permissions listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering, which allows you to control and monitor spending. Select **Subscribe and Deploy**.
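
Editor's note (not part of this commit): the custom-role permissions listed in the Prerequisites hunk of this file can be collected into a single role definition. The sketch below assumes the usual Azure custom role JSON shape (`Name`, `IsCustom`, `Actions`, `NotActions`, `AssignableScopes`). The role name, the `build_role_definition` helper, and the choice to make the role assignable at subscription scope are illustrative assumptions; the article scopes the three permission groups to the subscription, the resource group, and the project respectively.

```python
import json

# Permissions copied from the Prerequisites text in this diff, grouped by the
# scope at which the article says each group is needed.
SUBSCRIPTION_ACTIONS = [
    "Microsoft.MarketplaceOrdering/agreements/offers/plans/read",
    "Microsoft.MarketplaceOrdering/agreements/offers/plans/sign/action",
    "Microsoft.MarketplaceOrdering/offerTypes/publishers/offers/plans/agreements/read",
    "Microsoft.Marketplace/offerTypes/publishers/offers/plans/agreements/read",
    "Microsoft.SaaS/register/action",
]
RESOURCE_GROUP_ACTIONS = [
    "Microsoft.SaaS/resources/read",
    "Microsoft.SaaS/resources/write",
]
PROJECT_ACTIONS = [
    "Microsoft.MachineLearningServices/workspaces/marketplaceModelSubscriptions/*",
    "Microsoft.MachineLearningServices/workspaces/serverlessEndpoints/*",
]


def build_role_definition(subscription_id: str,
                          name: str = "Llama PAYG Deployer (example)") -> dict:
    """Return a custom role definition holding all nine listed permissions.

    Assumption: one role, assignable at subscription scope, is acceptable.
    Narrower scopes per group would follow the article more closely.
    """
    return {
        "Name": name,  # hypothetical role name
        "IsCustom": True,
        "Description": "Subscribe to and deploy Meta Llama pay-as-you-go offerings.",
        "Actions": SUBSCRIPTION_ACTIONS + RESOURCE_GROUP_ACTIONS + PROJECT_ACTIONS,
        "NotActions": [],
        "AssignableScopes": [f"/subscriptions/{subscription_id}"],
    }


if __name__ == "__main__":
    # Placeholder subscription ID; replace with a real one before use.
    role = build_role_definition("00000000-0000-0000-0000-000000000000")
    print(json.dumps(role, indent=2))
```

The printed JSON could be saved to a file and passed to `az role definition create --role-definition`, after replacing the placeholder subscription ID.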

articles/machine-learning/how-to-deploy-models-llama.md

Lines changed: 34 additions & 4 deletions
@@ -41,9 +41,7 @@ The following models are available in Azure Marketplace for Meta Llama models wh
 # [Meta Llama 3](#tab/llama-three)

 * [Meta Llama-3-8B (preview)](https://aka.ms/aistudio/landing/meta-llama-3-8b-base)
-* [Meta Llama-3 8B-Instruct (preview)](https://aka.ms/aistudio/landing/meta-llama-3-8b-chat)
 * [Meta Llama-3-70B (preview)](https://aka.ms/aistudio/landing/meta-llama-3-70b-base)
-* [Meta Llama-3 70B-Instruct (preview)](https://aka.ms/aistudio/landing/meta-llama-3-70b-chat)

 If you need to deploy a different model, [deploy it to real-time endpoints](#deploy-meta-llama-models-to-real-time-endpoints) instead.

@@ -62,11 +60,41 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep

 ### Prerequisites

+# [Meta Llama 3](#tab/llama-three)
+
+- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
+- An Azure Machine Learning workspace and a compute instance. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.
+
+> [!IMPORTANT]
+> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **Sweden Central** regions for Meta Llama 3 models.
+
+- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
+
+- On the Azure subscription—to subscribe the workspace to the Azure Marketplace offering, once for each workspace, per offering:
+- `Microsoft.MarketplaceOrdering/agreements/offers/plans/read`
+- `Microsoft.MarketplaceOrdering/agreements/offers/plans/sign/action`
+- `Microsoft.MarketplaceOrdering/offerTypes/publishers/offers/plans/agreements/read`
+- `Microsoft.Marketplace/offerTypes/publishers/offers/plans/agreements/read`
+- `Microsoft.SaaS/register/action`
+
+- On the resource group—to create and use the SaaS resource:
+- `Microsoft.SaaS/resources/read`
+- `Microsoft.SaaS/resources/write`
+
+- On the workspace—to deploy endpoints (the Azure Machine Learning data scientist role contains these permissions already):
+- `Microsoft.MachineLearningServices/workspaces/marketplaceModelSubscriptions/*`
+- `Microsoft.MachineLearningServices/workspaces/serverlessEndpoints/*`
+
+For more information on permissions, see [Manage access to an Azure Machine Learning workspace](how-to-assign-roles.md).
+
+
+# [Meta Llama 2](#tab/llama-two)
+
 - An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
 - An Azure Machine Learning workspace and a compute instance. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.

 > [!IMPORTANT]
-> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **West US 3** regions.
+> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **West US 3** regions for Meta Llama 2 models.

 - Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:

@@ -86,6 +114,8 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
 - `Microsoft.MachineLearningServices/workspaces/serverlessEndpoints/*`

 For more information on permissions, see [Manage access to an Azure Machine Learning workspace](how-to-assign-roles.md).
+
+---

 ### Create a new deployment

@@ -94,7 +124,7 @@ To create a deployment:
 # [Meta Llama 3](#tab/llama-three)

 1. Go to [Azure Machine Learning studio](https://ml.azure.com/home).
-1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** region.
+1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** or **Sweden Central** region.
 1. Choose the model you want to deploy from the [model catalog](https://ml.azure.com/model/catalog).

 Alternatively, you can initiate deployment by going to your workspace and selecting **Endpoints** > **Serverless endpoints** > **Create**.
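
Editor's note (not part of this commit): the region constraints this change introduces can be summarized as a small lookup. This is an illustrative sketch, not an Azure API. The `PAYG_REGIONS` table and the `supports_payg` helper are hypothetical names, and the region lists are taken only from the `[!IMPORTANT]` notes in the diff above, so they reflect the docs as of this commit rather than current availability.

```python
# Regions per model family, as stated in the [!IMPORTANT] notes of this commit.
PAYG_REGIONS = {
    "llama-3": ("East US 2", "Sweden Central"),
    "llama-2": ("East US 2", "West US 3"),
}


def _normalize(region: str) -> str:
    """Treat display names ('East US 2') and short names ('eastus2') alike."""
    return region.replace(" ", "").lower()


def supports_payg(model_family: str, region: str) -> bool:
    """Return True if the docs list `region` for pay-as-you-go deployment.

    Unknown model families return False rather than raising.
    """
    allowed = PAYG_REGIONS.get(model_family.lower(), ())
    return _normalize(region) in {_normalize(r) for r in allowed}


if __name__ == "__main__":
    checks = [
        ("llama-3", "Sweden Central"),
        ("llama-3", "West US 3"),
        ("llama-2", "West US 3"),
    ]
    for family, region in checks:
        print(f"{family} in {region}: {supports_payg(family, region)}")
```

A check like this could run before creating a hub or workspace, since the diff states that the pay-as-you-go offering depends on the region in which that resource was created.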
