You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-studio/how-to/deploy-models-llama.md
+33-2Lines changed: 33 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -56,11 +56,41 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
56
56
57
57
### Prerequisites
58
58
59
+
# [Meta Llama 3](#tab/llama-three)
60
+
61
+
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
62
+
- An [Azure AI hub resource](../how-to/create-azure-ai-resource.md).
63
+
64
+
> [!IMPORTANT]
65
+
> For Meta Llama 3 models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **Sweden Central** regions.
66
+
67
+
- An [Azure AI project](../how-to/create-projects.md) in Azure AI Studio.
68
+
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
69
+
70
+
- On the Azure subscription—to subscribe the Azure AI project to the Azure Marketplace offering, once for each project, per offering:
For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
86
+
87
+
# [Meta Llama 2](#tab/llama-two)
88
+
59
89
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
60
90
- An [Azure AI hub resource](../how-to/create-azure-ai-resource.md).
61
91
62
92
> [!IMPORTANT]
63
-
> For Meta Llama models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **West US 3** regions.
93
+
> For Meta Llama 2 models, the pay-as-you-go model deployment offering is only available with AI hubs created in **East US 2** and **West US 3** regions.
64
94
65
95
- An [Azure AI project](../how-to/create-projects.md) in Azure AI Studio.
66
96
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
@@ -82,6 +112,7 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
82
112
83
113
For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
84
114
115
+
---
85
116
86
117
### Create a new deployment
87
118
@@ -96,7 +127,7 @@ To create a deployment:
96
127
97
128
1. On the model's **Details** page, select **Deploy** and then select **Pay-as-you-go**.
98
129
99
-
1. Select the project in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** region.
130
+
1. Select the project in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2**or **Sweden Central**region.
100
131
1. On the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use. You can also select the **Marketplace offer details** tab to learn about pricing for the selected model.
101
132
1. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering (for example, Meta-Llama-3-70B) from Azure Marketplace. This step requires that your account has the Azure subscription permissions and resource group permissions listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering, which allows you to control and monitor spending. Select **Subscribe and Deploy**.
If you need to deploy a different model, [deploy it to real-time endpoints](#deploy-meta-llama-models-to-real-time-endpoints) instead.
49
47
@@ -62,11 +60,41 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
62
60
63
61
### Prerequisites
64
62
63
+
# [Meta Llama 3](#tab/llama-three)
64
+
65
+
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
66
+
- An Azure Machine Learning workspace and a compute instance. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.
67
+
68
+
> [!IMPORTANT]
69
+
> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **Sweden Central** regions for Meta Llama 3 models.
70
+
71
+
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
72
+
73
+
- On the Azure subscription—to subscribe the workspace to the Azure Marketplace offering, once for each workspace, per offering:
For more information on permissions, see [Manage access to an Azure Machine Learning workspace](how-to-assign-roles.md).
89
+
90
+
91
+
# [Meta Llama 2](#tab/llama-two)
92
+
65
93
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
66
94
- An Azure Machine Learning workspace and a compute instance. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.
67
95
68
96
> [!IMPORTANT]
69
-
> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **West US 3** regions.
97
+
> Pay-as-you-go model deployment offering is only available in workspaces created in **East US 2** and **West US 3** regions for Meta Llama 2 models.
70
98
71
99
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure Machine Learning. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
72
100
@@ -86,6 +114,8 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
For more information on permissions, see [Manage access to an Azure Machine Learning workspace](how-to-assign-roles.md).
117
+
118
+
---
89
119
90
120
### Create a new deployment
91
121
@@ -94,7 +124,7 @@ To create a deployment:
94
124
# [Meta Llama 3](#tab/llama-three)
95
125
96
126
1. Go to [Azure Machine Learning studio](https://ml.azure.com/home).
97
-
1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2** region.
127
+
1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the **East US 2**or **Sweden Central**region.
98
128
1. Choose the model you want to deploy from the [model catalog](https://ml.azure.com/model/catalog).
99
129
100
130
Alternatively, you can initiate deployment by going to your workspace and selecting **Endpoints** > **Serverless endpoints** > **Create**.
0 commit comments