Skip to content

Commit 314fae3

Browse files
author
Jill Grant
authored
Merge pull request #265930 from msakande/llama-deployment-AI-Studio-review
updates to llama deployment in AI studio
2 parents f8f32b2 + e5dbb8a commit 314fae3

File tree

6 files changed

+162
-126
lines changed

6 files changed

+162
-126
lines changed

articles/ai-studio/how-to/deploy-models-llama.md

Lines changed: 159 additions & 123 deletions
Large diffs are not rendered by default.
-231 KB
Loading
-68.9 KB
Loading
70.3 KB
Loading
-55.6 KB
Loading

articles/machine-learning/how-to-deploy-models-llama.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ If you need to deploy a different model, [deploy it to real-time endpoints](#dep
4646

4747
### Prerequisites
4848

49-
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure Machine Learning account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
49+
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
5050
- An Azure Machine Learning workspace and a compute instance. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.
5151

5252
> [!IMPORTANT]
@@ -89,7 +89,7 @@ To create a deployment:
8989
1. If this is your first time deploying the model in the workspace, you have to subscribe your workspace for the particular offering (for example, Llama-2-70b) from Azure Marketplace. This step requires that your account has the Azure subscription permissions and resource group permissions listed in the prerequisites. Each workspace has its own subscription to the particular Azure Marketplace offering, which allows you to control and monitor spending. Select **Subscribe and Deploy**.
9090

9191
> [!NOTE]
92-
> Subscribing a project to a particular Azure Marketplace offering (in this case, Llama-2-70b) requires that your account has **Contributor** or **Owner** access at the subscription level where the project is created. Alternatively, your user account can be assigned a custom role that has the Azure subscription permissions and resource group permissions listed in the [prerequisites](#prerequisites).
92+
> Subscribing a workspace to a particular Azure Marketplace offering (in this case, Llama-2-70b) requires that your account has **Contributor** or **Owner** access at the subscription level where the project is created. Alternatively, your user account can be assigned a custom role that has the Azure subscription permissions and resource group permissions listed in the [prerequisites](#prerequisites).
9393
9494
:::image type="content" source="media/how-to-deploy-models-llama/deploy-marketplace-terms.png" alt-text="A screenshot showing the terms and conditions of a given model." lightbox="media/how-to-deploy-models-llama/deploy-marketplace-terms.png":::
9595

@@ -400,7 +400,7 @@ Follow these steps to deploy a model such as `Llama-2-7b-chat` to a real-time en
400400
> If you don't have enough quota available in the selected project, you can use the option **I want to use shared quota and I acknowledge that this endpoint will be deleted in 168 hours**.
401401
402402
1. Select the **Virtual machine** and the **Instance count** that you want to assign to the deployment.
403-
1. Select if you want to create this deployment as part of a new endpoint or an existing one. Endpoints can host multiple deployments while keeping resources configuration exclusive for each of them. Deployments under the same endpoint share the endpoint URI and its access keys.
403+
1. Select if you want to create this deployment as part of a new endpoint or an existing one. Endpoints can host multiple deployments while keeping resource configuration exclusive for each of them. Deployments under the same endpoint share the endpoint URI and its access keys.
404404
1. Indicate if you want to enable **Inferencing data collection (preview)**.
405405
1. Indicate if you want to enable **Package Model (preview)**.
406406
1. Select **Deploy**. After a few moments, the endpoint's **Details** page opens up.

0 commit comments

Comments
 (0)