You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/how-to/deploy-models-managed-pay-go.md
+20-11Lines changed: 20 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -32,47 +32,57 @@ In this article, you learn how to use protected models from partners and communi
32
32
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned a *custom role* with the following permissions. User accounts assigned the *Owner* or *Contributor* role for the Azure subscription can also create deployments. For more information on permissions, see [Role-based access control in Azure AI Foundry portal](/azure/ai-foundry/concepts/rbac-azure-ai-foundry).
33
33
34
34
35
-
- On the Azure subscription—**to subscribe the workspace/project to the Azure Marketplace offering**:
35
+
- On the Azure subscription—**to subscribe the workspace/project to the Azure Marketplace offering**:
## Marketplace offer unit of measure and subscription scope
54
+
55
+
The surcharge for the models is a per GPU hour price set by the partner / publisher on Azure marketplace, for all the supported GPUs for the model to be deployed on Foundry managed compute.
56
+
57
+
User's subscriptions to the azure marketplace offers are scoped to a project resource within Azure AI Foundry. If a subscription to the marketplace offer already exists within a project, users will be notified of the same in the Deploy Wizard (reference snapshot below).
58
+
59
+
<insertimagefromdoc>
60
+
61
+
All SaaS subscriptions created in an Azure subscription are listed under 'Resources' of the Settings blade of the Azure subscription and can be filtered using Resource Type equals SaaS. The consumption-based surcharge is accrued to the associated SaaS subscription and billed to the user via Azure Marketplace. The user can view his invoice by clicking on 'View Billing' in the Overview tab of the respective SaaS subscription.
1. Select the **Deployment options** filter in the model catalog and choose **Managed compute**.
58
68
59
-
1. Filter the list further by selecting the **Collection** and model of your choice. In this article, we use **Cohere Command A** for illustration.
69
+
2. Filter the list further by selecting the **Collection** and model of your choice. In this article, we use **Cohere Command A** for illustration.
60
70
61
-
1. From the model's page, select **Use this model** to open the deployment wizard.
71
+
3. From the model's page, select **Use this model** to open the deployment wizard.
62
72
63
-
1. Choose from one of the supported VM SKUs for the model. You need to have Azure Machine Learning Compute quota for that SKU in your Azure subscription.
73
+
4. Choose from one of the supported VM SKUs for the model. You need to have Azure Machine Learning Compute quota for that SKU in your Azure subscription.
64
74
65
-
1. Select **Customize** to specify your deployment configuration for parameters such as the instance count. You can also select an existing endpoint for the deployment or create a new one. For this example, we specify an instance count of **1** and create a new endpoint for the deployment.
75
+
5. Select **Customize** to specify your deployment configuration for parameters such as the instance count. You can also select an existing endpoint for the deployment or create a new one. For this example, we specify an instance count of **1** and create a new endpoint for the deployment.
66
76
67
77
:::image type="content" source="../media/deploy-models-managed-pay-go/deployment-configuration.png" alt-text="Screenshot of the deployment configuration screen for a protected model in Azure AI Foundry." lightbox="../media/deploy-models-managed-pay-go/deployment-configuration.png":::
68
78
69
-
1. Select **Next** to proceed to the *pricing breakdown* page.
79
+
6. Select **Next** to proceed to the *pricing breakdown* page.
70
80
71
-
1. Review the pricing breakdown for the deployment, terms of use, and license agreement associated with the model's offer on Azure Marketplace. The pricing breakdown tells you what the aggregated pricing for the deployed model would be, where the surcharge for the model is a function of the number of GPUs in the VM instance that is selected in the previous steps. In addition to the applicable surcharge for the model, Azure compute charges also apply, based on your deployment configuration. If you have existing reservations or Azure savings plan, the invoice for the compute charges honors and reflects the discounted VM pricing.
81
+
7. Review the pricing breakdown for the deployment, terms of use, and license agreement associated with the model's offer on Azure Marketplace. The pricing breakdown tells you what the aggregated pricing for the deployed model would be, where the surcharge for the model is a function of the number of GPUs in the VM instance that is selected in the previous steps. In addition to the applicable surcharge for the model, Azure compute charges also apply, based on your deployment configuration. If you have existing reservations or Azure savings plan, the invoice for the compute charges honors and reflects the discounted VM pricing.
72
82
73
83
:::image type="content" source="../media/deploy-models-managed-pay-go/pricing-breakdown.png" alt-text="Screenshot of the pricing breakdown page for a protected model deployment in Azure AI Foundry." lightbox="../media/deploy-models-managed-pay-go/pricing-breakdown.png":::
74
84
75
-
1. Select the checkbox to acknowledge that you understand and agree to the terms of use. Then, select **Deploy**. It takes about 15-20 minutes for the deployment to complete.
85
+
8. Select the checkbox to acknowledge that you understand and agree to the terms of use. Then, select **Deploy**. Foundry creates the user's subscription to the marketplace offer and further on, the deployment of the model on managed compute. It takes about 15-20 minutes for the deployment to complete.
76
86
77
87
## Network Isolation of deployments
78
88
@@ -82,7 +92,6 @@ Collections in the model catalog can be deployed within your isolated networks u
82
92
83
93
An Azure AI Foundry project with ingress Public Network Access disabled can only support a single active deployment of one of the protected models from the catalog. Attempts to create more active deployments result in deployment creation failures.
84
94
85
-
86
95
## Related content
87
96
88
97
*[How to deploy and inference a managed compute deployment](deploy-models-managed.md)
0 commit comments