Skip to content

Commit 94dd735

Browse files
committed
managed pay-go deployment
1 parent 0dfcd77 commit 94dd735

File tree

4 files changed

+89
-0
lines changed

4 files changed

+89
-0
lines changed
Lines changed: 87 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,87 @@
1+
---
2+
title: PayGo Surcharge for protected models in Azure AI Foundry
3+
titleSuffix: Azure AI Foundry
4+
description: Learn how to deploy third-party protected models on Azure AI Foundry managed compute and understand how pay-as-you-go surcharge billing works.
5+
manager: scottpolly
6+
ms.service: azure-ai-foundry
7+
ms.custom:
8+
ms.topic: how-to
9+
ms.date: 06/20/2025
10+
ms.reviewer: tinaem
11+
reviewer: tinaem
12+
ms.author: mopeakande
13+
author: msakande
14+
---
15+
16+
**PayGo Surcharge for Models on Managed Compute**
17+
18+
Azure AI Foundry offers a comprehensive catalog of models, including models from third-party partner and the community. These third-party partner and community models offered for deployment on Managed Compute are either open or protected models. The deployment of protected models on Managed Compute involves pay as you go billing for the customer in two dimensions: per hour Azure Machine Learning Compute billing for the virtual machines employed in the deployment of the protected model and surcharge billing for the model as set by the model publisher on the Azure marketplace offer. This pay as you go billing of model surcharge and azure compute is pro-rated per minute based on the uptime of these managed online deployments.
19+
20+
In this article, you will learn how to use third party protected models offered via Azure Marketplace for deployment on Managed Compute within Azure AI Foundry Catalog. Azure AI Foundry enables a seamless subscription and transactability experience for these protected models as you create and consume your dedicated model deployments at scale.
21+
22+
**Prerequisites**
23+
24+
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
25+
26+
- If you don't have one, [create a hub based project](/azure/ai-foundry/how-to/create-projects?pivots=hub-project).
27+
28+
- Marketplace purchases enabled for your Azure subscription. Learn more [here](/azure/cost-management-billing/manage/enable-marketplace-purchases).
29+
30+
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned a *custom role* with the following permissions. User accounts assigned the *Owner* or *Contributor* role for the Azure subscription can also create deployments. For more information on permissions, see [Role-based access control in Azure AI Foundry portal](/azure/ai-foundry/concepts/rbac-azure-ai-foundry).
31+
32+
- On the Azure subscription—**to subscribe the workspace/project to the Azure Marketplace offering**:
33+
34+
- Microsoft.MarketplaceOrdering/agreements/offers/plans/read
35+
- Microsoft.MarketplaceOrdering/agreements/offers/plans/sign/action
36+
- Microsoft.MarketplaceOrdering/offerTypes/publishers/offers/plans/agreements/read
37+
- Microsoft.Marketplace/offerTypes/publishers/offers/plans/agreements/read
38+
- Microsoft.SaaS/register/action
39+
40+
- On the resource group—**to create and use the SaaS resource**:
41+
42+
- Microsoft.SaaS/resources/read
43+
- Microsoft.SaaS/resources/write
44+
45+
- On the workspace—**to deploy endpoints**:
46+
47+
- Microsoft.MachineLearningServices/workspaces/marketplaceModelSubscriptions/*
48+
- Microsoft.MachineLearningServices/workspaces/onlineEndpoints/*
49+
50+
**Subscribe and Deploy on Managed Compute**
51+
52+
1. Sign in to [Azure AI Foundry](https://ai.azure.com/?cid=learnDocs) and go to the Home page.
53+
54+
1. Select Model catalog from the left sidebar.
55+
56+
1. In the filters section, select the Deployment Option as Managed Compute.
57+
58+
1. Select the Collection and model of your choice. In this article, we are using **Cohere Command A** as an example.
59+
60+
1. Click on **Use this model** and pick the Managed Compute deployment option.
61+
62+
1. The Deploy wizard lets you choose from one of the supported VM SKUs for the model. You need to have Azure Machine Learning Compute quota for that SKU in your Azure subscription.
63+
64+
1. You can then customize your deployment configuration for parameters such as the instance count and select an existing endpoint for the deployment or create a new one. For this example, we consider an instance count of **1** and create a new endpoint for the deployment.
65+
66+
:::image type="content" source="media/deploy-models-managed-pay-go/image1.png" alt-text="Screenshot of the deployment configuration screen for a protected model in Azure AI Foundry.":::
67+
68+
1. Click **Next** to proceed to the pricing breakdown page.
69+
70+
1. Review the pricing breakdown for the deployment, terms of use and license agreement associated with the model's offer on Marketplace. The pricing breakdown helps inform what the aggregated pricing for the model deployed would be, where the surcharge for the model is a function of the number of GPUs in the VM instance that is selected in the previous steps. In addition to the applicable surcharge for the model, Azure Compute charges also apply based on your deployment configuration. If you have existing reservations or azure savings plan, the invoice for the compute charges will honor and reflect the discounted VM pricing.
71+
72+
:::image type="content" source="media/deploy-models-managed-pay-go/image2.png" alt-text="Screenshot of the pricing breakdown page for a protected model deployment in Azure AI Foundry.":::
73+
74+
1. Select the checkbox to acknowledge understanding of pricing and terms of use, and then, click **Deploy**. It takes about 15-20 mins for the deployment to complete.
75+
76+
**Network Isolation of deployments**
77+
78+
Collections in the model catalog can be deployed within your isolated networks using workspace managed virtual network. For more information on how to configure your workspace managed networks, see [here.](/azure/machine-learning/how-to-managed-network#configure-a-managed-virtual-network-to-allow-internet-outbound)
79+
80+
Limitation
81+
82+
An Azure AI Foundry project with ingress Public Network Access disabled can only support a single active deployment of one of the protected models from the catalog. Attempts to create more active deployments result in deployment creation failures.
83+
84+
85+
## Related content
86+
87+
* [How to deploy and inference a managed compute deployment](deploy-models-managed.md)
248 KB
Loading
161 KB
Loading

articles/ai-foundry/toc.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -167,6 +167,8 @@ items:
167167
items:
168168
- name: Deploy models via managed compute
169169
href: how-to/deploy-models-managed.md
170+
- name: Deploy models to managed compute pay go
171+
href: how-to/deploy-models-managed-pay-go.md
170172
- name: Healthcare AI models
171173
items:
172174
- name: Foundational AI models for healthcare

0 commit comments

Comments
 (0)