You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-studio/how-to/deploy-models-cohere-rerank.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -59,7 +59,7 @@ Rerank multilingual performs well on multilingual benchmarks such as Miracl.
59
59
60
60
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
61
61
62
-
You can deploy the previously mentioned Cohere models as a service with pay-as-you-go billing. Cohere offers these models through the Microsoft Azure Marketplace and can change or update the terms of use and pricing of these models.
62
+
You can deploy the previously mentioned Cohere models as a service with pay-as-you-go billing. Cohere offers these models through Microsoft Azure Marketplace and can change or update the terms of use and pricing of these models.
63
63
64
64
### Prerequisites
65
65
@@ -95,7 +95,7 @@ To create a deployment:
95
95
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
96
96
97
97
> [!NOTE]
98
-
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
98
+
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
99
99
100
100
1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
101
101
@@ -237,11 +237,11 @@ The `results` object is a dictionary with the following fields:
237
237
238
238
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
239
239
240
-
Cohere models deployed as serverless APIs with pay-as-you-go billing are offered by Cohere through the Azure Marketplace and integrated with Azure AI Foundry for use. You can find the Azure Marketplace pricing when deploying the model.
240
+
Cohere models deployed as serverless APIs with pay-as-you-go billing are offered by Cohere through Azure Marketplace and integrated with Azure AI Foundry for use. You can find Azure Marketplace pricing when deploying the model.
241
241
242
-
Each time a project subscribes to a given offer from the Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
242
+
Each time a project subscribes to a given offer from Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
243
243
244
-
For more information on how to track costs, see [monitor costs for models offered throughout the Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
244
+
For more information on how to track costs, see [monitor costs for models offered throughout Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
Copy file name to clipboardExpand all lines: articles/ai-studio/how-to/deploy-models-timegen-1.md
+21-23Lines changed: 21 additions & 23 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ description: Learn how to deploy TimeGEN-1 with Azure AI Foundry.
5
5
manager: scottpolly
6
6
ms.service: azure-ai-studio
7
7
ms.topic: how-to
8
-
ms.date: 5/21/2024
8
+
ms.date: 12/16/2024
9
9
ms.reviewer: kritifaujdar
10
10
reviewer: fkriti
11
11
ms.author: mopeakande
@@ -28,23 +28,15 @@ The Nixtla TimeGEN-1 is a generative, pretrained forecasting and anomaly detecti
28
28
29
29
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
30
30
31
-
You can deploy TimeGEN-1 as a serverless API with pay-as-you-go billing. Nixtla offers TimeGEN-1 through the Microsoft Azure Marketplace. Nixtla can change or update the terms of use and pricing of this model.
31
+
You can deploy TimeGEN-1 as a serverless API with pay-as-you-go billing. Nixtla offers TimeGEN-1 through Microsoft Azure Marketplace. Nixtla can change or update the terms of use and pricing of this model.
32
32
33
33
### Prerequisites
34
34
35
35
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions don't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
36
-
- An [Azure AI Foundry project](../how-to/create-projects.md). The serverless API model deployment offering for TimeGEN-1 is only available with projects created in these regions:
37
36
38
-
> [!div class="checklist"]
39
-
> * East US
40
-
> * East US 2
41
-
> * North Central US
42
-
> * South Central US
43
-
> * West US
44
-
> * West US 3
45
-
> * Sweden Central
37
+
- An [Azure AI Foundry hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Nixtla's TimeGEN-1 model is only available with hubs created in specific regions. For a list of these regions, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md#nixtla-models).
46
38
47
-
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
39
+
- An [Azure AI Foundry project](../how-to/create-projects.md).
48
40
49
41
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, visit [Role-based access control in Azure AI Foundry portal](../concepts/rbac-ai-studio.md).
50
42
@@ -89,20 +81,26 @@ These steps demonstrate the deployment of TimeGEN-1. To create a deployment:
4. Search for and select **TimeGEN-1** to open its Details page.
84
+
4. Select the model card of the model you want to deploy. In this article, you select **TimeGEN-1** to open the Model Details page.
85
+
93
86
1. Select **Deploy** to open a serverless API deployment window for the model.
94
-
1. Alternatively, you can initiate a deployment by starting from the **Models + endpoints** page in AI Foundry portal.
95
-
1. From the left navigation pane of your project, select **My assets** > **Models + endpoints**.
87
+
1. Alternatively, you can initiate a deployment from your project in the AI Foundry portal as follows:
88
+
89
+
1. From the left sidebar of your project, select **Models + Endpoints**.
96
90
1. Select **+ Deploy model** > **Deploy base model**.
97
-
1. Search for and select **TimeGEN-1**. to open the Model's Details page.
91
+
1. Search for and select **TimeGEN-1** to open the Model Details page.
98
92
1. Select **Confirm** to open a serverless API deployment window for the model.
99
-
1. Your current project is specified for the deployment. To successfully deploy the TimeGEN-1 model, your project must be in one of the regions listed in the [Prerequisites](#prerequisites) section.
100
-
1. In the deployment wizard, select the link to **Azure Marketplace Terms**, to learn more about the terms of use.
93
+
94
+
1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
101
95
1. Select the **Pricing and terms** tab to learn about pricing for the selected model.
102
-
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering. This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
96
+
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
97
+
98
+
> [!NOTE]
99
+
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Nixtla TimeGEN-1) are billed through Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
100
+
103
101
1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
104
102
1. Give the deployment a name. This name becomes part of the deployment API URL. This URL must be unique in each Azure region.
105
-
1. Select **Deploy**. Wait until the deployment is ready and you're redirected to the Deployments page.
103
+
1. Select **Deploy**. Wait until the deployment is ready and you're redirected to the **Model deployments** page.
106
104
1. Return to the Deployments page, select the deployment, and note the endpoint's **Target** URI and the Secret **Key**. For more information on using the APIs, see the [reference](#reference-for-timegen-1-deployed-as-a-serverless-api) section.
107
105
1.[!INCLUDE [Find your deployment details](../includes/find-deployments.md)]
108
106
@@ -273,11 +271,11 @@ This JSON sample is an example response:
273
271
274
272
### Cost and quota considerations for TimeGEN-1 deployed as a serverless API
275
273
276
-
Nixtla offers TimeGEN-1 deployed as a serverless API through the Azure Marketplace. TimeGEN-1 is integrated with Azure AI Foundry for use. You can find more information about Azure Marketplace pricing when you deploy the model.
274
+
Nixtla offers TimeGEN-1 deployed as a serverless API through Azure Marketplace. TimeGEN-1 is integrated with Azure AI Foundry for use. You can find more information about Azure Marketplace pricing when you deploy the model.
277
275
278
-
Each time a project subscribes to a given offer from the Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
276
+
Each time a project subscribes to a given offer from Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
279
277
280
-
For more information about how to track costs, visit [monitor costs for models offered throughout the Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
278
+
For more information about how to track costs, visit [monitor costs for models offered throughout Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
281
279
282
280
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits are insufficient for your scenarios.
0 commit comments