Skip to content

Commit 9d11626

Browse files
Merge pull request #2016 from msakande/freshness-timegen1-ai-foundry
TimeGEN-1: update procedural steps to align with new UX
2 parents 756a030 + c66d44a commit 9d11626

File tree

2 files changed

+26
-28
lines changed

2 files changed

+26
-28
lines changed

articles/ai-studio/how-to/deploy-models-cohere-rerank.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ Rerank multilingual performs well on multilingual benchmarks such as Miracl.
5959

6060
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
6161

62-
You can deploy the previously mentioned Cohere models as a service with pay-as-you-go billing. Cohere offers these models through the Microsoft Azure Marketplace and can change or update the terms of use and pricing of these models.
62+
You can deploy the previously mentioned Cohere models as a service with pay-as-you-go billing. Cohere offers these models through Microsoft Azure Marketplace and can change or update the terms of use and pricing of these models.
6363

6464
### Prerequisites
6565

@@ -95,7 +95,7 @@ To create a deployment:
9595
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
9696

9797
> [!NOTE]
98-
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
98+
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
9999
100100
1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
101101

@@ -237,11 +237,11 @@ The `results` object is a dictionary with the following fields:
237237

238238
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
239239

240-
Cohere models deployed as serverless APIs with pay-as-you-go billing are offered by Cohere through the Azure Marketplace and integrated with Azure AI Foundry for use. You can find the Azure Marketplace pricing when deploying the model.
240+
Cohere models deployed as serverless APIs with pay-as-you-go billing are offered by Cohere through Azure Marketplace and integrated with Azure AI Foundry for use. You can find Azure Marketplace pricing when deploying the model.
241241

242-
Each time a project subscribes to a given offer from the Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
242+
Each time a project subscribes to a given offer from Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
243243

244-
For more information on how to track costs, see [monitor costs for models offered throughout the Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
244+
For more information on how to track costs, see [monitor costs for models offered throughout Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
245245

246246

247247

articles/ai-studio/how-to/deploy-models-timegen-1.md

Lines changed: 21 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to deploy TimeGEN-1 with Azure AI Foundry.
55
manager: scottpolly
66
ms.service: azure-ai-studio
77
ms.topic: how-to
8-
ms.date: 5/21/2024
8+
ms.date: 12/16/2024
99
ms.reviewer: kritifaujdar
1010
reviewer: fkriti
1111
ms.author: mopeakande
@@ -28,23 +28,15 @@ The Nixtla TimeGEN-1 is a generative, pretrained forecasting and anomaly detecti
2828

2929
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
3030

31-
You can deploy TimeGEN-1 as a serverless API with pay-as-you-go billing. Nixtla offers TimeGEN-1 through the Microsoft Azure Marketplace. Nixtla can change or update the terms of use and pricing of this model.
31+
You can deploy TimeGEN-1 as a serverless API with pay-as-you-go billing. Nixtla offers TimeGEN-1 through Microsoft Azure Marketplace. Nixtla can change or update the terms of use and pricing of this model.
3232

3333
### Prerequisites
3434

3535
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions don't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
36-
- An [Azure AI Foundry project](../how-to/create-projects.md). The serverless API model deployment offering for TimeGEN-1 is only available with projects created in these regions:
3736

38-
> [!div class="checklist"]
39-
> * East US
40-
> * East US 2
41-
> * North Central US
42-
> * South Central US
43-
> * West US
44-
> * West US 3
45-
> * Sweden Central
37+
- An [Azure AI Foundry hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Nixtla's TimeGEN-1 model is only available with hubs created in specific regions. For a list of these regions, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md#nixtla-models).
4638

47-
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
39+
- An [Azure AI Foundry project](../how-to/create-projects.md).
4840

4941
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, visit [Role-based access control in Azure AI Foundry portal](../concepts/rbac-ai-studio.md).
5042

@@ -89,20 +81,26 @@ These steps demonstrate the deployment of TimeGEN-1. To create a deployment:
8981

9082
[!INCLUDE [open-catalog](../includes/open-catalog.md)]
9183

92-
4. Search for and select **TimeGEN-1** to open its Details page.
84+
4. Select the model card of the model you want to deploy. In this article, you select **TimeGEN-1** to open the Model Details page.
85+
9386
1. Select **Deploy** to open a serverless API deployment window for the model.
94-
1. Alternatively, you can initiate a deployment by starting from the **Models + endpoints** page in AI Foundry portal.
95-
1. From the left navigation pane of your project, select **My assets** > **Models + endpoints**.
87+
1. Alternatively, you can initiate a deployment from your project in the AI Foundry portal as follows:
88+
89+
1. From the left sidebar of your project, select **Models + Endpoints**.
9690
1. Select **+ Deploy model** > **Deploy base model**.
97-
1. Search for and select **TimeGEN-1**. to open the Model's Details page.
91+
1. Search for and select **TimeGEN-1** to open the Model Details page.
9892
1. Select **Confirm** to open a serverless API deployment window for the model.
99-
1. Your current project is specified for the deployment. To successfully deploy the TimeGEN-1 model, your project must be in one of the regions listed in the [Prerequisites](#prerequisites) section.
100-
1. In the deployment wizard, select the link to **Azure Marketplace Terms**, to learn more about the terms of use.
93+
94+
1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
10195
1. Select the **Pricing and terms** tab to learn about pricing for the selected model.
102-
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering. This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
96+
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
97+
98+
> [!NOTE]
99+
> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Nixtla TimeGEN-1) are billed through Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
100+
103101
1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
104102
1. Give the deployment a name. This name becomes part of the deployment API URL. This URL must be unique in each Azure region.
105-
1. Select **Deploy**. Wait until the deployment is ready and you're redirected to the Deployments page.
103+
1. Select **Deploy**. Wait until the deployment is ready and you're redirected to the **Model deployments** page.
106104
1. Return to the Deployments page, select the deployment, and note the endpoint's **Target** URI and the Secret **Key**. For more information on using the APIs, see the [reference](#reference-for-timegen-1-deployed-as-a-serverless-api) section.
107105
1. [!INCLUDE [Find your deployment details](../includes/find-deployments.md)]
108106

@@ -273,11 +271,11 @@ This JSON sample is an example response:
273271

274272
### Cost and quota considerations for TimeGEN-1 deployed as a serverless API
275273

276-
Nixtla offers TimeGEN-1 deployed as a serverless API through the Azure Marketplace. TimeGEN-1 is integrated with Azure AI Foundry for use. You can find more information about Azure Marketplace pricing when you deploy the model.
274+
Nixtla offers TimeGEN-1 deployed as a serverless API through Azure Marketplace. TimeGEN-1 is integrated with Azure AI Foundry for use. You can find more information about Azure Marketplace pricing when you deploy the model.
277275

278-
Each time a project subscribes to a given offer from the Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
276+
Each time a project subscribes to a given offer from Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
279277

280-
For more information about how to track costs, visit [monitor costs for models offered throughout the Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
278+
For more information about how to track costs, visit [monitor costs for models offered throughout Azure Marketplace](./costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
281279

282280
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits are insufficient for your scenarios.
283281

0 commit comments

Comments
 (0)