Commit 0cb7335

cost
1 parent 45f054d commit 0cb7335

1 file changed, +12 -12 lines changed

articles/ai-foundry/model-inference/how-to/manage-costs.md

Lines changed: 12 additions & 12 deletions
@@ -1,6 +1,6 @@
 ---
 title: Plan to manage costs for Azure AI Foundry Models
-description: Learn how to plan for and manage costs for Azure AI Foundry Models in Azure AI Foundry by using cost analysis in the Azure portal.
+description: Learn how to plan for and manage costs for Azure AI Foundry Models by using cost analysis in the Azure portal.
 author: santiagxf
 ms.author: fasantia
 ms.custom: subject-cost-optimization
@@ -26,15 +26,15 @@ Although this article is about planning for and managing costs for Azure AI Foun
 
 Language models understand and process inputs by breaking them down into tokens. For reference, each token is roughly four characters for typical English text. Models that can process images or audio break down them into tokens too for billing purposes. The number of tokens per image or audio content depends on the model and the resolution of the input.
 
-Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Services are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
+Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Foundry are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
 
 ### Cost breakdown
 
 To understand the breakdown of what makes up the cost, it can be helpful to use **Cost Analysis** tool in Azure portal. Follow these steps to understand the cost of inference:
 
 1. Go to [Azure AI Foundry Portal](https://ai.azure.com).
 
-2. In the upper right corner of the screen, select on the name of your Azure AI Services resource, or if you're working on an AI project, on the name of the project.
+2. In the upper right corner of the screen, select on the name of your Azure AI Foundry resource (formerly known as Azure AI Services), or if you're working on an AI project, on the name of the project.
 
 3. Select the name of the project. Azure portal opens in a new window.
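
Reviewer note on the hunk above: the token-billing paragraph (1,000 input tokens plus 1,000 output tokens, charged per 1,000 tokens) can be made concrete with a small sketch. This is only an illustration under stated assumptions: the function names are made up for this note, and the per-1,000-token prices are hypothetical placeholders, not real rates for any model.

```python
from decimal import Decimal


def billed_tokens(input_tokens: int, output_tokens: int) -> int:
    """Total tokens charged for one request: input and output both count."""
    return input_tokens + output_tokens


def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1k: Decimal,
    output_price_per_1k: Decimal,
) -> Decimal:
    """Estimate the charge when prices are quoted per 1,000 tokens.

    Input and output are priced separately because the article says
    billing meters account for inputs and outputs for each model.
    """
    input_cost = Decimal(input_tokens) / 1000 * input_price_per_1k
    output_cost = Decimal(output_tokens) / 1000 * output_price_per_1k
    return input_cost + output_cost


# The article's example: a 1,000-token JavaScript sample converted to Python.
total = billed_tokens(1000, 1000)

# Hypothetical prices, chosen only to make the arithmetic visible.
cost = estimate_cost(1000, 1000, Decimal("0.01"), Decimal("0.03"))

print(total)  # 2000
print(cost)   # 0.04
```

`Decimal` is used instead of `float` so that small per-token prices add up without binary rounding noise.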

@@ -45,32 +45,32 @@ To understand the breakdown of what makes up the cost, it can be helpful to use
 5. By default, cost analysis is scoped to the selected resource group.
 
 > [!IMPORTANT]
-> It's important to scope *Cost Analysis* to the resource group where the Azure AI Services resource is deployed. Cost meters associated with some provider model providers, like Mistral AI or Cohere, are displayed under the resource group instead of the Azure AI Services resource.
+> It's important to scope *Cost Analysis* to the resource group where the Azure AI Foundry resource is deployed. Cost meters associated with [Azure Ecosystem Models](#azure-ecosystem-models) are displayed under the resource group instead of the Azure AI Foundry resource.
 
 6. Modify **Group by** to **Meter**. You can now see that for this particular resource group, the source of the costs comes from different models series.
 
 :::image type="content" source="../media/manage-cost/cost-by-meter.png" alt-text="Screenshot of how to see the cost by each meter in the resource group." lightbox="../media/manage-cost/cost-by-meter.png":::
 
 The following sections explain the entries in details.
 
-### Azure OpenAI and Microsoft models
+### Azure Direct Models
 
-Azure OpenAI models and models offered as first-party consumption services from Microsoft (including DeepSeek family and Phi family of models) are charged directly and they show up as billing meters under each Azure AI services resource. This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
+[Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) are charged directly and they show up as billing meters under each Azure AI Foundry resource (formerly known Azure AI Services). This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
 
-:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for Azure OpenAI and Microsoft's models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
+:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for Azure OpenAI and Phi models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
 
-### Provider models
+### Azure Ecosystem models
 
-Models provided by another provider, like Mistral AI, Cohere, Meta AI, or AI21 Labs, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI services is deployed instead of to the Azure AI Services resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
+Models provided by third-party providers, like Cohere, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI Foundry (formerly known as Azure AI Services) is deployed instead of to the Azure AI Foundry resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
 
-:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
+:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
 
 > [!IMPORTANT]
-> This distinction between Azure OpenAI, Microsoft-offered models, and provider models only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
+> This distinction between [Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) and [Azure Ecosystem Models](../concepts/models.md#azure-ecosystem-models) only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
 
 ### Using Azure Prepayment
 
-You can pay for Azure OpenAI and Microsoft's models charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
+You can pay for Azure Direct Models' charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
 
 ### HTTP Error response code and billing status
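
Reviewer note on the hunk above: the changed text says that Marketplace-billed models appear under the category **Marketplace** with **Service Name** *SaaS*, and that those charges cannot be paid with Azure Prepayment credit. A rough sketch of that split over rows copied out of a cost report might look like the following. Everything about the row shape is an assumption made for this note: the keys `category`, `service_name`, and `cost`, the meter names, and the amounts are hypothetical and do not reflect a real export schema.

```python
from collections import defaultdict
from decimal import Decimal


def is_marketplace_saas(row: dict) -> bool:
    """True for rows matching the article's description of Marketplace billing."""
    return row["category"] == "Marketplace" and row["service_name"] == "SaaS"


def split_costs(rows: list[dict]) -> dict[str, Decimal]:
    """Sum costs into Marketplace SaaS rows and all remaining rows."""
    totals: dict[str, Decimal] = defaultdict(Decimal)
    for row in rows:
        bucket = "marketplace_saas" if is_marketplace_saas(row) else "non_marketplace"
        totals[bucket] += Decimal(row["cost"])
    return dict(totals)


# Hypothetical rows; field names and values are placeholders.
rows = [
    {"meter": "example-direct-input", "category": "Example", "service_name": "Example service", "cost": "1.25"},
    {"meter": "example-direct-output", "category": "Example", "service_name": "Example service", "cost": "2.50"},
    {"meter": "example-ecosystem-input", "category": "Marketplace", "service_name": "SaaS", "cost": "0.75"},
]

totals = split_costs(rows)
print(totals["non_marketplace"])   # 3.75
print(totals["marketplace_saas"])  # 0.75
```

The `non_marketplace` bucket is deliberately generic: a resource group can contain meters unrelated to model inference, so this split only isolates the Marketplace SaaS entries rather than identifying Azure Direct Models.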
