Skip to content

Commit 098bb65

Browse files
Merge pull request #4749 from santiagxf/santiagxf/foundry-models
Foundry Models T2
2 parents 83467b0 + c4397bd commit 098bb65

File tree

41 files changed

+227
-122
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

41 files changed

+227
-122
lines changed

articles/ai-foundry/model-inference/breadcrumb/toc.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,6 @@
66
tocHref: /azure/ai-foundry/
77
topicHref: /azure/ai-studio/index
88
items:
9-
- name: Model Inference
9+
- name: Foundry Models
1010
tocHref: /azure/ai-foundry/model-inference/
1111
topicHref: /azure/ai-foundry/model-inference/index

articles/ai-foundry/model-inference/concepts/content-filter.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,9 @@ ms.custom: ignite-2024, github-universe-2024
1111
manager: nitinme
1212
---
1313

14-
# Content filtering for Azure AI Foundry Models in Azure AI Foundry Service
14+
15+
# Content filtering for Azure AI Foundry Models
16+
1517

1618
> [!IMPORTANT]
1719
> The content filtering system isn't applied to prompts and completions processed by audio models such as Whisper in Azure OpenAI in Azure AI Foundry Models. Learn more about the [Audio models in Azure OpenAI](../../../ai-services/openai/concepts/models.md?tabs=standard-audio#standard-deployment-regional-models-by-endpoint).

articles/ai-foundry/model-inference/concepts/models.md

Lines changed: 72 additions & 49 deletions
Large diffs are not rendered by default.

articles/ai-foundry/model-inference/faq.yml

Lines changed: 6 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -12,18 +12,16 @@ metadata:
1212
author: santiagxf
1313
title: Foundry Models frequently asked questions
1414
summary: |
15-
If you can't find answers to your questions in this document, and still need help check the [Azure AI Foundry services (formerly known Azure AI Services)support options guide](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-services/openai/context/context).
15+
If you can't find answers to your questions in this document, and still need help check the [Azure AI Foundry services (formerly known Azure AI Services) support options guide](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-services/openai/context/context).
1616
sections:
1717
- name: General
1818
questions:
1919
- question: |
20-
What's the difference between Azure OpenAI in Foundry Models and Foundry Models?
20+
What's the difference between Azure OpenAI and Foundry Models?
2121
answer: |
22-
Azure OpenAI gives customers access to advanced language models from OpenAI. Foundry Models extends such capability giving customers access to all the flagship models in Azure AI under the same service, endpoint, and credentials. It includes Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 labs, etc. Customers can seamlessly switch between models without changing their code.
22+
Azure OpenAI gives customers access to advanced language models from OpenAI. Foundry Models extends such capability giving customers access to all the flagship models in Azure AI Foundry under the same service, endpoint, and credentials. It includes Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 labs, etc. Customers can seamlessly switch between models without changing their code.
2323
24-
Both Azure OpenAI and Foundry Models are part of the Azure AI Foundry services (formerly known Azure AI Services) family and build on top of the same security and enterprise promise of Azure.
25-
26-
While Foundry Models focus on inference, Azure OpenAI can be used with more advanced APIs like batch, fine-tuning, assistants, and files.
24+
Azure OpenAI is an Azure Direct model family in Foundry Models.
2725
- question: |
2826
What's the difference between Azure AI services and Azure AI Foundry?
2927
answer: |
@@ -90,13 +88,13 @@ sections:
9088
- question: |
9189
Where can I see the bill details?
9290
answer: |
93-
Billing and costs are displayed in [Azure Cost Management + Billing](/azure/cost-management-billing/understand/download-azure-daily-usage). You can see the usage details in the [Azure portal](https://portal.azure.com).
91+
Billing and costs are displayed in [Microsoft Cost Management + Billing](/azure/cost-management-billing/understand/download-azure-daily-usage). You can see the usage details in the [Azure portal](https://portal.azure.com).
9492
9593
Billing isn't shown in Azure AI Foundry portal.
9694
- question: |
9795
How can I place a spending limit to my bill?
9896
answer: |
99-
You can set up a spending limit in the [Azure portal](https://portal.azure.com) under **Azure Cost Management + Billing**. This limit prevents you from spending more than the limit you set. Once spending limit is reached, the subscription will be disabled and you won't be able to use the endpoint until the next billing cycle.
97+
You can set up a spending limit in the [Azure portal](https://portal.azure.com) under **Microsoft Cost Management + Billing**. This limit prevents you from spending more than the limit you set. Once spending limit is reached, the subscription will be disabled and you won't be able to use the endpoint until the next billing cycle.
10098
- name: Data and Privacy
10199
questions:
102100
- question: |
Lines changed: 35 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,35 @@
1+
---
2+
title: Configure access to Azure Ecosystem Models
3+
description: Learn how to configure access to Azure Ecosystem Models.
4+
author: santiagxf
5+
ms.author: fasantia
6+
ms.service: azure-ai-model-inference
7+
ms.topic: how-to
8+
ms.date: 5/11/2025
9+
---
10+
11+
# Configure access to Azure Ecosystem Models
12+
13+
Certain models in AI Foundry Models are offered directly by the model provider through the Azure Marketplace. This article explains the requirements to use Azure Marketplace if you plan to use such models in your workloads. Azure Direct Models, like DeepSeek or Phi, or Azure OpenAI Service models, like GPTs, don't have this requirement.
14+
15+
> [!TIP]
16+
> All models offered in AI Foundry Models are hosted in Microsoft's Azure environment and the Service does NOT interact with any external services or model provider.
17+
18+
:::image type="content" source="../media/configure-marketplace/azure-marketplace-3p.png" alt-text="A diagram with the overall architecture of Azure Marketplace integration with AI Foundry Models." lightbox="../media/configure-marketplace/azure-marketplace-3p.png":::
19+
20+
[!INCLUDE [marketplace-rbac](../includes/configure-marketplace/rbac.md)]
21+
22+
## Country availability
23+
24+
Azure Ecosystem Models with Pay-as-you-go billing is available only to users whose Azure subscription belongs to a billing account in a country/region where the model offer is available. Availability varies per model provider and model SKU. Read [Region availability for models](../../how-to/deploy-models-serverless-availability.md).
25+
26+
## Troubleshooting
27+
28+
Use the following troubleshooting guide to find and solve errors when deploying third-party models in AI Foundry Models:
29+
30+
| Error | Description |
31+
|-------|-------------|
32+
| This offer is not made available by the provider in the country where your account and Azure Subscription are registered. | The model provider didn't make the specific model SKU available in the country where the subscription is registered. Each model provider may decide to make the offer available in specific countries and such may vary by model SKU. You need to deploy the model to a subscription having billing on a supported country. See the list of countries at [Region availability for models](../../how-to/deploy-models-serverless-availability.md). |
33+
| Marketplace Subscription purchase eligibility check failed. | The model provider didn't make the specific model SKU available in the country where the subscription is registered or it isn't available in the region where you deployed the Azure AI Services resource. See [Region availability for models](../../how-to/deploy-models-serverless-availability.md). |
34+
| Unable to create a model deployment for model "model-name". If the error persists, please contact HIT (Human Intelligence Team) via this link: https://go.microsoft.com/fwlink/?linkid=2101400&clcid=0x409 and request to allowlist the Azure subscription. | Azure Marketplace rejects the request to create a model subscription. Such can be due to multiple reasons, including subscribing to the model offering too often, or from multiple subscriptions at the same time. Please contact support using the provided link indicating your subscription ID. |
35+
| This offer is not available for purchasing by subscriptions belonging to Microsoft Azure Cloud Solution Providers | Cloud Solution Provider (CSP) subscriptions do not have the ability to purchase third-party model offerings. You can consider models offered as first-party consumption service. |

articles/ai-foundry/model-inference/how-to/github/create-model-deployments.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -12,26 +12,28 @@ ms.author: fasantia
1212
recommendations: false
1313
---
1414

15-
# Add and configure models to Azure AI services
15+
# Add and configure models from Azure AI Foundry Models
1616

1717
You can decide and configure which models are available for inference in the Azure AI services resource model's inference endpoint. When a given model is configured, you can then generate predictions from it by indicating its model name or deployment name on your requests. No further changes are required in your code to use it.
1818

19-
In this article, you learn how to add a new model to Azure AI Foundry Models.
19+
20+
In this article, you learn how to add a new model from Azure AI Foundry Models.
2021

2122
## Prerequisites
2223

2324
To complete this article, you need:
2425

25-
* An Azure subscription. If you're using [GitHub Models](https://docs.github.com/en/github-models/), you can upgrade your experience and create an Azure subscription in the process. Read [Upgrade from GitHub Models to Foundry Models](../quickstart-github-models.md) if it's your case.
26-
* An Azure AI services resource. For more information, see [Create an Azure AI Services resource](../../../../ai-services/multi-service-resource.md?context=/azure/ai-services/model-inference/context/context).
26+
* An Azure subscription. If you're using [GitHub Models](https://docs.github.com/en/github-models/), you can upgrade your experience and create an Azure subscription in the process. Read [Upgrade from GitHub Models to Azure AI Foundry Models](../quickstart-github-models.md) if it's your case.
27+
* An Azure AI services resource. For more information, see [Create an Azure AI Foundry resource](../quickstart-create-resources.md).
28+
2729

2830
## Add a model
2931

3032
[!INCLUDE [add-model-deployments](../../includes/github/add-model-deployments.md)]
3133

3234
## Use the model
3335

34-
Deployed models in Azure AI services can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource.
36+
Deployed models in Azure AI Foundry Models can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource.
3537

3638
To use it:
3739

@@ -52,6 +54,7 @@ When creating model deployments, you can configure additional settings including
5254
> [!NOTE]
5355
> Configurations may vary depending on the model you're deploying.
5456
55-
## Next steps
57+
## Related content
58+
59+
* [Develop applications using Azure AI Foundry Models](../../supported-languages.md)
5660

57-
* [Develop applications using Foundry Models service in Azure AI services](../../supported-languages.md)

articles/ai-foundry/model-inference/how-to/manage-costs.md

Lines changed: 18 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Plan to manage costs for Azure AI Foundry Models in Azure AI Foundry Service
3-
description: Learn how to plan for and manage costs for Azure AI Foundry Models in Azure AI Foundry Service by using cost analysis in the Azure portal.
2+
title: Plan to manage costs for Azure AI Foundry Models
3+
description: Learn how to plan for and manage costs for Azure AI Foundry Models by using cost analysis in the Azure portal.
44
author: santiagxf
55
ms.author: fasantia
66
ms.custom: subject-cost-optimization
@@ -9,12 +9,12 @@ ms.topic: how-to
99
ms.date: 1/21/2025
1010
---
1111

12+
# Plan to manage costs for Azure AI Foundry Models
1213

13-
# Plan to manage costs for Azure AI Foundry Models in Azure AI Foundry Service
14+
This article describes how you can view, plan for, and manage costs for Azure AI Foundry Models.
1415

15-
This article describes how you can view, plan for, and manage costs for Foundry Models in Azure AI Foundry Service.
16+
Although this article is about planning for and managing costs for Azure AI Foundry Models, you're billed for all Azure services and resources used in your Azure subscription.
1617

17-
Although this article is about planning for and managing costs for Foundry Models in Azure AI Foundry Service, you're billed for all Azure services and resources used in your Azure subscription.
1818

1919
## Prerequisites
2020

@@ -24,17 +24,18 @@ Although this article is about planning for and managing costs for Foundry Model
2424

2525
## Understand Foundry Models billing model
2626

27-
Language models understand and process inputs by breaking them down into tokens. For reference, each token is roughly four characters for typical English text. Models that can process images or audio break down them into tokens too for billing purposes. The number of tokens per image or audio content depends on the model and the resolution of the input.
2827

29-
Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Services are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
28+
Language models understand and process inputs by breaking them down into tokens. For reference, each token is roughly four characters for typical English text. Models that can process images or audio break them down into tokens too for billing purposes. The number of tokens per image or audio content depends on the model and the resolution of the input.
29+
30+
Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Foundry are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
3031

3132
### Cost breakdown
3233

3334
To understand the breakdown of what makes up the cost, it can be helpful to use **Cost Analysis** tool in Azure portal. Follow these steps to understand the cost of inference:
3435

3536
1. Go to [Azure AI Foundry Portal](https://ai.azure.com).
3637

37-
2. In the upper right corner of the screen, select on the name of your Azure AI Services resource, or if you're working on an AI project, on the name of the project.
38+
2. In the upper right corner of the screen, select on the name of your Azure AI Foundry resource (formerly known as Azure AI Services), or if you're working on an AI project, on the name of the project.
3839

3940
3. Select the name of the project. Azure portal opens in a new window.
4041

@@ -45,32 +46,32 @@ To understand the breakdown of what makes up the cost, it can be helpful to use
4546
5. By default, cost analysis is scoped to the selected resource group.
4647

4748
> [!IMPORTANT]
48-
> It's important to scope *Cost Analysis* to the resource group where the Azure AI Services resource is deployed. Cost meters associated with some provider model providers, like Mistral AI or Cohere, are displayed under the resource group instead of the Azure AI Services resource.
49+
> It's important to scope *Cost Analysis* to the resource group where the Azure AI Foundry resource is deployed. Cost meters associated with [Azure Ecosystem Models](#azure-ecosystem-models) are displayed under the resource group instead of the Azure AI Foundry resource.
4950
5051
6. Modify **Group by** to **Meter**. You can now see that for this particular resource group, the source of the costs comes from different models series.
5152

5253
:::image type="content" source="../media/manage-cost/cost-by-meter.png" alt-text="Screenshot of how to see the cost by each meter in the resource group." lightbox="../media/manage-cost/cost-by-meter.png":::
5354

5455
The following sections explain the entries in details.
5556

56-
### Azure OpenAI and Microsoft models
57+
### Azure Direct Models
5758

58-
Azure OpenAI models and models offered as first-party consumption services from Microsoft (including DeepSeek family and Phi family of models) are charged directly and they show up as billing meters under each Azure AI services resource. This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
59+
[Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) are charged directly and they show up as billing meters under each Azure AI Foundry resource (formerly known Azure AI Services). This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
5960

60-
:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for Azure OpenAI and Microsoft's models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
61+
:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for Azure OpenAI and Phi models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
6162

62-
### Provider models
63+
### Azure Ecosystem models
6364

64-
Models provided by another provider, like Mistral AI, Cohere, Meta AI, or AI21 Labs, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI services is deployed instead of to the Azure AI Services resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
65+
Models provided by third-party providers, like Cohere, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI Foundry (formerly known as Azure AI Services) is deployed instead of to the Azure AI Foundry resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
6566

66-
:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
67+
:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
6768

6869
> [!IMPORTANT]
69-
> This distinction between Azure OpenAI, Microsoft-offered models, and provider models only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
70+
> This distinction between [Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) and [Azure Ecosystem Models](../concepts/models.md#azure-ecosystem-models) only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
7071
7172
### Using Azure Prepayment
7273

73-
You can pay for Azure OpenAI and Microsoft's models charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
74+
You can pay for Azure Direct Models' charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
7475

7576
### HTTP Error response code and billing status
7677

0 commit comments

Comments
 (0)