MicrosoftDocs
diff --git a/‎articles/ai-foundry/model-inference/breadcrumb/toc.yml‎
Lines changed: 1 addition & 1 deletion b/‎articles/ai-foundry/model-inference/breadcrumb/toc.yml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/ai-foundry/model-inference/concepts/content-filter.md‎
Lines changed: 3 additions & 1 deletion b/‎articles/ai-foundry/model-inference/concepts/content-filter.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎articles/ai-foundry/model-inference/concepts/models.md‎
Lines changed: 72 additions & 49 deletions b/‎articles/ai-foundry/model-inference/concepts/models.md‎
Lines changed: 72 additions & 49 deletions
diff --git a/‎articles/ai-foundry/model-inference/faq.yml‎
Lines changed: 6 additions & 8 deletions b/‎articles/ai-foundry/model-inference/faq.yml‎
Lines changed: 6 additions & 8 deletions
diff --git a/‎articles/ai-foundry/model-inference/how-to/configure-marketplace.md‎
Lines changed: 35 additions & 0 deletions b/‎articles/ai-foundry/model-inference/how-to/configure-marketplace.md‎
Lines changed: 35 additions & 0 deletions
diff --git a/‎articles/ai-foundry/model-inference/how-to/github/create-model-deployments.md‎
Lines changed: 10 additions & 7 deletions b/‎articles/ai-foundry/model-inference/how-to/github/create-model-deployments.md‎
Lines changed: 10 additions & 7 deletions
diff --git a/‎articles/ai-foundry/model-inference/how-to/manage-costs.md‎
Lines changed: 18 additions & 17 deletions b/‎articles/ai-foundry/model-inference/how-to/manage-costs.md‎
Lines changed: 18 additions & 17 deletions
@@ -6,6 +6,6 @@
     tocHref: /azure/ai-foundry/
     topicHref: /azure/ai-studio/index
     items:
-    - name: Model Inference
+    - name: Foundry Models
       tocHref: /azure/ai-foundry/model-inference/
       topicHref: /azure/ai-foundry/model-inference/index
@@ -11,7 +11,9 @@ ms.custom: ignite-2024, github-universe-2024
 manager: nitinme
 ---
 
-# Content filtering for Azure AI Foundry Models in Azure AI Foundry Service
+
+# Content filtering for Azure AI Foundry Models
+
 
 > [!IMPORTANT]
 > The content filtering system isn't applied to prompts and completions processed by audio models such as Whisper in Azure OpenAI in Azure AI Foundry Models. Learn more about the [Audio models in Azure OpenAI](../../../ai-services/openai/concepts/models.md?tabs=standard-audio#standard-deployment-regional-models-by-endpoint).
 
@@ -12,18 +12,16 @@ metadata:
   author: santiagxf
 title: Foundry Models frequently asked questions
 summary: |
-  If you can't find answers to your questions in this document, and still need help check the [Azure AI Foundry services (formerly known Azure AI Services)support options guide](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-services/openai/context/context).
+  If you can't find answers to your questions in this document, and still need help check the [Azure AI Foundry services (formerly known Azure AI Services) support options guide](../../ai-services/cognitive-services-support-options.md?context=/azure/ai-services/openai/context/context).
 sections:
   - name: General
     questions:
       - question: |
-          What's the difference between Azure OpenAI in Foundry Models and Foundry Models?
+          What's the difference between Azure OpenAI and Foundry Models?
         answer: |
-          Azure OpenAI gives customers access to advanced language models from OpenAI. Foundry Models extends such capability giving customers access to all the flagship models in Azure AI under the same service, endpoint, and credentials. It includes Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 labs, etc. Customers can seamlessly switch between models without changing their code.
+          Azure OpenAI gives customers access to advanced language models from OpenAI. Foundry Models extends such capability giving customers access to all the flagship models in Azure AI Foundry under the same service, endpoint, and credentials. It includes Azure OpenAI, Cohere, Mistral AI, Meta Llama, AI21 labs, etc. Customers can seamlessly switch between models without changing their code.
 
-          Both Azure OpenAI and Foundry Models are part of the Azure AI Foundry services (formerly known Azure AI Services) family and build on top of the same security and enterprise promise of Azure.
-
-          While Foundry Models focus on inference, Azure OpenAI can be used with more advanced APIs like batch, fine-tuning, assistants, and files.
+          Azure OpenAI is an Azure Direct model family in Foundry Models.
       - question: |
           What's the difference between Azure AI services and Azure AI Foundry?
         answer: |
@@ -90,13 +88,13 @@ sections:
       - question: |
           Where can I see the bill details?
         answer: |
-          Billing and costs are displayed in [Azure Cost Management + Billing](/azure/cost-management-billing/understand/download-azure-daily-usage). You can see the usage details in the [Azure portal](https://portal.azure.com).
+          Billing and costs are displayed in [Microsoft Cost Management + Billing](/azure/cost-management-billing/understand/download-azure-daily-usage). You can see the usage details in the [Azure portal](https://portal.azure.com).
 
           Billing isn't shown in Azure AI Foundry portal.
       - question: |
           How can I place a spending limit to my bill?
         answer: |
-          You can set up a spending limit in the [Azure portal](https://portal.azure.com) under **Azure Cost Management + Billing**. This limit prevents you from spending more than the limit you set. Once spending limit is reached, the subscription will be disabled and you won't be able to use the endpoint until the next billing cycle.
+          You can set up a spending limit in the [Azure portal](https://portal.azure.com) under **Microsoft Cost Management + Billing**. This limit prevents you from spending more than the limit you set. Once spending limit is reached, the subscription will be disabled and you won't be able to use the endpoint until the next billing cycle.
   - name: Data and Privacy
     questions:
       - question: |
 
@@ -0,0 +1,35 @@
+---
+title: Configure access to Azure Ecosystem Models
+description: Learn how to configure access to Azure Ecosystem Models.
+author: santiagxf
+ms.author: fasantia 
+ms.service: azure-ai-model-inference
+ms.topic: how-to
+ms.date: 5/11/2025
+---
+
+# Configure access to Azure Ecosystem Models
+
+Certain models in AI Foundry Models are offered directly by the model provider through the Azure Marketplace. This article explains the requirements to use Azure Marketplace if you plan to use such models in your workloads. Azure Direct Models, like DeepSeek or Phi, or Azure OpenAI Service models, like GPTs, don't have this requirement. 
+
+> [!TIP]
+> All models offered in AI Foundry Models are hosted in Microsoft's Azure environment and the Service does NOT interact with any external services or model provider.
+
+:::image type="content" source="../media/configure-marketplace/azure-marketplace-3p.png" alt-text="A diagram with the overall architecture of Azure Marketplace integration with AI Foundry Models." lightbox="../media/configure-marketplace/azure-marketplace-3p.png":::
+
+[!INCLUDE [marketplace-rbac](../includes/configure-marketplace/rbac.md)]
+
+## Country availability
+
+Azure Ecosystem Models with Pay-as-you-go billing is available only to users whose Azure subscription belongs to a billing account in a country/region where the model offer is available. Availability varies per model provider and model SKU. Read [Region availability for models](../../how-to/deploy-models-serverless-availability.md).
+
+## Troubleshooting
+
+Use the following troubleshooting guide to find and solve errors when deploying third-party models in AI Foundry Models:
+
+| Error | Description |
+|-------|-------------|
+| This offer is not made available by the provider in the country where your account and Azure Subscription are registered. | The model provider didn't make the specific model SKU available in the country where the subscription is registered. Each model provider may decide to make the offer available in specific countries and such may vary by model SKU. You need to deploy the model to a subscription having billing on a supported country. See the list of countries at [Region availability for models](../../how-to/deploy-models-serverless-availability.md).  |
+| Marketplace Subscription purchase eligibility check failed. | The model provider didn't make the specific model SKU available in the country where the subscription is registered or it isn't available in the region where you deployed the Azure AI Services resource. See [Region availability for models](../../how-to/deploy-models-serverless-availability.md). |
+| Unable to create a model deployment for model "model-name". If the error persists, please contact HIT (Human Intelligence Team) via this link: https://go.microsoft.com/fwlink/?linkid=2101400&clcid=0x409 and request to allowlist the Azure subscription. | Azure Marketplace rejects the request to create a model subscription. Such can be due to multiple reasons, including subscribing to the model offering too often, or from multiple subscriptions at the same time. Please contact support using the provided link indicating your subscription ID. |
+| This offer is not available for purchasing by subscriptions belonging to Microsoft Azure Cloud Solution Providers | Cloud Solution Provider (CSP) subscriptions do not have the ability to purchase third-party model offerings. You can consider models offered as first-party consumption service. |
@@ -12,26 +12,28 @@ ms.author: fasantia
 recommendations: false
 ---
 
-# Add and configure models to Azure AI services
+# Add and configure models from Azure AI Foundry Models
 
 You can decide and configure which models are available for inference in the Azure AI services resource model's inference endpoint. When a given model is configured, you can then generate predictions from it by indicating its model name or deployment name on your requests. No further changes are required in your code to use it.
 
-In this article, you learn how to add a new model to Azure AI Foundry Models.
+
+In this article, you learn how to add a new model from Azure AI Foundry Models.
 
 ## Prerequisites
 
 To complete this article, you need:
 
-* An Azure subscription. If you're using [GitHub Models](https://docs.github.com/en/github-models/), you can upgrade your experience and create an Azure subscription in the process. Read [Upgrade from GitHub Models to Foundry Models](../quickstart-github-models.md) if it's your case.
-* An Azure AI services resource. For more information, see [Create an Azure AI Services resource](../../../../ai-services/multi-service-resource.md?context=/azure/ai-services/model-inference/context/context).
+* An Azure subscription. If you're using [GitHub Models](https://docs.github.com/en/github-models/), you can upgrade your experience and create an Azure subscription in the process. Read [Upgrade from GitHub Models to Azure AI Foundry Models](../quickstart-github-models.md) if it's your case.
+* An Azure AI services resource. For more information, see [Create an Azure AI Foundry resource](../quickstart-create-resources.md).
+
 
 ## Add a model
 
 [!INCLUDE [add-model-deployments](../../includes/github/add-model-deployments.md)]
 
 ## Use the model
 
-Deployed models in Azure AI services can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource.
+Deployed models in Azure AI Foundry Models can be consumed using the [Azure AI model's inference endpoint](../../concepts/endpoints.md) for the resource.
 
 To use it:
 
@@ -52,6 +54,7 @@ When creating model deployments, you can configure additional settings including
 > [!NOTE]
 > Configurations may vary depending on the model you're deploying.
 
-## Next steps
+## Related content
+
+* [Develop applications using Azure AI Foundry Models](../../supported-languages.md)
 
-* [Develop applications using Foundry Models service in Azure AI services](../../supported-languages.md)
 
@@ -1,6 +1,6 @@
 ---
-title: Plan to manage costs for Azure AI Foundry Models in Azure AI Foundry Service
-description: Learn how to plan for and manage costs for Azure AI Foundry Models in Azure AI Foundry Service by using cost analysis in the Azure portal.
+title: Plan to manage costs for Azure AI Foundry Models
+description: Learn how to plan for and manage costs for Azure AI Foundry Models by using cost analysis in the Azure portal.
 author: santiagxf
 ms.author: fasantia 
 ms.custom: subject-cost-optimization
@@ -9,12 +9,12 @@ ms.topic: how-to
 ms.date: 1/21/2025
 ---
 
+# Plan to manage costs for Azure AI Foundry Models
 
-# Plan to manage costs for Azure AI Foundry Models in Azure AI Foundry Service
+This article describes how you can view, plan for, and manage costs for Azure AI Foundry Models.
 
-This article describes how you can view, plan for, and manage costs for Foundry Models in Azure AI Foundry Service.
+Although this article is about planning for and managing costs for Azure AI Foundry Models, you're billed for all Azure services and resources used in your Azure subscription.
 
-Although this article is about planning for and managing costs for Foundry Models in Azure AI Foundry Service, you're billed for all Azure services and resources used in your Azure subscription.
 
 ## Prerequisites
 
@@ -24,17 +24,18 @@ Although this article is about planning for and managing costs for Foundry Model
 
 ## Understand Foundry Models billing model
 
-Language models understand and process inputs by breaking them down into tokens. For reference, each token is roughly four characters for typical English text. Models that can process images or audio break down them into tokens too for billing purposes. The number of tokens per image or audio content depends on the model and the resolution of the input.
 
-Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Services are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
+Language models understand and process inputs by breaking them down into tokens. For reference, each token is roughly four characters for typical English text. Models that can process images or audio break them down into tokens too for billing purposes. The number of tokens per image or audio content depends on the model and the resolution of the input.
+
+Costs per token vary depending on which model series you choose but in all cases models deployed in Azure AI Foundry are charged per 1,000 tokens. Token costs are for both input and output. For example, suppose you have a 1,000 token JavaScript code sample that you ask a model to convert to Python. You would be charged approximately 1,000 tokens for the initial input request sent, and 1,000 more tokens for the output that is received in response for a total of 2,000 tokens.
 
 ### Cost breakdown
 
 To understand the breakdown of what makes up the cost, it can be helpful to use **Cost Analysis** tool in Azure portal. Follow these steps to understand the cost of inference:
 
 1. Go to [Azure AI Foundry Portal](https://ai.azure.com).
 
-2. In the upper right corner of the screen, select on the name of your Azure AI Services resource, or if you're working on an AI project, on the name of the project.
+2. In the upper right corner of the screen, select on the name of your Azure AI Foundry resource (formerly known as Azure AI Services), or if you're working on an AI project, on the name of the project.
 
 3. Select the name of the project. Azure portal opens in a new window.
 
@@ -45,32 +46,32 @@ To understand the breakdown of what makes up the cost, it can be helpful to use
 5. By default, cost analysis is scoped to the selected resource group.
 
     > [!IMPORTANT]
-    > It's important to scope *Cost Analysis* to the resource group where the Azure AI Services resource is deployed. Cost meters associated with some provider model providers, like Mistral AI or Cohere, are displayed under the resource group instead of the Azure AI Services resource.
+    > It's important to scope *Cost Analysis* to the resource group where the Azure AI Foundry resource is deployed. Cost meters associated with [Azure Ecosystem Models](#azure-ecosystem-models) are displayed under the resource group instead of the Azure AI Foundry resource.
 
 6. Modify **Group by** to **Meter**. You can now see that for this particular resource group, the source of the costs comes from different models series.  
 
     :::image type="content" source="../media/manage-cost/cost-by-meter.png" alt-text="Screenshot of how to see the cost by each meter in the resource group." lightbox="../media/manage-cost/cost-by-meter.png":::
 
 The following sections explain the entries in details.
 
-### Azure OpenAI and Microsoft models
+### Azure Direct Models
 
-Azure OpenAI models and models offered as first-party consumption services from Microsoft (including DeepSeek family and Phi family of models) are charged directly and they show up as billing meters under each Azure AI services resource. This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
+[Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) are charged directly and they show up as billing meters under each Azure AI Foundry resource (formerly known Azure AI Services). This billing happens directly through Microsoft. When you inspect your bill, you notice billing meters accounting for inputs and outputs for each consumed model.
 
-:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for Azure OpenAI and Microsoft's models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
+:::image type="content" source="../media/manage-cost/cost-by-meter-1p.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for Azure OpenAI and Phi models. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-1p.png":::
 
-### Provider models
+### Azure Ecosystem models
 
-Models provided by another provider, like Mistral AI, Cohere, Meta AI, or AI21 Labs, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI services is deployed instead of to the Azure AI Services resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
+Models provided by third-party providers, like Cohere, are billed using Azure Marketplace. As opposite to Microsoft billing meters, those entries are associated with the resource group where your Azure AI Foundry (formerly known as Azure AI Services) is deployed instead of to the Azure AI Foundry resource itself. Given model providers charge you directly, you see entries under the category **Marketplace** and **Service Name** *SaaS* accounting for inputs and outputs for each consumed model.
 
-:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Services resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
+:::image type="content" source="../media/manage-cost/cost-by-meter-saas.png" alt-text="Screenshot of cost analysis dashboard scoped to the resource group where the Azure AI Foundry resource is deployed, highlighting the meters for models billed throughout Azure Marketplace. Cost is group by meter." lightbox="../media/manage-cost/cost-by-meter-saas.png":::
 
 > [!IMPORTANT]
-> This distinction between Azure OpenAI, Microsoft-offered models, and provider models only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
+> This distinction between [Azure Direct Models](../concepts/models.md#azure-direct-models) (including Azure OpenAI) and [Azure Ecosystem Models](../concepts/models.md#azure-ecosystem-models) only affects how the model is made available to you and how you are charged. In all cases, models are hosted within Azure cloud and there is no interaction with external services or providers.
 
 ### Using Azure Prepayment
 
-You can pay for Azure OpenAI and Microsoft's models charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
+You can pay for Azure Direct Models' charges with your Azure Prepayment credit. However, you can't use Azure Prepayment credit to pay for charges for other provider models given they're billed through Azure Marketplace.
 
 ### HTTP Error response code and billing status