---
title: How to use the Azure AI model inference endpoint to consume models
titleSuffix: Azure AI Foundry
description: Learn how to use the Azure AI model inference endpoint to consume models
manager: scottpolly
author: msakande
reviewer: santiagxf
ms.service: azure-ai-model-inference
ms.topic: how-to
ms.date: 1/21/2025
ms.author: mopeakande
ms.reviewer: fasantia
---

# Use the Azure AI model inference endpoint to consume models

Azure AI model inference in Azure AI services allows customers to consume the most powerful models from flagship model providers through a single endpoint and set of credentials. You can switch between models and consume them from your application without changing a single line of code.

This article explains how to use the inference endpoint to invoke deployed models.

## Endpoints

Azure AI services expose multiple endpoints, depending on the type of work you need:

> [!div class="checklist"]
> * Azure AI model inference endpoint
> * Azure OpenAI endpoint

The **Azure AI model inference endpoint** allows customers to use a single endpoint, with the same authentication and schema, to generate inference for the deployed models in the resource. All the models in the resource support this capability. This endpoint follows the [Azure AI model inference API](../../../ai-studio/reference/reference-model-inference-api.md).

**Azure OpenAI** models deployed to AI services also support the Azure OpenAI API. This endpoint exposes the full capabilities of OpenAI models and supports more features, such as assistants, threads, files, and batch inference.

To learn more about how to use the **Azure OpenAI endpoint**, see the [Azure OpenAI service documentation](../../../ai-services/openai/overview.md).

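The two endpoints differ mainly in their URL shape. The following sketch is illustrative only: the resource name and deployment name are hypothetical, and the exact hostnames depend on how your Azure AI services resource is configured.

```python
# Illustrative only: "my-resource" and "gpt-4o-deployment" are hypothetical.
resource = "my-resource"

# One shared endpoint serves every model deployed in the resource.
inference_endpoint = f"https://{resource}.services.ai.azure.com/models"

# Azure OpenAI models additionally expose a deployment-scoped endpoint.
deployment = "gpt-4o-deployment"
openai_endpoint = (
    f"https://{resource}.openai.azure.com/openai/deployments/{deployment}"
)

print(inference_endpoint)
print(openai_endpoint)
```

Because the first URL doesn't name a deployment, the request itself must indicate which deployment to use; the next section explains how that routing works.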
## Using the routing capability in the Azure AI model inference endpoint

The inference endpoint routes requests to a given deployment by matching the parameter `model` inside of the request to the name of the deployment. This means that *deployments work as an alias of a given model under certain configurations*. This flexibility allows you to deploy a given model multiple times in the service, each under a different configuration if needed.

:::image type="content" source="../media/endpoint/endpoint-routing.png" alt-text="An illustration showing how routing works for a Meta-llama-3.2-8b-instruct model by indicating such name in the parameter 'model' inside of the payload request." lightbox="../media/endpoint/endpoint-routing.png":::

For example, if you create a deployment named `Mistral-large`, you can invoke it as follows:

[!INCLUDE [code-create-chat-client](../includes/code-create-chat-client.md)]

For a chat model, you can create a request as follows:

[!INCLUDE [code-create-chat-completion](../includes/code-create-chat-completion.md)]
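Regardless of the SDK you use, the request body ultimately carries the deployment name in the `model` field, which is what the endpoint matches against. A minimal sketch of such a payload (the deployment name and prompt are hypothetical):

```python
import json

# Sketch of a chat completions request body for the shared inference
# endpoint. The "model" field carries the *deployment name* that the
# endpoint uses for routing ("Mistral-large" is a hypothetical example).
payload = {
    "model": "Mistral-large",
    "messages": [
        {"role": "user", "content": "How many languages are in the world?"},
    ],
}

print(json.dumps(payload, indent=2))
```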

If you specify a model name that doesn't match any model deployment, you get an error indicating that the model doesn't exist. You can control which models are available to users by creating model deployments, as explained in [Add and configure model deployments](create-model-deployments.md).
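Because routing is purely a name match, a client can guard against typos by checking the requested name against the deployments it expects before sending the request. The helper below is a hypothetical illustration, not part of any SDK:

```python
def resolve_deployment(requested: str, deployments: set[str]) -> str:
    """Return the deployment name if it exists in the resource;
    otherwise raise, mirroring the endpoint's "model doesn't exist"
    error. Purely illustrative client-side validation."""
    if requested not in deployments:
        raise ValueError(f"Model deployment '{requested}' doesn't exist")
    return requested

# Hypothetical set of deployments created in the resource.
available = {"Mistral-large", "Meta-Llama-3.2-8B-Instruct"}

print(resolve_deployment("Mistral-large", available))
```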

## Limitations

* Azure OpenAI Batch can't be used with the Azure AI model inference endpoint. You have to use the dedicated deployment URL, as explained in [Batch API support in the Azure OpenAI documentation](../../../ai-services/openai/how-to/batch.md#api-support).
* The Realtime API isn't supported in the inference endpoint. Use the dedicated deployment URL.

## Next steps

* [Use embedding models](use-embeddings.md)
* [Use chat completion models](use-chat-completions.md)