
Commit e145d27

Merge pull request #2755 from santiagxf/santiagxf-patch-1
Update quickstart-github-models.md
2 parents: 6fa3d4b + 31ba210

8 files changed: +24 −24 lines changed

articles/ai-foundry/model-inference/how-to/inference.md

Lines changed: 9 additions & 1 deletion
@@ -48,6 +48,14 @@ For a chat model, you can create a request as follows:
 
 If you specify a model name that doesn't match any given model deployment, you get an error that the model doesn't exist. You can control which models are available for users by creating model deployments as explained at [add and configure model deployments](create-model-deployments.md).
 
+## Key-less authentication
+
+Models deployed to Azure AI model inference in Azure AI Services support key-less authorization using Microsoft Entra ID. Key-less authorization enhances security, simplifies the user experience, reduces operational complexity, and provides robust compliance support for modern development, making it a strong choice for organizations adopting secure and scalable identity management solutions.
+
+To use key-less authentication, [configure your resource and grant access to users](configure-entra-id.md) to perform inference. Once configured, you can authenticate as follows:
+
+[!INCLUDE [code-create-chat-client-entra](../includes/code-create-chat-client-entra.md)]
+
 ## Limitations
 
 * Azure OpenAI Batch can't be used with the Azure AI model inference endpoint. You have to use the dedicated deployment URL as explained at [Batch API support in Azure OpenAI documentation](../../../ai-services/openai/how-to/batch.md#api-support).
@@ -56,4 +64,4 @@ If you specify a model name that doesn't match any given model deployment, you g
 ## Next steps
 
 * [Use embedding models](use-embeddings.md)
-* [Use chat completion models](use-chat-completions.md)
+* [Use chat completion models](use-chat-completions.md)
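As an illustrative sketch of the key-less flow added in this file (not the literal content of the referenced include; endpoint, deployment name, and scope are placeholders), the `azure-ai-inference` client can authenticate with `DefaultAzureCredential` once Entra ID access is configured:

```python
# Hedged sketch: assumes azure-ai-inference and azure-identity are installed and that
# your identity was granted access as described in configure-entra-id.md.
from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import UserMessage
from azure.identity import DefaultAzureCredential

client = ChatCompletionsClient(
    endpoint="https://<resource>.services.ai.azure.com/models",
    credential=DefaultAzureCredential(),
    # Azure AI Services issues Entra ID tokens for the Cognitive Services scope.
    credential_scopes=["https://cognitiveservices.azure.com/.default"],
)

response = client.complete(
    model="<deployment-name>",  # must match one of your model deployments
    messages=[UserMessage(content="How many languages are in the world?")],
)
print(response.choices[0].message.content)
```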

articles/ai-foundry/model-inference/how-to/quickstart-github-models.md

Lines changed: 2 additions & 1 deletion
@@ -85,6 +85,7 @@ Use the parameter `model="<deployment-name>` to route your request to this deplo
 Azure AI model inference supports additional features not available in GitHub Models, including:
 
 * [Explore the model catalog](https://ai.azure.com/github/models) to see additional models not available in GitHub Models.
+* Configure [key-less authentication](configure-entra-id.md).
 * Configure [content filtering](configure-content-filters.md).
 * Configure rate limiting (for specific models).
 * Explore additional [deployment SKUs (for specific models)](../concepts/deployment-types.md).
@@ -97,4 +98,4 @@ See the [FAQ section](../faq.yml) to explore more help.
 ## Next steps
 
 * [Explore the model catalog](https://ai.azure.com/github/models) in Azure AI studio.
-* [Add more models](create-model-deployments.md) to your endpoint.
+* [Add more models](create-model-deployments.md) to your endpoint.

articles/ai-foundry/model-inference/includes/code-create-chat-client-entra.md

Lines changed: 1 addition & 4 deletions
@@ -12,12 +12,9 @@ author: santiagxf
 Install the package `azure-ai-inference` using your package manager, like pip:
 
 ```bash
-pip install azure-ai-inference>=1.0.0b5
+pip install azure-ai-inference
 ```
 
-> [!WARNING]
-> Azure AI Services resource requires the version `azure-ai-inference>=1.0.0b5` for Python.
-
 Then, you can use the package to consume the model. The following example shows how to create a client to consume chat completions with Entra ID:
 
 ```python

articles/ai-foundry/model-inference/includes/code-create-chat-client.md

Lines changed: 3 additions & 6 deletions
@@ -12,12 +12,9 @@ author: santiagxf
 Install the package `azure-ai-inference` using your package manager, like pip:
 
 ```bash
-pip install azure-ai-inference>=1.0.0b5
+pip install azure-ai-inference
 ```
 
-> [!WARNING]
-> Azure AI Services resource requires the version `azure-ai-inference>=1.0.0b5` for Python.
-
 Then, you can use the package to consume the model. The following example shows how to create a client to consume chat completions:
 
 ```python
@@ -115,7 +112,7 @@ __Request__
 
 ```HTTP/1.1
 POST https://<resource>.services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview
-Authorization: Bearer <bearer-token>
+api-key: <api-key>
 Content-Type: application/json
 ```
----
+---
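For reference, a minimal sketch of the key-based client creation this include describes (placeholders throughout; the include's actual snippet may differ). The key passed to `AzureKeyCredential` is the same value shown as `<api-key>` in the REST example above:

```python
# Hedged sketch: key-based authentication with azure-ai-inference.
from azure.ai.inference import ChatCompletionsClient
from azure.core.credentials import AzureKeyCredential

client = ChatCompletionsClient(
    endpoint="https://<resource>.services.ai.azure.com/models",
    credential=AzureKeyCredential("<api-key>"),  # same key as the api-key header above
)
```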

articles/ai-foundry/model-inference/includes/code-create-chat-completion.md

Lines changed: 2 additions & 2 deletions
@@ -78,7 +78,7 @@ __Request__
 
 ```HTTP/1.1
 POST https://<resource>.services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview
-Authorization: Bearer <bearer-token>
+api-key: <api-key>
 Content-Type: application/json
 ```
 
@@ -98,4 +98,4 @@ Content-Type: application/json
 }
 ```
 
----
+---
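A hedged sketch of calling the updated REST endpoint directly from Python with the `requests` package. The body shape here assumes the standard chat-completions payload of `model` plus `messages`; the include's actual example may differ:

```python
import requests

url = "https://<resource>.services.ai.azure.com/models/chat/completions"
headers = {
    "api-key": "<api-key>",              # key header used by the updated examples
    "Content-Type": "application/json",
}
payload = {
    "model": "<deployment-name>",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "How many languages are in the world?"},
    ],
}

response = requests.post(
    url,
    params={"api-version": "2024-05-01-preview"},
    headers=headers,
    json=payload,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```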

articles/ai-foundry/model-inference/includes/code-create-embeddings-client.md

Lines changed: 3 additions & 6 deletions
@@ -12,12 +12,9 @@ author: santiagxf
 Install the package `azure-ai-inference` using your package manager, like pip:
 
 ```bash
-pip install azure-ai-inference>=1.0.0b5
+pip install azure-ai-inference
 ```
 
-> [!WARNING]
-> Azure AI Services resource requires the version `azure-ai-inference>=1.0.0b5` for Python.
-
 Then, you can use the package to consume the model. The following example shows how to create a client to consume embeddings:
 
 ```python
@@ -132,7 +129,7 @@ __Request__
 
 ```HTTP/1.1
 POST https://<resource>.services.ai.azure.com/models/embeddings?api-version=2024-05-01-preview
-Authorization: Bearer <bearer-token>
+api-key: <api-key>
 Content-Type: application/json
 ```
----
+---
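A minimal sketch of the embeddings counterpart with `azure-ai-inference` (placeholders throughout; the include's actual snippet may differ):

```python
# Hedged sketch: create an embeddings client and request embeddings for a few inputs.
from azure.ai.inference import EmbeddingsClient
from azure.core.credentials import AzureKeyCredential

client = EmbeddingsClient(
    endpoint="https://<resource>.services.ai.azure.com/models",
    credential=AzureKeyCredential("<api-key>"),
)

response = client.embed(
    model="<deployment-name>",
    input=["Hello, world!", "Key-less and key-based auth return the same results."],
)
for item in response.data:
    print(item.index, len(item.embedding))
```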

articles/ai-foundry/model-inference/includes/code-create-embeddings.md

Lines changed: 2 additions & 2 deletions
@@ -54,7 +54,7 @@ __Request__
 
 ```HTTP/1.1
 POST https://<resource>.services.ai.azure.com/models/embeddings?api-version=2024-05-01-preview
-Authorization: Bearer <bearer-token>
+api-key: <api-key>
 Content-Type: application/json
 ```
 
@@ -100,4 +100,4 @@ __Response__
 }
 ```
 
----
+---

articles/ai-foundry/model-inference/includes/code-manage-content-filtering.md

Lines changed: 2 additions & 2 deletions
@@ -122,8 +122,8 @@ try {
 __Request__
 
 ```HTTP/1.1
-POST /chat/completions?api-version=2024-05-01-preview
-Authorization: Bearer <bearer-token>
+POST https://<resource>.services.ai.azure.com/models/chat/completions?api-version=2024-05-01-preview
+api-key: <api-key>
 Content-Type: application/json
 ```
 
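For context on how a filtered request surfaces to callers, a hedged Python sketch. It assumes that content-filter rejections come back as HTTP 400 errors, which is how the service generally reports them; the include's own error-handling code may differ:

```python
# Hedged sketch: handling a request rejected by content filtering.
from azure.ai.inference import ChatCompletionsClient
from azure.ai.inference.models import UserMessage
from azure.core.credentials import AzureKeyCredential
from azure.core.exceptions import HttpResponseError

client = ChatCompletionsClient(
    endpoint="https://<resource>.services.ai.azure.com/models",
    credential=AzureKeyCredential("<api-key>"),
)

try:
    response = client.complete(
        model="<deployment-name>",
        messages=[UserMessage(content="<some prompt>")],
    )
    print(response.choices[0].message.content)
except HttpResponseError as ex:
    if ex.status_code == 400:
        # Content-filter rejections are surfaced as bad requests with an error payload.
        print("Request was rejected:", ex.message)
    else:
        raise
```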
