articles/ai-foundry/agents/how-to/use-your-own-resources.md (25 additions, 3 deletions)
@@ -6,7 +6,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 06/18/2025
+ms.date: 07/23/2025
 author: aahill
 ms.author: aahi
 ms.reviewer: fosteramanda
@@ -19,7 +19,29 @@ Use this article if you want to set up your Foundry project with your own resour
 
 ## Limitations
 
-**Use Azure Cosmos DB for NoSQL to store threads**
+There are some limitations you should be aware of when you plan to use existing resources with the Azure AI Foundry Agent Service.
+
+### If you are using a hub-based project or Azure OpenAI Assistants
+
+At this time, there is no direct upgrade path to migrate existing agents or their associated data assets such as files, threads, or vector stores from a hub-based project to an Azure AI Foundry project. There is also no upgrade path to convert existing Azure OpenAI Assistants into Foundry Agents, nor a way to automatically migrate Assistants' files, threads, or vector stores.
+
+You can reuse your existing model deployments and quota from Azure AI Services or Azure OpenAI resources within a Foundry project.
+
+### SDK usage with hub-based projects
+
+Starting in May 2025, the Azure AI Agent Service uses an endpoint for [Foundry projects](../../what-is-azure-ai-foundry.md#project-types) instead of the connection string that was used for hub-based projects before this time. Connection strings are no longer supported in current versions of the SDKs and REST API. We recommend creating a new Foundry project.
+
+If you want to continue using your hub-based project and connection string, you will need to:
+* Use the connection string for your project, located under **Connection string** in the overview of your project.
+
+    :::image type="content" source="../../media/quickstarts/azure-ai-sdk/connection-string.png" alt-text="A screenshot showing the legacy connection string for a hub-based project.":::
+
+* Use one of the previous versions of the SDK and the associated sample code:
+    * [C#](https://github.com/Azure/azure-sdk-for-net/tree/feature/azure-ai-agents/sdk/ai/Azure.AI.Projects/samples): `1.0.0-beta.2` or earlier
+    * [Python](https://github.com/Azure/azure-sdk-for-python/tree/feature/azure-ai-projects-beta10/sdk/ai/azure-ai-projects/samples/agents): `1.0.0b10` or earlier
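For illustration, a minimal Python sketch of this legacy connection-string pattern, assuming `azure-ai-projects` `1.0.0b10` or earlier and `azure-identity` are installed; the `PROJECT_CONNECTION_STRING` variable name and the agent-listing call are illustrative, not prescribed by the article:

```python
import os

from azure.ai.projects import AIProjectClient
from azure.identity import DefaultAzureCredential

# The connection string is copied from the hub-based project's overview page.
project_client = AIProjectClient.from_connection_string(
    credential=DefaultAzureCredential(),
    conn_str=os.environ["PROJECT_CONNECTION_STRING"],
)

# Sanity check: list existing agents in the hub-based project.
with project_client:
    agents = project_client.agents.list_agents()
    for agent in agents.data:
        print(agent.id, agent.name)
```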
+
+### Azure Cosmos DB for NoSQL to store threads
+
 - Your existing Azure Cosmos DB for NoSQL account used in a [standard setup](#choose-basic-or-standard-agent-setup) must have a total throughput limit of at least 3000 RU/s. Both provisioned throughput and serverless are supported.
 - Three containers will be provisioned in your existing Cosmos DB account, each requiring 1000 RU/s
@@ -70,7 +92,7 @@ Includes everything in the basic setup and fine-grained control over your data b
 
 ## Basic agent setup: Use an existing Azure OpenAI resource
 
-Replace the parameter value for `existingAoaiResourceId` with the full arm resource ID of the Azure OpenAI resource you want to use.
+Replace the parameter value for `existingAoaiResourceId` in the [template](https://github.com/azure-ai-foundry/foundry-samples/tree/main/samples/microsoft/infrastructure-setup/42-basic-agent-setup-with-customization) with the full ARM resource ID of the Azure OpenAI resource you want to use.
 
 1. To get the Azure OpenAI account resource ID, sign in to the Azure CLI and select the subscription with your AI Services account:
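For orientation, a small Python sketch that assembles the ARM resource ID shape the template expects, using placeholder values (substitute your own subscription, resource group, and account name):

```python
# Placeholder values; replace with your own.
subscription_id = "<SUBSCRIPTION>"
resource_group = "<RESOURCE_GROUP>"
account_name = "<RESOURCE_NAME>"

# ARM resource ID format for an Azure OpenAI / AI Services account.
existing_aoai_resource_id = (
    f"/subscriptions/{subscription_id}"
    f"/resourceGroups/{resource_group}"
    f"/providers/Microsoft.CognitiveServices/accounts/{account_name}"
)
print(existing_aoai_resource_id)
```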
 > Starting in May 2025, the Azure AI Agent Service uses an endpoint for [Foundry projects](../../what-is-azure-ai-foundry.md#project-types) instead of the connection string that was previously used for hub-based projects. If you're using a hub-based project, you won't be able to use the current versions of the SDK and REST API. For more information, see [SDK usage with hub-based projects](../how-to/use-your-own-resources.md#sdk-usage-with-hub-based-projects).
 Set this endpoint as an environment variable named `PROJECT_ENDPOINT` in a `.env` file.
+Save your model deployment name as an environment variable named `MODEL_DEPLOYMENT_NAME`.
 
 > [!IMPORTANT]
 > * This quickstart code uses environment variables for sensitive configuration. Never commit your `.env` file to version control; make sure `.env` is listed in your `.gitignore` file.
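A minimal sketch of consuming these variables, assuming the endpoint-based `azure-ai-projects` package, `azure-identity`, and `python-dotenv`; the verification output is illustrative:

```python
import os

from azure.ai.projects import AIProjectClient
from azure.identity import DefaultAzureCredential
from dotenv import load_dotenv

load_dotenv()  # reads PROJECT_ENDPOINT and MODEL_DEPLOYMENT_NAME from .env

project_client = AIProjectClient(
    endpoint=os.environ["PROJECT_ENDPOINT"],
    credential=DefaultAzureCredential(),
)
model_deployment_name = os.environ["MODEL_DEPLOYMENT_NAME"]
print(f"Using model deployment: {model_deployment_name}")
```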
articles/ai-foundry/concepts/rbac-azure-ai-foundry.md (4 additions, 8 deletions)
@@ -270,7 +270,7 @@ Here's a table of the built-in roles and their permissions for the hub:
 | --- | --- |
 | Owner | Full access to the hub, including the ability to manage and create new hubs and assign permissions. This role is automatically assigned to the hub creator|
 | Contributor | User has full access to the hub, including the ability to create new hubs, but isn't able to manage hub permissions on the existing resource. |
-| Azure AI Administrator (preview) | This role is automatically assigned to the system-assigned managed identity for the hub. The Azure AI Administrator role has the minimum permissions needed for the managed identity to perform its tasks. For more information, see [Azure AI Administrator role (preview)](#azure-ai-administrator-role-preview). |
+| Azure AI Administrator | This role is automatically assigned to the system-assigned managed identity for the hub. The Azure AI Administrator role has the minimum permissions needed for the managed identity to perform its tasks. For more information, see [Azure AI Administrator role](#azure-ai-administrator-role). |
 | Azure AI Developer | Perform all actions except create new hubs and manage the hub permissions. For example, users can create projects, compute, and connections. Users can assign permissions within their project. Users can interact with existing Azure AI resources such as Azure OpenAI, Azure AI Search, and Azure AI services. |
 | Azure AI Inference Deployment Operator | Perform all actions required to create a resource deployment within a resource group. |
 | Reader | Read only access to the hub. This role is automatically assigned to all project members within the hub. |
@@ -279,14 +279,10 @@ The key difference between Contributor and Azure AI Developer is the ability to
 
 Only the Owner and Contributor roles allow you to make a hub. At this time, custom roles can't grant you permission to make hubs.
 
-### Azure AI Administrator role (preview)
+### Azure AI Administrator role
 
 Before 11/19/2024, the system-assigned managed identity created for the hub was automatically assigned the __Contributor__ role for the resource group that contains the hub and projects. Hubs created after this date have the system-assigned managed identity assigned to the __Azure AI Administrator__ role. This role is more narrowly scoped to the minimum permissions needed for the managed identity to perform its tasks.
 
-The __Azure AI Administrator__ role is currently in public preview.
 The __Azure AI Administrator__ role has the following permissions:
 
 ```json
@@ -419,7 +415,7 @@ Here's a table of the built-in roles and their permissions for the project:
 | --- | --- |
 | Owner | Full access to the project, including the ability to assign permissions to project users. |
 | Contributor | User has full access to the project but can't assign permissions to project users. |
-| Azure AI Administrator (preview) | This role is automatically assigned to the system-assigned managed identity for the hub. The Azure AI Administrator role has the minimum permissions needed for the managed identity to perform its tasks. For more information, see [Azure AI Administrator role (preview)](#azure-ai-administrator-role-preview). |
+| Azure AI Administrator | This role is automatically assigned to the system-assigned managed identity for the hub. The Azure AI Administrator role has the minimum permissions needed for the managed identity to perform its tasks. For more information, see [Azure AI Administrator role](#azure-ai-administrator-role). |
 | Azure AI Developer | User can perform most actions, including create deployments, but can't assign permissions to project users. |
 | Azure AI Inference Deployment Operator | Perform all actions required to create a resource deployment within a resource group. |
 | Reader | Read only access to the project. |
@@ -767,4 +763,4 @@ If you create a new hub and encounter errors with the new default role assignmen
 - [How to create an Azure AI Foundry project](../how-to/create-projects.md)
 - [How to create a connection in Azure AI Foundry portal](../how-to/connections-add.md)
@@ -194,7 +194,7 @@ When developing with the OpenAI SDK, you can instrument your code so traces are
 
 ## Trace to console
 
-It may be useful to also trace your application and send the traces to the local execution console. Such approach may result beneficial when running unit tests or integration tests in your application using an automated CI/CD pipeline. Traces can be sent to the console and captured by your CI/CD tool to further analysis.
+It may be useful to also trace your application and send the traces to the local execution console. Such an approach may be beneficial when running unit tests or integration tests in your application using an automated CI/CD pipeline. Traces can be sent to the console and captured by your CI/CD tool for further analysis.
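One possible sketch of the console approach, using the OpenTelemetry SDK for Python; the exporter wiring and span name are assumptions rather than code from the article:

```python
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import ConsoleSpanExporter, SimpleSpanProcessor

# Route every span to stdout so a CI/CD tool can capture it from the build log.
provider = TracerProvider()
provider.add_span_processor(SimpleSpanProcessor(ConsoleSpanExporter()))
trace.set_tracer_provider(provider)

tracer = trace.get_tracer(__name__)
with tracer.start_as_current_span("integration-test"):
    print("spans for this block are printed to the console")
```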
articles/ai-foundry/openai/how-to/fine-tune-test.md (2 additions, 2 deletions)
@@ -90,7 +90,7 @@ The following example shows how to use the REST API to create a model deployment
 
 
 ```bash
-curl -X POST "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2025-04-01-preview" \
+curl -X POST "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>?api-version=2025-04-01-preview" \
 -H "Authorization: Bearer <TOKEN>" \
 -H "Content-Type: application/json" \
 -d '{
@@ -203,7 +203,7 @@ To use the [Deployments - Delete REST API](/rest/api/aiservices/accountmanagemen
 Below is the REST API example to delete a deployment:
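For comparison, a Python sketch of the same delete call using the `requests` package; the URL shape and `api-version` mirror the create example above, and every value is a placeholder:

```python
import requests

subscription = "<SUBSCRIPTION>"
resource_group = "<RESOURCE_GROUP>"
resource_name = "<RESOURCE_NAME>"
deployment_name = "<MODEL_DEPLOYMENT_NAME>"
token = "<TOKEN>"  # an Azure access token for the management endpoint

url = (
    "https://management.azure.com"
    f"/subscriptions/{subscription}/resourceGroups/{resource_group}"
    f"/providers/Microsoft.CognitiveServices/accounts/{resource_name}"
    f"/deployments/{deployment_name}?api-version=2025-04-01-preview"
)

response = requests.delete(url, headers={"Authorization": f"Bearer {token}"})
print(response.status_code)
```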
articles/ai-foundry/openai/how-to/fine-tuning-deploy.md (3 additions, 3 deletions)
@@ -197,7 +197,7 @@ The following example shows how to use the REST API to create a model deployment
 
 
 ```bash
-curl -X POST "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-21" \
+curl -X POST "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>?api-version=2024-10-21" \
 -H "Authorization: Bearer <TOKEN>" \
 -H "Content-Type: application/json" \
 -d '{
@@ -231,7 +231,7 @@ The only limitations are that the new region must also support fine-tuning and w
 Below is an example of deploying a model that was fine-tuned in one subscription/region to another.
 
 ```bash
-curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-21" \
+curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>?api-version=2024-10-21" \
 -H "Authorization: Bearer <TOKEN>" \
 -H "Content-Type: application/json" \
 -d '{
@@ -401,7 +401,7 @@ To delete a deployment, use the [Deployments - Delete REST API](/rest/api/aiserv
 Below is the REST API example to delete a deployment:
articles/ai-foundry/openai/how-to/prompt-caching.md (7 additions, 3 deletions)
@@ -6,15 +6,15 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: how-to
-ms.date: 07/23/2025
+ms.date: 07/24/2025
 author: mrbullwinkle
 ms.author: mbullwin
 recommendations: false
 ---
 
 # Prompt caching
 
-Prompt caching allows you to reduce overall request latency and cost for longer prompts that have identical content at the beginning of the prompt. *"Prompt"* in this context is referring to the input you send to the model as part of your chat completions request. Rather than reprocess the same input tokens over and over again, the service is able to retain a temporary cache of processed input token computations to improve overall performance. Prompt caching has no impact on the output content returned in the model response beyond a reduction in latency and cost. For supported models, cached tokens are billed at a [discount on input token pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for Standard deployment types and up to [100% discount on input tokens](/azure/ai-services/openai/concepts/provisioned-throughput) for Provisioned deployment types. If you provide the `user` parameter, it's combined with a prefix hash, allowing you to influence routing and improve cache hit rates. This is especially beneficial when many requests share long, common prefixes.
+Prompt caching allows you to reduce overall request latency and cost for longer prompts that have identical content at the beginning of the prompt. *"Prompt"* in this context is referring to the input you send to the model as part of your chat completions request. Rather than reprocess the same input tokens over and over again, the service is able to retain a temporary cache of processed input token computations to improve overall performance. Prompt caching has no impact on the output content returned in the model response beyond a reduction in latency and cost. For supported models, cached tokens are billed at a [discount on input token pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) for Standard deployment types and up to [100% discount on input tokens](/azure/ai-services/openai/concepts/provisioned-throughput) for Provisioned deployment types.
 
 Caches are typically cleared within 5-10 minutes of inactivity and are always removed within one hour of the cache's last use. Prompt caches aren't shared between Azure subscriptions.
 
@@ -27,13 +27,15 @@ Caches are typically cleared within 5-10 minutes of inactivity and are always re
 
 Official support for prompt caching was first added in API version `2024-10-01-preview`. At this time, only the o-series model family supports the `cached_tokens` API response parameter.
 
-## Get started
+## Getting started
 
 For a request to take advantage of prompt caching the request must be both:
 
 - A minimum of 1,024 tokens in length.
 - The first 1,024 tokens in the prompt must be identical.
 
+Requests are routed based on a hash of the initial prefix of a prompt.
+
 When a match is found between the token computations in a prompt and the current content of the prompt cache, it's referred to as a cache hit. Cache hits will show up as [`cached_tokens`](/azure/ai-services/openai/reference-preview#cached_tokens) under [`prompt_tokens_details`](/azure/ai-services/openai/reference-preview#properties-for-prompt_tokens_details) in the chat completions response.
 
 ```json
@@ -63,6 +65,8 @@ After the first 1,024 tokens cache hits will occur for every 128 additional iden
 
 A single character difference in the first 1,024 tokens will result in a cache miss which is characterized by a `cached_tokens` value of 0. Prompt caching is enabled by default with no additional configuration needed for supported models.
 
+If you provide the [`user`](/azure/ai-foundry/openai/reference-preview-latest#request-body-2) parameter, it's combined with the prefix hash, allowing you to influence routing and improve cache hit rates. This is especially beneficial when many requests share long, common prefixes.
+
 
 ## What is cached?
 
 o1-series models feature support varies by model. For more information, see our dedicated [reasoning models guide](./reasoning.md).
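Bringing the above together, a minimal Python sketch that sends a request with the `user` parameter and reads `cached_tokens` from the response, assuming the `openai` package against an Azure OpenAI deployment; the endpoint, key, deployment, and user values are placeholders:

```python
import os

from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-10-01-preview",
)

response = client.chat.completions.create(
    model="<deployment-name>",
    messages=[
        # A shared prefix of at least 1,024 identical tokens is required for a hit.
        {"role": "system", "content": "<long, stable system prompt>"},
        {"role": "user", "content": "<unique question>"},
    ],
    user="customer-1234",  # combined with the prefix hash to influence routing
)

# On a cache hit, cached_tokens is nonzero under prompt_tokens_details.
details = response.usage.prompt_tokens_details
print("cached tokens:", details.cached_tokens if details else 0)
```

A stable per-caller `user` value keeps requests that share the same long prefix routed together, which is what improves the hit rate.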