articles/api-management/azure-openai-enable-semantic-caching.md
Lines changed: 5 additions & 5 deletions
@@ -22,8 +22,8 @@ Enable semantic caching of responses to Azure OpenAI API requests to reduce band
 ## Prerequisites

-* One or more Azure OpenAI Service APIs must be added to your API Management instance. For more information, see [Add an Azure OpenAI Service API to Azure API Management](azure-openai-api-from-specification.md).
-* The Azure OpenAI service must have deployments for the following:
+* One or more Azure OpenAI in Foundry Models APIs must be added to your API Management instance. For more information, see [Add an Azure OpenAI API to Azure API Management](azure-openai-api-from-specification.md).
+* The Azure OpenAI instance must have deployments for the following:
     * Chat Completion API - Deployment used for API consumer calls
     * Embeddings API - Deployment used for semantic caching
 * The API Management instance must be configured to use managed identity authentication to the Azure OpenAI APIs. For more information, see [Authenticate and authorize access to Azure OpenAI APIs using Azure API Management ](api-management-authenticate-authorize-azure-openai.md#authenticate-with-managed-identity).
@@ -57,17 +57,17 @@ Configure a [backend](backends.md) resource for the embeddings API deployment wi
 * **Name** - A name of your choice, such as `embeddings-backend`. You use this name to reference the backend in policies.
 * **Type** - Select **Custom URL**.
-* **Runtime URL** - The URL of the embeddings API deployment in the Azure OpenAI Service, similar to:
+* **Runtime URL** - The URL of the embeddings API deployment in the Azure OpenAI instance, similar to: