You can import AI model endpoints deployed in Azure AI Foundry to your API Management instance. Use AI gateway policies and other capabilities in API Management to simplify integration, improve observability, and enhance control over the model endpoints.

Learn more about managing AI APIs in API Management:

* [Generative AI gateway capabilities in Azure API Management](genai-gateway-capabilities.md)
## Client compatibility options
API Management supports two client compatibility options for AI APIs. Choose the option that suits your model deployment. Your choice determines how clients call the API and how the API Management instance routes requests to the AI service.
* **Azure AI** - Manage model endpoints in Azure AI Foundry that are exposed through the [Azure AI Model Inference API](/azure/ai-studio/reference/reference-model-inference-api).

    Clients call the deployment at a `/models` endpoint such as `/my-model/models/chat/completions`. The deployment name is passed in the request body. Use this option if your AI service includes models exposed through the Azure AI Model Inference API.
* **Azure OpenAI Service** - Manage model endpoints deployed in Azure OpenAI Service.

    Clients call the deployment at an `/openai` endpoint such as `/openai/deployments/my-deployment/chat/completions`. The deployment name is passed in the request path. Use this option if your AI service only includes Azure OpenAI Service model deployments.
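
To make the difference concrete, here's a minimal Python sketch (not from the official documentation) of how a client might call each option through the gateway. The gateway hostname, base path, deployment name, subscription key, and `api-version` values are placeholders and assumptions; substitute the values from your own instance and service.

```python
import requests

# Placeholder values - substitute those from your API Management instance.
GATEWAY = "https://contoso.azure-api.net"  # hypothetical gateway hostname
BASE_PATH = "my-ai-api"                    # the base path you set on the API
HEADERS = {
    "Ocp-Apim-Subscription-Key": "<your-subscription-key>",
    "Content-Type": "application/json",
}

# Azure AI option: the deployment name travels in the request body.
response = requests.post(
    f"{GATEWAY}/{BASE_PATH}/models/chat/completions",
    headers=HEADERS,
    params={"api-version": "2024-05-01-preview"},  # assumed version
    json={
        "model": "my-deployment",  # hypothetical deployment name
        "messages": [{"role": "user", "content": "Hello!"}],
    },
)
print(response.json())

# Azure OpenAI option: the deployment name travels in the request path.
response = requests.post(
    f"{GATEWAY}/{BASE_PATH}/openai/deployments/my-deployment/chat/completions",
    headers=HEADERS,
    params={"api-version": "2024-06-01"},  # assumed version
    json={"messages": [{"role": "user", "content": "Hello!"}]},
)
print(response.json())
```

In both cases, clients authenticate to the gateway (here with a subscription key) rather than directly to the backend AI service.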
## Prerequisites
- An existing API Management instance. [Create one if you haven't already](get-started-create-service-instance.md).
- An Azure AI service in your subscription with one or more models deployed. Examples include models deployed in Azure AI Foundry or Azure OpenAI Service.
## Import AI Foundry API using the portal
Use the following steps to import an AI API to API Management.

When you import the API, API Management automatically configures:
* Operations for each of the API's REST API endpoints

To import an AI Foundry API to API Management:
1. Select **Next**.
1. On the **Configure API** tab:
    1. Enter a **Display name** and optional **Description** for the API.
    1. In **Base path**, enter a path that your API Management instance uses to access the deployment endpoint.
    1. Optionally select one or more **Products** to associate with the API.
    1. In **Client compatibility**, select either of the following options based on the types of clients you intend to support. See [Client compatibility options](#client-compatibility-options) for more information.
        * **Azure OpenAI** - Select this option if your clients only need to access Azure OpenAI Service model deployments.
        * **Azure AI** - Select this option if your clients need to access other models in Azure AI Foundry.
1. Select **Next**.

    :::image type="content" source="media/azure-ai-foundry-api/client-compatibility.png" alt-text="Screenshot of AI Foundry API configuration in the portal.":::
1. On the **Apply semantic caching** tab, optionally enter settings or accept defaults that define the policies to help optimize performance and reduce latency for the API:
    * [Enable semantic caching of responses](azure-openai-enable-semantic-caching.md)
1. On the **AI content safety** tab, optionally enter settings or accept defaults to configure checks by the Azure AI Content Safety service on API requests:
    * [Enforce content safety checks on LLM requests](llm-content-safety-policy.md)
1. Select **Review**.
1. After settings are validated, select **Create**.
## Test the AI API
To ensure that your AI API is working as expected, test it in the API Management test console.
1. Select the API you created in the previous step.
1. Select the **Test** tab.
1. Select an operation that's compatible with the model deployment.

    The page displays fields for parameters and headers.
97
93
1. Enter parameters and headers as needed. Depending on the operation, you might need to configure or update a **Request body**.
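
For a chat completions operation, the **Request body** is typically a JSON document like the one sketched below (shown as a Python snippet for consistency with the other examples; the deployment name and field values are placeholders).

```python
import json

# A minimal chat completions payload to paste into the Request body field.
# Include "model" for the Azure AI option; for Azure OpenAI, the deployment
# name is already in the request path, so the field can be omitted.
body = {
    "model": "my-deployment",  # hypothetical deployment name
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Say hello in one sentence."},
    ],
    "max_tokens": 64,
}
print(json.dumps(body, indent=2))
```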
You can import self-hosted AI model endpoints to your API Management instance. Use AI gateway policies and other capabilities in API Management to simplify integration, improve observability, and enhance control over the model endpoints.

Learn more about managing AI APIs in API Management:
* [Generative AI gateway capabilities in Azure API Management](genai-gateway-capabilities.md)
22
22
23
23
## Language model API types
API Management supports two types of self-hosted language model APIs. Choose the type that suits your model deployment. Your choice determines how clients call the API and how the API Management instance routes requests to the AI service.
* **OpenAI-compatible** - Self-hosted model endpoints that are compatible with OpenAI's API. Examples include certain models exposed by inference providers such as [Hugging Face Text Generation Inference (TGI)](https://huggingface.co/docs/text-generation-inference/en/index).

    API Management configures an OpenAI-compatible chat completions endpoint (see the sketch after this list).
* **Passthrough** - Other self-hosted model endpoints that aren't compatible with OpenAI's API. Examples include models deployed in [Amazon Bedrock](https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html) or other providers.
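
As an illustration of the OpenAI-compatible option, here's a minimal sketch using the `openai` Python package pointed at the gateway. The base URL, subscription key, and model name are placeholders, and the exact path depends on how you configure the API in API Management.

```python
from openai import OpenAI

# Placeholder values - substitute those from your API Management instance.
client = OpenAI(
    base_url="https://contoso.azure-api.net/my-llm-api",  # hypothetical gateway URL + base path
    api_key="not-used",  # the gateway authenticates with the header below instead
    default_headers={"Ocp-Apim-Subscription-Key": "<your-subscription-key>"},
)

completion = client.chat.completions.create(
    model="my-model",  # model name expected by the self-hosted endpoint
    messages=[{"role": "user", "content": "Hello!"}],
)
print(completion.choices[0].message.content)
```

Pointing the SDK's `base_url` at the gateway keeps client code unchanged while API Management applies policies to the traffic.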
## Import language model API using the portal
Use the following steps to import a language model API to API Management.