Skip to content

Commit 9295093

Browse files
author
gitName
committed
review comments
1 parent 0f5ce8b commit 9295093

File tree

2 files changed

+11
-11
lines changed

2 files changed

+11
-11
lines changed

articles/api-management/openai-compatible-llm-api.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Import a Self-Hosted Language Model API - Azure API Management
3-
description: How to import a self-hosted OpenAI-compatible language model or other AI model as a REST API in Azure API Management.
2+
title: Import OpenAI-Compatible Language Model API - Azure API Management
3+
description: How to import an OpenAI-compatible language model or a non-Azure-provided AI model as a REST API in Azure API Management.
44
ms.service: azure-api-management
55
author: dlepow
66
ms.author: danlep
@@ -10,38 +10,38 @@ ms.collection: ce-skilling-ai-copilot
1010
ms.custom: template-how-to, build-2024
1111
---
1212

13-
# Import a self-hosted language model API
13+
# Import an OpenAI-compatible language model API
1414

1515
[!INCLUDE [api-management-availability-all-tiers](../../includes/api-management-availability-all-tiers.md)]
1616

17-
You can import self-hosted AI model endpoints to your API Management instance as APIs. Use AI gateway policies and other capabilities in API Management to simplify integration, improve observability, and enhance control over the model endpoints.
17+
You can import OpenAI-compatible language model endpoints to your API Management instance as APIs. For example, you might want to manage an LLM that you self-host, or that's hosted on an inference provider other than Azure AI services. Use AI gateway policies and other capabilities in API Management to simplify integration, improve observability, and enhance control over the model endpoints.
1818

1919
Learn more about managing AI APIs in API Management:
2020

2121
* [Generative AI gateway capabilities in Azure API Management](genai-gateway-capabilities.md)
2222

2323
## Language model API types
2424

25-
API Management supports two types of self-hosted language model APIs. Choose the option suitable for your model deployment. The option determines how clients call the API and how the API Management instance routes requests to the AI service.
25+
API Management supports two types of language model APIs for this scenario. Choose the option suitable for your model deployment. The option determines how clients call the API and how the API Management instance routes requests to the AI service.
2626

27-
* **OpenAI-compatible** - Self-hosted model endpoints that are compatible with OpenAI's API. Examples include certain models exposed by inference providers such as [Hugging Face Text Generation Inference (TGI)](https://huggingface.co/docs/text-generation-inference/en/index).
27+
* **OpenAI-compatible** - Language model endpoints that are compatible with OpenAI's API. Examples include certain models exposed by inference providers such as [Hugging Face Text Generation Inference (TGI)](https://huggingface.co/docs/text-generation-inference/en/index).
2828

2929
API Management configures an OpenAI-compatible chat completions endpoint.
3030

31-
* **Passthrough** - Other self-hosted model endpoints that aren't compatible with OpenAI's API. Examples include models deployed in [Amazon Bedrock](https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html) or other providers.
31+
* **Passthrough** - Other language model endpoints that aren't compatible with OpenAI's API. Examples include models deployed in [Amazon Bedrock](https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html) or other providers.
3232

3333
API Management configures wildcard operations for common HTTP verbs. Clients can append paths to the wildcard operations, and API Management passes requests to the backend.
3434

3535
## Prerequisites
3636

3737
- An existing API Management instance. [Create one if you haven't already](get-started-create-service-instance.md).
38-
- A self-hosted (non-Azure-hosted) language model deployment with an API endpoint.
38+
- A self-hosted or non-Azure-provided language model deployment with an API endpoint.
3939

4040

4141
## Import language model API using the portal
4242

4343

44-
To import a self-hosted language model API to API Management:
44+
To import a language model API to API Management:
4545

4646
1. In the [Azure portal](https://portal.azure.com), navigate to your API Management instance.
4747
1. In the left menu, under **APIs**, select **APIs** > **+ Add API**.
@@ -55,7 +55,7 @@ To import a self-hosted language model API to API Management:
5555
1. Optionally select one or more **Products** to associate with the API.
5656
1. In **Path**, append a path that your API Management instance uses to access the LLM API endpoints.
5757
1. In **Type**, select either **Create OpenAI API** or **Create a passthrough API**. See [Language model API types](#language-model-api-types) for more information.
58-
1. In **Access key**, optionally enter the authorization header name and API key used to access the LLM API.
58+
1. In **Access key**, enter the authorization header name and API key used to access the LLM API, if required.
5959
1. Select **Next**.
6060

6161
:::image type="content" source="media/openai-compatible-llm-api/configure-api.png" alt-text="Screenshot of language model API configuration in the portal.":::

includes/api-management-azure-openai-models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ The policy is used with APIs [added to API Management from the Azure OpenAI Serv
1616
|-------|-------------|
1717
| Chat completion | `gpt-3.5`<br/><br/>`gpt-4`<br/><br/>`gpt-4o`<br/><br/>`gpt-4o-mini`<br/><br/>`o1`<br/><br/>`o3` |
1818
| Embeddings | `text-embedding-3-large`<br/><br/> `text-embedding-3-small`<br/><br/>`text-embedding-ada-002` |
19-
| Responses | `gpt-4o` (Versions: `2024-11-20`, `2024-08-06`, `2024-05-13`)<br/><br/>`gpt-4o-mini` (Version: `2024-07-18`)<br/><br/>`gpt-4.1` (Version: `2025-04-14`)<br/><br/>`gpt-4.1-nano` (Version: `2025-04-14`)<br/><br/>`gpt-4.1-mini` (Version: `2025-04-14`)<br/><br/>`gpt-image-1` (Version: `2025-04-15`)<br/><br/>`o3` (Version: `2025-04-16`)<br/><br/>`o4-mini` (Version: `2025-04-16)
19+
| Responses (preview) | `gpt-4o` (Versions: `2024-11-20`, `2024-08-06`, `2024-05-13`)<br/><br/>`gpt-4o-mini` (Version: `2024-07-18`)<br/><br/>`gpt-4.1` (Version: `2025-04-14`)<br/><br/>`gpt-4.1-nano` (Version: `2025-04-14`)<br/><br/>`gpt-4.1-mini` (Version: `2025-04-14`)<br/><br/>`gpt-image-1` (Version: `2025-04-15`)<br/><br/>`o3` (Version: `2025-04-16`)<br/><br/>`o4-mini` (Version: `2025-04-16)
2020

2121

2222
> [!NOTE]

0 commit comments

Comments
 (0)