Skip to content

Commit 7e36816

Browse files
committed
edit
1 parent 77d4af1 commit 7e36816

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

articles/api-management/openai-compatible-llm-api.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ ms.service: azure-api-management
55
author: dlepow
66
ms.author: danlep
77
ms.topic: how-to
8-
ms.date: 05/15/2025
8+
ms.date: 06/04/2025
99
ms.collection: ce-skilling-ai-copilot
1010
ms.custom: template-how-to
1111
---
@@ -26,7 +26,7 @@ API Management supports two types of language model APIs for this scenario. Choo
2626

2727
* **OpenAI-compatible** - Language model endpoints that are compatible with OpenAI's API. Examples include certain models exposed by inference providers such as [Hugging Face Text Generation Inference (TGI)](https://huggingface.co/docs/text-generation-inference/en/index) and [Google Gemini API](https://ai.google.dev/gemini-api/docs).
2828

29-
API Management configures an OpenAI-compatible chat completions endpoint.
29+
For an OpenAI-compatible LLM, API Management configures a chat completions endpoint.
3030

3131
* **Passthrough** - Other language model endpoints that aren't compatible with OpenAI's API. Examples include models deployed in [Amazon Bedrock](https://docs.aws.amazon.com/bedrock/latest/userguide/what-is-bedrock.html) or other providers.
3232

@@ -93,7 +93,7 @@ To ensure that your LLM API is working as expected, test it in the API Managemen
9393

9494
## Example: Google Gemini
9595

96-
You can import OpenAI-compatible models from Google Gemini such as `gemini-2.0-flash`. Azure API Management can manage an OpenAI-compatible chat completion endpoint for these models.
96+
You can import an OpenAI-compatible Google Gemini API to access models such as `gemini-2.0-flash`. For these models, Azure API Management can manage an OpenAI-compatible chat completions endpoint.
9797

9898
To import an OpenAI-compatible Gemini model:
9999

@@ -109,7 +109,7 @@ To import an OpenAI-compatible Gemini model:
109109
1. Enter a **Display name** and optional **Description** for the API.
110110
1. In **URL**, enter the following base URL that you copied previously: `https://generativelanguage.googleapis.com/v1beta/openai`
111111

112-
1. In **Path**, append a path that your API Management instance uses to access the Gemini API endpoints.
112+
1. In **Path**, append a path that your API Management instance uses to route requests to the Gemini API endpoints.
113113
1. In **Type**, select **Create OpenAI API**.
114114
1. In **Access key**, enter the following:
115115
1. **Header name**: *Authorization*.
@@ -119,12 +119,12 @@ To import an OpenAI-compatible Gemini model:
119119

120120
### Test Gemini model
121121

122-
After importing the API, you can test it using the test console in the Azure portal. Choose an OpenAI-compatible model and endpoint for the test.
122+
After importing the API, you can test the chat completions endpoint for the API.
123123

124124
1. Select the API you created in the previous step.
125125
1. Select the **Test** tab.
126126
1. Select the `POST Creates a model response for the given chat conversation` operation, which is a `POST` request to the `/chat/completions` endpoint.
127-
1. In the **Request body** section, enter the following JSON to specify the model and an example prompt. In this example, the OpenAI-compatible `gemini-2.0-flash` model is used.
127+
1. In the **Request body** section, enter the following JSON to specify the model and an example prompt. In this example, the `gemini-2.0-flash` model is used.
128128

129129
```json
130130
{

0 commit comments

Comments
 (0)