sdk/ai/azure-ai-inference/README.md
4 additions & 5 deletions
@@ -11,8 +11,8 @@ Use the Inference client library (in preview) to:
 The Inference client library supports AI models deployed to the following services:

 * [GitHub Models](https://github.com/marketplace/models) - Free-tier endpoint for AI models from different providers
-* Serverless API endpoints and Managed Compute endpoints - AI models from different providers deployed from [Azure AI Studio](https://ai.azure.com). See [Overview: Deploy models, flows, and web apps with Azure AI Studio](https://learn.microsoft.com/azure/ai-studio/concepts/deployments-overview).
-* Azure OpenAI Service - OpenAI models deployed from [Azure OpenAI Studio](https://oai.azure.com/). See [What is Azure OpenAI Service?](https://learn.microsoft.com/azure/ai-services/openai/overview). Although we recommend you use the official [OpenAI client library](https://pypi.org/project/openai/) in your production code for this service, you can use the Azure AI Inference client library to easily compare the performance of OpenAI models to other models, using the same client library and Python code.
+* Serverless API endpoints and Managed Compute endpoints - AI models from different providers deployed from [Azure AI Foundry](https://ai.azure.com). See [Overview: Deploy models, flows, and web apps with Azure AI Foundry](https://learn.microsoft.com/azure/ai-studio/concepts/deployments-overview).
+* Azure OpenAI Service - OpenAI models deployed from [Azure AI Foundry](https://oai.azure.com/). See [What is Azure OpenAI Service?](https://learn.microsoft.com/azure/ai-services/openai/overview). Although we recommend you use the official [OpenAI client library](https://pypi.org/project/openai/) in your production code for this service, you can use the Azure AI Inference client library to easily compare the performance of OpenAI models to other models, using the same client library and Python code.

 The Inference client library makes services calls using REST API version `2024-05-01-preview`, as documented in [Azure AI Model Inference API](https://aka.ms/azureai/modelinference).
@@ -27,18 +27,17 @@ The Inference client library makes services calls using REST API version `2024-0
 ### Prerequisites

 * [Python 3.8](https://www.python.org/) or later installed, including [pip](https://pip.pypa.io/en/stable/).
-Studio.
 * For GitHub models
   * The AI model name, such as "gpt-4o" or "mistral-large"
   * A GitHub personal access token. [Create one here](https://github.com/settings/tokens). You do not need to give any permissions to the token. The token is a string that starts with `github_pat_`.
 * For Serverless API endpoints or Managed Compute endpoints
   * An [Azure subscription](https://azure.microsoft.com/free).
-  * An [AI Model from the catalog](https://ai.azure.com/explore/models) deployed through Azure AI Studio.
+  * An [AI Model from the catalog](https://ai.azure.com/explore/models) deployed through Azure AI Foundry.
   * The endpoint URL of your model, in of the form `https://<your-host-name>.<your-azure-region>.models.ai.azure.com`, where `your-host-name` is your unique model deployment host name and `your-azure-region` is the Azure region where the model is deployed (e.g. `eastus2`).
   * Depending on your authentication preference, you either need an API key to authenticate against the service, or Entra ID credentials. The API key is a 32-character string.
 * For Azure OpenAI (AOAI) service
   * An [Azure subscription](https://azure.microsoft.com/free).
-  * An [OpenAI Model from the catalog](https://oai.azure.com/resource/models) deployed through Azure OpenAI Studio.
+  * An [OpenAI Model from the catalog](https://oai.azure.com/resource/models) deployed through Azure AI Foundry.
   * The endpoint URL of your model, in the form `https://<your-resouce-name>.openai.azure.com/openai/deployments/<your-deployment-name>`, where `your-resource-name` is your globally unique AOAI resource name, and `your-deployment-name` is your AI Model deployment name.
   * Depending on your authentication preference, you either need an API key to authenticate against the service, or Entra ID credentials. The API key is a 32-character string.
   * An api-version. Latest preview or GA version listed in the `Data plane - inference` row in [the API Specs table](https://aka.ms/azsdk/azure-ai-inference/azure-openai-api-versions). At the time of writing, latest GA version was "2024-06-01".
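
The prerequisites in this hunk describe two endpoint URL forms, a `github_pat_` token prefix, and a 32-character API key. As a rough illustration only, the sketch below assembles the two URL forms and applies those shape hints as sanity checks. All helper names are hypothetical and are not part of the Azure AI Inference client library; the checks mirror only what the README states and are not real credential validation.

```python
# Hypothetical helpers; none of these names come from azure-ai-inference.

GITHUB_PAT_PREFIX = "github_pat_"  # prefix stated in the README
API_KEY_LENGTH = 32                # key length stated in the README


def serverless_endpoint(host_name: str, region: str) -> str:
    """Build a Serverless API / Managed Compute endpoint URL in the README's form."""
    return f"https://{host_name}.{region}.models.ai.azure.com"


def aoai_endpoint(resource_name: str, deployment_name: str) -> str:
    """Build an Azure OpenAI deployment endpoint URL in the README's form."""
    return (
        f"https://{resource_name}.openai.azure.com"
        f"/openai/deployments/{deployment_name}"
    )


def looks_like_github_pat(token: str) -> bool:
    """Shape check only: the token starts with the documented prefix."""
    return token.startswith(GITHUB_PAT_PREFIX) and len(token) > len(GITHUB_PAT_PREFIX)


def looks_like_api_key(key: str) -> bool:
    """Shape check only: the key has the documented length."""
    return len(key) == API_KEY_LENGTH


if __name__ == "__main__":
    # Placeholder values chosen for illustration.
    print(serverless_endpoint("my-host", "eastus2"))
    print(aoai_endpoint("my-resource", "my-gpt-4o"))
```

A check like `looks_like_api_key` can catch a key pasted with stray whitespace before any request is sent, but a value that passes is not thereby known to be valid.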
sdk/ai/azure-ai-inference/tests/README.md
2 additions & 2 deletions
@@ -4,13 +4,13 @@ The instructions below are for running tests locally, on a Windows machine, agai

 ## Prerequisites

-The live tests were written against the AI models mentioned below. You will need to deploy these two in [Azure AI Studio](https://ai.azure.com/) and have the endpoint and key for each one of them.
+The live tests were written against the AI models mentioned below. You will need to deploy these two in [Azure AI Foundry](https://ai.azure.com/) and have the endpoint and key for each one of them.

 - `Mistral-Large` for chat completion tests, including tool tests
 - `Cohere-embed-v3-english` for embedding tests
 <!-- - `TBD` for image generation tests -->

-In addition, you will need to deploy a gpt-4o model in the Azure OpenAI Studio, and have the endpoint and key for it:
+In addition, you will need to deploy a gpt-4o model in the Azure AI Foundry, and have the endpoint and key for it:

 - `gpt-4o` on Azure OpenAI (AOAI), for chat completions tests with image input