Skip to content

Commit 9bfab65

Browse files
authored
Merge pull request #280028 from msakande/update-mistral-docs
Update region availability and more in Mistral docs
2 parents a1e017c + 4b36b41 commit 9bfab65

File tree

6 files changed

+64
-46
lines changed

6 files changed

+64
-46
lines changed

articles/ai-studio/how-to/deploy-models-mistral.md

Lines changed: 29 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -18,12 +18,12 @@ ms.custom: references_regions, build-2024
1818
[!INCLUDE [Feature preview](~/reusable-content/ce-skilling/azure/includes/ai-studio/includes/feature-preview.md)]
1919

2020
In this article, you learn how to use Azure AI Studio to deploy the Mistral family of models as serverless APIs with pay-as-you-go token-based billing.
21-
Mistral AI offers two categories of models in [Azure AI Studio](https://ai.azure.com):
21+
Mistral AI offers two categories of models in the [Azure AI Studio](https://ai.azure.com). These models are available in the [model catalog](model-catalog-overview.md):
2222

23-
* __Premium models__: Mistral Large and Mistral Small. These models are available as serverless APIs with pay-as-you-go token-based billing in the AI Studio model catalog.
24-
* __Open models__: Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01. These models are also available in the AI Studio model catalog and can be deployed to managed compute in your own Azure subscription.
23+
* __Premium models__: Mistral Large and Mistral Small. These models can be deployed as serverless APIs with pay-as-you-go token-based billing.
24+
* __Open models__: Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01. These models can be deployed to managed computes in your own Azure subscription.
2525

26-
You can browse the Mistral family of models in the [Model Catalog](model-catalog-overview.md) by filtering on the Mistral collection.
26+
You can browse the Mistral family of models in the model catalog by filtering on the Mistral collection.
2727

2828
## Mistral family of models
2929

@@ -61,10 +61,18 @@ Certain models in the model catalog can be deployed as a serverless API with pay
6161
### Prerequisites
6262

6363
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
64-
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md).
64+
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for eligible models in the Mistral family is only available with hubs created in these regions:
65+
66+
- East US
67+
- East US 2
68+
- North Central US
69+
- South Central US
70+
- West US
71+
- West US 3
72+
- Sweden Central
73+
74+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
6575

66-
> [!IMPORTANT]
67-
> The serverless API model deployment offering for eligible models in the Mistral family is only available in hubs created in the **East US 2** and **Sweden Central** regions.
6876
- An [Azure AI Studio project](../how-to/create-projects.md).
6977
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
7078

@@ -93,7 +101,7 @@ To create a deployment:
93101

94102
:::image type="content" source="../media/deploy-monitor/mistral/mistral-large-deploy-pay-as-you-go.png" alt-text="A screenshot showing how to deploy a model as a serverless API." lightbox="../media/deploy-monitor/mistral/mistral-large-deploy-pay-as-you-go.png":::
95103

96-
1. Select the project in which you want to deploy your model. To deploy the Mistral model, your project must be in the *EastUS2* or *Sweden Central* region.
104+
1. Select the project in which you want to deploy your model. To use the serverless API model deployment offering, your project must belong to one of the regions listed in the [prerequisites](#prerequisites).
97105
1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
98106
1. Select the **Pricing and terms** tab to learn about pricing for the selected model.
99107
1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering. This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
@@ -125,15 +133,15 @@ You can consume Mistral family models by using the chat API.
125133

126134
For more information on using the APIs, see the [reference](#reference-for-mistral-family-of-models-deployed-as-a-service) section.
127135

128-
### Reference for Mistral family of models deployed as a service
136+
## Reference for Mistral family of models deployed as a service
129137

130138
Mistral models accept both the [Azure AI Model Inference API](../reference/reference-model-inference-api.md) on the route `/chat/completions` and the native [Mistral Chat API](#mistral-chat-api) on `/v1/chat/completions`.
131139

132140
### Azure AI Model Inference API
133141

134142
The [Azure AI Model Inference API](../reference/reference-model-inference-api.md) schema can be found in the [reference for Chat Completions](../reference/reference-model-inference-chat-completions.md) article and an [OpenAPI specification can be obtained from the endpoint itself](../reference/reference-model-inference-api.md?tabs=rest#getting-started).
135143

136-
#### Mistral Chat API
144+
### Mistral Chat API
137145

138146
Use the method `POST` to send the request to the `/v1/chat/completions` route:
139147

@@ -168,7 +176,7 @@ The `messages` object has the following fields:
168176
| `role` | `string` | The role of the message's author. One of `system`, `user`, or `assistant`. |
169177

170178

171-
#### Example
179+
#### Request example
172180

173181
__Body__
174182

@@ -234,7 +242,7 @@ The `logprobs` object is a dictionary with the following fields:
234242
| `tokens` | `array` of `string` | Selected tokens. |
235243
| `top_logprobs` | `array` of `dictionary` | Array of dictionary. In each dictionary, the key is the token and the value is the probability. |
236244

237-
#### Example
245+
#### Response example
238246

239247
The following JSON is an example response:
240248

@@ -261,15 +269,16 @@ The following JSON is an example response:
261269
}
262270
}
263271
```
272+
264273
#### More inference examples
265274

266-
| **Sample Type** | **Sample Notebook** |
267-
|----------------|----------------------------------------|
268-
| CLI using CURL and Python web requests | [webrequests.ipynb](https://aka.ms/mistral-large/webrequests-sample)|
269-
| OpenAI SDK (experimental) | [openaisdk.ipynb](https://aka.ms/mistral-large/openaisdk) |
270-
| LangChain | [langchain.ipynb](https://aka.ms/mistral-large/langchain-sample) |
271-
| Mistral AI | [mistralai.ipynb](https://aka.ms/mistral-large/mistralai-sample) |
272-
| LiteLLM | [litellm.ipynb](https://aka.ms/mistral-large/litellm-sample)
275+
| **Sample Type** | **Sample Notebook** |
276+
|----------------------------------------|----------------------------------------------------------------------|
277+
| CLI using CURL and Python web requests | [webrequests.ipynb](https://aka.ms/mistral-large/webrequests-sample) |
278+
| OpenAI SDK (experimental) | [openaisdk.ipynb](https://aka.ms/mistral-large/openaisdk) |
279+
| LangChain | [langchain.ipynb](https://aka.ms/mistral-large/langchain-sample) |
280+
| Mistral AI | [mistralai.ipynb](https://aka.ms/mistral-large/mistralai-sample) |
281+
| LiteLLM | [litellm.ipynb](https://aka.ms/mistral-large/litellm-sample) |
273282

274283
## Cost and quotas
275284

@@ -291,3 +300,4 @@ Models deployed as a serverless API with pay-as-you-go billing are protected by
291300

292301
- [What is Azure AI Studio?](../what-is-ai-studio.md)
293302
- [Azure AI FAQ article](../faq.yml)
303+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

0 commit comments

Comments
 (0)