You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this article, you learn how to use Azure AI Studio to deploy the Mistral family of models as serverless APIs with pay-as-you-go token-based billing.
21
21
Mistral AI offers two categories of models in the [Azure AI Studio](https://ai.azure.com). These models are available in the [model catalog](model-catalog-overview.md):
22
22
23
-
*__Premium models__: Mistral Large, Mistral Large 2407, and Mistral Small.
23
+
*__Premium models__: Mistral Large (2402), Mistral Large (2407), and Mistral Small.
24
24
*__Open models__: Mistral Nemo, Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01.
25
25
26
26
All the premium models and Mistral Nemo (an open model) can be deployed as serverless APIs with pay-as-you-go token-based billing. The other open models can be deployed to managed computes in your own Azure subscription.
@@ -33,17 +33,17 @@ You can browse the Mistral family of models in the model catalog by filtering on
33
33
34
34
Mistral Large is Mistral AI's most advanced Large Language Model (LLM). It can be used on any language-based task, thanks to its state-of-the-art reasoning and knowledge capabilities. There are two variants available for the Mistral Large model version:
35
35
36
-
- Mistral Large
37
-
- Mistral Large 2407
36
+
- Mistral Large (2402)
37
+
- Mistral Large (2407)
38
38
39
-
Additionally, some attributes of _Mistral Large_ include:
39
+
Additionally, some attributes of _Mistral Large (2402)_ include:
40
40
41
41
*__Specialized in RAG.__ Crucial information isn't lost in the middle of long context windows (up to 32-K tokens).
42
42
*__Strong in coding.__ Code generation, review, and comments. Supports all mainstream coding languages.
43
43
*__Multi-lingual by design.__ Best-in-class performance in French, German, Spanish, Italian, and English. Dozens of other languages are supported.
44
44
*__Responsible AI compliant.__ Efficient guardrails baked in the model and extra safety layer with the `safe_mode` option.
45
45
46
-
And attributes of _Mistral Large 2407_ include:
46
+
And attributes of _Mistral Large (2407)_ include:
47
47
48
48
-**Multi-lingual by design.** Supports dozens of languages, including English, French, German, Spanish, and Italian.
49
49
-**Proficient in coding.** Trained on more than 80 coding languages, including Python, Java, C, C++, JavaScript, and Bash. Also trained on more specific languages such as Swift and Fortran.
@@ -83,7 +83,7 @@ Additionally, Mistral Nemo is:
83
83
84
84
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
85
85
86
-
**Mistral Large**, **Mistral Large 2407**, **Mistral Small**, and **Mistral Nemo** can be deployed as a serverless API with pay-as-you-go billing and are offered by Mistral AI through the Microsoft Azure Marketplace. Mistral AI can change or update the terms of use and pricing of these models.
86
+
**Mistral Large (2402)**, **Mistral Large (2407)**, **Mistral Small**, and **Mistral Nemo** can be deployed as a serverless API with pay-as-you-go billing and are offered by Mistral AI through the Microsoft Azure Marketplace. Mistral AI can change or update the terms of use and pricing of these models.
87
87
88
88
### Prerequisites
89
89
@@ -105,13 +105,13 @@ Certain models in the model catalog can be deployed as a serverless API with pay
105
105
106
106
### Create a new deployment
107
107
108
-
The following steps demonstrate the deployment of Mistral Large, but you can use the same steps to deploy Mistral Nemo or any of the premium Mistral models by replacing the model name.
108
+
The following steps demonstrate the deployment of Mistral Large (2402), but you can use the same steps to deploy Mistral Nemo or any of the premium Mistral models by replacing the model name.
109
109
110
110
To create a deployment:
111
111
112
112
1. Sign in to [Azure AI Studio](https://ai.azure.com).
113
113
1. Select **Model catalog** from the left sidebar.
114
-
1. Search for and select **Mistral-large** to open its Details page.
114
+
1. Search for and select the Mistral Large (2402) model to open its Details page.
115
115
116
116
:::image type="content" source="../media/deploy-monitor/mistral/mistral-large-deploy-directly-from-catalog.png" alt-text="A screenshot showing how to access the model details page by going through the model catalog." lightbox="../media/deploy-monitor/mistral/mistral-large-deploy-directly-from-catalog.png":::
117
117
@@ -120,7 +120,7 @@ To create a deployment:
120
120
121
121
1. From the left sidebar of your project, select **Components** > **Deployments**.
122
122
1. Select **+ Create deployment**.
123
-
1. Search for and select **Mistral-large**. to open the Model's Details page.
123
+
1. Search for and select the Mistral Large (2402) model to open the Model's Details page.
124
124
125
125
:::image type="content" source="../media/deploy-monitor/mistral/mistral-large-deploy-starting-from-project.png" alt-text="A screenshot showing how to access the model details page by going through the Deployments page in your project." lightbox="../media/deploy-monitor/mistral/mistral-large-deploy-starting-from-project.png":::
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-deploy-models-mistral.md
+10-10Lines changed: 10 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,7 +19,7 @@ In this article, you learn how to use Azure Machine Learning studio to deploy th
19
19
20
20
Mistral AI offers two categories of models in Azure Machine Learning studio. These models are available in the [model catalog](concept-model-catalog.md).
21
21
22
-
*__Premium models__: Mistral Large, Mistral Large 2407, and Mistral Small.
22
+
*__Premium models__: Mistral Large (2402), Mistral Large (2407), and Mistral Small.
23
23
*__Open models__: Mistral Nemo, Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01.
24
24
25
25
All the premium models and Mistral Nemo (an open model) can be deployed as serverless APIs with pay-as-you-go token-based billing. The other open models can be deployed to managed computes in your own Azure subscription.
@@ -32,17 +32,17 @@ You can browse the Mistral family of models in the model catalog by filtering on
32
32
33
33
Mistral Large is Mistral AI's most advanced Large Language Model (LLM). It can be used on any language-based task, thanks to its state-of-the-art reasoning and knowledge capabilities. There are two variants available for the Mistral Large model version:
34
34
35
-
- Mistral Large
36
-
- Mistral Large 2407
35
+
- Mistral Large (2402)
36
+
- Mistral Large (2407)
37
37
38
-
Additionally, some attributes of _Mistral Large_ include:
38
+
Additionally, some attributes of _Mistral Large (2402)_ include:
39
39
40
40
-__Specialized in RAG.__ Crucial information isn't lost in the middle of long context windows (up to 32 K tokens).
41
41
-__Strong in coding.__ Code generation, review, and comments. Supports all mainstream coding languages.
42
42
-__Multi-lingual by design.__ Best-in-class performance in French, German, Spanish, and Italian - in addition to English. Dozens of other languages are supported.
43
43
-__Responsible AI compliant.__ Efficient guardrails baked in the model, and extra safety layer with the `safe_mode` option.
44
44
45
-
And attributes of _Mistral Large 2407_ include:
45
+
And attributes of _Mistral Large (2407)_ include:
46
46
47
47
-**Multi-lingual by design.** Supports dozens of languages, including English, French, German, Spanish, and Italian.
48
48
-**Proficient in coding.** Trained on more than 80 coding languages, including Python, Java, C, C++, JavaScript, and Bash. Also trained on more specific languages such as Swift and Fortran.
@@ -84,7 +84,7 @@ Additionally, Mistral Nemo is:
84
84
85
85
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need. This deployment option doesn't require quota from your subscription.
86
86
87
-
**Mistral Large**, **Mistral Large 2407**, **Mistral Small**, and **Mistral Nemo** can be deployed as a serverless API with pay-as-you-go billing and are offered by Mistral AI through the Microsoft Azure Marketplace. Mistral AI can change or update the terms of use and pricing of these models.
87
+
**Mistral Large (2402)**, **Mistral Large (2407)**, **Mistral Small**, and **Mistral Nemo** can be deployed as a serverless API with pay-as-you-go billing and are offered by Mistral AI through the Microsoft Azure Marketplace. Mistral AI can change or update the terms of use and pricing of these models.
88
88
89
89
90
90
### Prerequisites
@@ -106,13 +106,13 @@ Certain models in the model catalog can be deployed as a serverless API with pay
106
106
107
107
### Create a new deployment
108
108
109
-
The following steps demonstrate the deployment of Mistral Large, but you can use the same steps to deploy Mistral Nemo or any of the premium Mistral models by replacing the model name.
109
+
The following steps demonstrate the deployment of Mistral Large (2402), but you can use the same steps to deploy Mistral Nemo or any of the premium Mistral models by replacing the model name.
110
110
111
111
To create a deployment:
112
112
113
113
1. Go to [Azure Machine Learning studio](https://ml.azure.com/home).
114
114
1. Select the workspace in which you want to deploy your model. To use the serverless API model deployment offering, your workspace must belong to one of the regions listed in the [prerequisites](#prerequisites).
115
-
1. Choose the model you want to deploy, for example Mistral-large, from the [model catalog](https://ml.azure.com/model/catalog).
115
+
1. Choose the model you want to deploy, for example the Mistral Large (2402) model, from the [model catalog](https://ml.azure.com/model/catalog).
116
116
117
117
Alternatively, you can initiate deployment by going to your workspace and selecting **Endpoints** > **Serverless endpoints** > **Create**.
118
118
@@ -123,7 +123,7 @@ To create a deployment:
123
123
124
124
1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
125
125
1. You can also select the **Pricing and terms** tab to learn about pricing for the selected model.
126
-
1. If this is your first time deploying the model in the workspace, you have to subscribe your workspace for the particular offering (for example, Mistral-large). This step requires that your account has the **Azure AI Developer role** permissions on the Resource Group, as listed in the prerequisites. Each workspace has its own subscription to the particular Azure Marketplace offering, which allows you to control and monitor spending. Select **Subscribe and Deploy**. Currently you can have only one deployment for each model within a workspace.
126
+
1. If this is your first time deploying the model in the workspace, you have to subscribe your workspace for the particular offering (for example, Mistral Large (2402)). This step requires that your account has the **Azure AI Developer role** permissions on the Resource Group, as listed in the prerequisites. Each workspace has its own subscription to the particular Azure Marketplace offering, which allows you to control and monitor spending. Select **Subscribe and Deploy**. Currently you can have only one deployment for each model within a workspace.
127
127
128
128
1. Once you subscribe the workspace for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ workspace don't require subscribing again. If this scenario applies to you, you'll see a **Continue to deploy** option to select.
129
129
@@ -142,7 +142,7 @@ To learn about billing for Mistral models deployed as a serverless API with pay-
142
142
143
143
### Consume the Mistral family of models as a service
144
144
145
-
You can consume Mistral Large by using the chat API.
145
+
You can consume Mistral Large models by using the chat API.
146
146
147
147
1. In the **workspace**, select **Endpoints** > **Serverless endpoints**.
0 commit comments