Skip to content

Commit cbe66ca

Browse files
committed
more Mistral updates
1 parent 9b9b78f commit cbe66ca

File tree

3 files changed

+22
-22
lines changed

3 files changed

+22
-22
lines changed

articles/ai-studio/how-to/deploy-models-mistral.md

Lines changed: 19 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: How to use Mistral premium chat models with Azure AI Studio
2+
title: How to use Mistral premium chat models with Azure AI Foundry
33
titleSuffix: Azure AI Foundry
4-
description: Learn how to use Mistral premium chat models with Azure AI Studio.
4+
description: Learn how to use Mistral premium chat models with Azure AI Foundry.
55
ms.service: azure-ai-studio
66
manager: scottpolly
77
ms.topic: how-to
@@ -33,7 +33,7 @@ The Mistral premium chat models include the following models:
3333

3434
Mistral Large models are Mistral AI's most advanced Large Language Models (LLM). They can be used on any language-based task, thanks to their state-of-the-art reasoning, knowledge, and coding capabilities. Several Mistral Large model variants are available, and their attributes are as follows.
3535

36-
Attributes of **Mistral Large** include:
36+
Attributes of **Mistral Large (2402)**, also abbreviated as Mistral Large, include:
3737

3838
* **Specialized in RAG**. Crucial information isn't lost in the middle of long context windows (up to 32-K tokens).
3939
* **Strong in coding**. Code generation, review, and comments. Supports all mainstream coding languages.
@@ -99,15 +99,15 @@ The following models are available:
9999
100100
## Prerequisites
101101

102-
To use Mistral premium chat models with Azure AI Studio, you need the following prerequisites:
102+
To use Mistral premium chat models with Azure AI Foundry, you need the following prerequisites:
103103

104104
### A model deployment
105105

106106
**Deployment to serverless APIs**
107107

108108
Mistral premium chat models can be deployed to serverless API endpoints with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need.
109109

110-
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Studio, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
110+
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Foundry portal, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
111111

112112
> [!div class="nextstepaction"]
113113
> [Deploy the model to serverless API endpoints](deploy-models-serverless.md)
@@ -133,7 +133,7 @@ Read more about the [Azure AI inference package and reference](https://aka.ms/az
133133
In this section, you use the [Azure AI model inference API](https://aka.ms/azureai/modelinference) with a chat completions model for chat.
134134

135135
> [!TIP]
136-
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Studio with the same code and structure, including Mistral premium chat models.
136+
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Foundry with the same code and structure, including Mistral premium chat models.
137137
138138
### Create a client to consume the model
139139

@@ -523,7 +523,7 @@ The Mistral premium chat models include the following models:
523523

524524
Mistral Large models are Mistral AI's most advanced Large Language Models (LLM). They can be used on any language-based task, thanks to their state-of-the-art reasoning, knowledge, and coding capabilities. Several Mistral Large model variants are available, and their attributes are as follows.
525525

526-
Attributes of **Mistral Large** include:
526+
Attributes of **Mistral Large (2402)**, also abbreviated as Mistral Large, include:
527527

528528
* **Specialized in RAG**. Crucial information isn't lost in the middle of long context windows (up to 32-K tokens).
529529
* **Strong in coding**. Code generation, review, and comments. Supports all mainstream coding languages.
@@ -589,15 +589,15 @@ The following models are available:
589589
590590
## Prerequisites
591591

592-
To use Mistral premium chat models with Azure AI Studio, you need the following prerequisites:
592+
To use Mistral premium chat models with Azure AI Foundry, you need the following prerequisites:
593593

594594
### A model deployment
595595

596596
**Deployment to serverless APIs**
597597

598598
Mistral premium chat models can be deployed to serverless API endpoints with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need.
599599

600-
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Studio, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
600+
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Foundry portal, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
601601

602602
> [!div class="nextstepaction"]
603603
> [Deploy the model to serverless API endpoints](deploy-models-serverless.md)
@@ -621,7 +621,7 @@ npm install @azure-rest/ai-inference
621621
In this section, you use the [Azure AI model inference API](https://aka.ms/azureai/modelinference) with a chat completions model for chat.
622622

623623
> [!TIP]
624-
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Studio with the same code and structure, including Mistral premium chat models.
624+
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Foundry with the same code and structure, including Mistral premium chat models.
625625
626626
### Create a client to consume the model
627627

@@ -1032,7 +1032,7 @@ The Mistral premium chat models include the following models:
10321032
10331033
Mistral Large models are Mistral AI's most advanced Large Language Models (LLM). They can be used on any language-based task, thanks to their state-of-the-art reasoning, knowledge, and coding capabilities. Several Mistral Large model variants are available, and their attributes are as follows.
10341034
1035-
Attributes of **Mistral Large** include:
1035+
Attributes of **Mistral Large (2402)**, also abbreviated as Mistral Large, include:
10361036
10371037
* **Specialized in RAG**. Crucial information isn't lost in the middle of long context windows (up to 32-K tokens).
10381038
* **Strong in coding**. Code generation, review, and comments. Supports all mainstream coding languages.
@@ -1098,15 +1098,15 @@ The following models are available:
10981098
10991099
## Prerequisites
11001100
1101-
To use Mistral premium chat models with Azure AI Studio, you need the following prerequisites:
1101+
To use Mistral premium chat models with Azure AI Foundry, you need the following prerequisites:
11021102
11031103
### A model deployment
11041104
11051105
**Deployment to serverless APIs**
11061106
11071107
Mistral premium chat models can be deployed to serverless API endpoints with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need.
11081108
1109-
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Studio, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
1109+
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Foundry portal, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
11101110
11111111
> [!div class="nextstepaction"]
11121112
> [Deploy the model to serverless API endpoints](deploy-models-serverless.md)
@@ -1153,7 +1153,7 @@ using System.Reflection;
11531153
In this section, you use the [Azure AI model inference API](https://aka.ms/azureai/modelinference) with a chat completions model for chat.
11541154

11551155
> [!TIP]
1156-
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Studio with the same code and structure, including Mistral premium chat models.
1156+
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Foundry with the same code and structure, including Mistral premium chat models.
11571157

11581158
### Create a client to consume the model
11591159

@@ -1563,7 +1563,7 @@ The Mistral premium chat models include the following models:
15631563
15641564
Mistral Large models are Mistral AI's most advanced Large Language Models (LLM). They can be used on any language-based task, thanks to their state-of-the-art reasoning, knowledge, and coding capabilities. Several Mistral Large model variants are available, and their attributes are as follows.
15651565
1566-
Attributes of **Mistral Large** include:
1566+
Attributes of **Mistral Large (2402)**, also abbreviated as Mistral Large, include:
15671567
15681568
* **Specialized in RAG**. Crucial information isn't lost in the middle of long context windows (up to 32-K tokens).
15691569
* **Strong in coding**. Code generation, review, and comments. Supports all mainstream coding languages.
@@ -1629,15 +1629,15 @@ The following models are available:
16291629
16301630
## Prerequisites
16311631
1632-
To use Mistral premium chat models with Azure AI Studio, you need the following prerequisites:
1632+
To use Mistral premium chat models with Azure AI Foundry, you need the following prerequisites:
16331633
16341634
### A model deployment
16351635
16361636
**Deployment to serverless APIs**
16371637
16381638
Mistral premium chat models can be deployed to serverless API endpoints with pay-as-you-go billing. This kind of deployment provides a way to consume models as an API without hosting them on your subscription, while keeping the enterprise security and compliance that organizations need.
16391639
1640-
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Studio, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
1640+
Deployment to a serverless API endpoint doesn't require quota from your subscription. If your model isn't deployed already, use the Azure AI Foundry portal, Azure Machine Learning SDK for Python, the Azure CLI, or ARM templates to [deploy the model as a serverless API](deploy-models-serverless.md).
16411641
16421642
> [!div class="nextstepaction"]
16431643
> [Deploy the model to serverless API endpoints](deploy-models-serverless.md)
@@ -1654,7 +1654,7 @@ Models deployed with the [Azure AI model inference API](https://aka.ms/azureai/m
16541654
In this section, you use the [Azure AI model inference API](https://aka.ms/azureai/modelinference) with a chat completions model for chat.
16551655
16561656
> [!TIP]
1657-
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Studio with the same code and structure, including Mistral premium chat models.
1657+
> The [Azure AI model inference API](https://aka.ms/azureai/modelinference) allows you to talk with most models deployed in Azure AI Foundry with the same code and structure, including Mistral premium chat models.
16581658
16591659
### Create a client to consume the model
16601660
@@ -2229,7 +2229,7 @@ For more examples of how to use Mistral models, see the following examples and t
22292229

22302230
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
22312231
2232-
Mistral models deployed as a serverless API are offered by MistralAI through the Azure Marketplace and integrated with Azure AI Studio for use. You can find the Azure Marketplace pricing when deploying the model.
2232+
Mistral models deployed as a serverless API are offered by MistralAI through the Azure Marketplace and integrated with Azure AI Foundry for use. You can find the Azure Marketplace pricing when deploying the model.
22332233
22342234
Each time a project subscribes to a given offer from the Azure Marketplace, a new resource is created to track the costs associated with its consumption. The same resource is used to track costs associated with inference; however, multiple meters are available to track each scenario independently.
22352235

articles/ai-studio/how-to/model-catalog-overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,7 @@ The following list contains Serverless API models. For Azure OpenAI models, see
7777
Model | Managed compute | Serverless API (pay-per-token)
7878
--|--|--
7979
Llama family models | Llama-3.2-3B-Instruct<BR> Llama-3.2-1B-Instruct<BR> Llama-3.2-1B<BR> Llama-3.2-90B-Vision-Instruct<BR> Llama-3.2-11B-Vision-Instruct<BR> Llama-3.1-8B-Instruct<BR> Llama-3.1-8B<BR> Llama-3.1-70B-Instruct<BR> Llama-3.1-70B<BR> Llama-3-8B-Instruct<BR> Llama-3-70B<BR> Llama-3-8B<BR> Llama-Guard-3-1B<BR> Llama-Guard-3-8B<BR> Llama-Guard-3-11B-Vision<BR> Llama-2-7b<BR> Llama-2-70b<BR> Llama-2-7b-chat<BR> Llama-2-13b-chat<BR> CodeLlama-7b-hf<BR> CodeLlama-7b-Instruct-hf<BR> CodeLlama-34b-hf<BR> CodeLlama-34b-Python-hf<BR> CodeLlama-34b-Instruct-hf<BR> CodeLlama-13b-Instruct-hf<BR> CodeLlama-13b-Python-hf<BR> Prompt-Guard-86M<BR> CodeLlama-70b-hf<BR> | Llama-3.2-90B-Vision-Instruct<br> Llama-3.2-11B-Vision-Instruct<br> Llama-3.1-8B-Instruct<br> Llama-3.1-70B-Instruct<br> Llama-3.1-405B-Instruct<br> Llama-3-8B-Instruct<br> Llama-3-70B-Instruct<br> Llama-2-7b<br> Llama-2-7b-chat<br> Llama-2-70b<br> Llama-2-70b-chat<br> Llama-2-13b<br> Llama-2-13b-chat<br>
80-
Mistral family models | mistralai-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x22B-Instruct-v0-1 <br> mistral-community-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x7B-v01 <br> mistralai-Mistral-7B-Instruct-v0-2 <br> mistralai-Mistral-7B-v01 <br> mistralai-Mixtral-8x7B-Instruct-v01 <br> mistralai-Mistral-7B-Instruct-v01 | Mistral-large (2402) <br> Mistral-large (2407) <br> Mistral-small <br> Ministral-3B <br> Mistral-NeMo
80+
Mistral family models | mistralai-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x22B-Instruct-v0-1 <br> mistral-community-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x7B-v01 <br> mistralai-Mistral-7B-Instruct-v0-2 <br> mistralai-Mistral-7B-v01 <br> mistralai-Mixtral-8x7B-Instruct-v01 <br> mistralai-Mistral-7B-Instruct-v01 | Mistral-large (2402) <br> Mistral-large (2407) <br> Mistral-large (2411) <br> Mistral-small <br> Ministral-3B <br> Mistral-NeMo
8181
Cohere family models | Not available | Cohere-command-r-plus-08-2024 <br> Cohere-command-r-08-2024 <br> Cohere-command-r-plus <br> Cohere-command-r <br> Cohere-embed-v3-english <br> Cohere-embed-v3-multilingual <br> Cohere-rerank-v3-english <br> Cohere-rerank-v3-multilingual
8282
JAIS | Not available | jais-30b-chat
8383
AI21 family models | Not available | Jamba-1.5-Mini <br> Jamba-1.5-Large

articles/ai-studio/includes/region-availability-maas.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -60,8 +60,8 @@ Phi-3-Medium-4K-Instruct <br> Phi-3-Medium-128K-Instruct | Not applicable | E
6060
Mistral Nemo | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
6161
Ministral-3B | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
6262
Mistral Small | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
63-
Mistral Large (2402) | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
64-
Mistral-Large (2407) | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
63+
Mistral Large (2402) <br> Mistral-Large (2407) <br> Mistral-Large (2411) | [Microsoft Managed Countries](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) <br> Brazil <br> Hong Kong <br> Israel | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
64+
6565

6666

6767
### Nixtla models

0 commit comments

Comments
 (0)