articles/ai-studio/concepts/model-lifecycle-retirement.md
3 additions & 2 deletions
@@ -3,9 +3,9 @@ title: Deprecation and retirement for models in Azure AI model catalog
 titleSuffix: Azure AI Foundry
 description: Learn about the lifecycle stages, deprecation, and retirement for models in the Azure AI model catalog.
 manager: scottpolly
-ms.service: azure-ai-studio
+ms.service: azure-ai-foundry
 ms.topic: concept-article
-ms.date: 1/14/2025
+ms.date: 02/03/2025
 ms.author: mopeakande
 author: msakande
 ms.reviewer: kritifaujdar
@@ -67,6 +67,7 @@ The following table lists the timelines for models that are on track for retirem
 
 | Model provider | Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
 | ---- | ---- | ---- | --- | ---- | --- |
+| AI21 Labs | Jamba Instruct | February 1, 2025 | February 1, 2025 | March 1, 2025 |[AI21-Jamba-1.5-Large](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Large/version/1/registry/azureml-ai21) or [AI21-Jamba-1.5-Mini](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Mini/version/1/registry/azureml-staging)|
 | Mistral AI |[Mistral-large-2407](https://aka.ms/azureai/landing/Mistral-Large-2407)| January 13, 2025 | February 13, 2025 | May 13, 2025 |[Mistral-large-2411](https://aka.ms/aistudio/landing/Mistral-Large-2411)|
 | Mistral AI |[Mistral-large](https://aka.ms/azureai/landing/Mistral-Large)| December 15, 2024 | January 15, 2025 | April 15, 2025 |[Mistral-large-2407](https://aka.ms/azureai/landing/Mistral-Large-2407)|
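Reviewer note: the three date columns in the table above can be read as thresholds on a timeline. The sketch below is only an illustration of that reading, not something this change adds. It assumes each stage begins on the listed UTC date, and the `Timeline` type, the `stage_on` helper, and the `"live"` label for dates before the legacy date are hypothetical names chosen for the example.

```python
from datetime import date
from typing import NamedTuple


class Timeline(NamedTuple):
    """Legacy, deprecation, and retirement dates (UTC) for one model."""
    legacy: date
    deprecation: date
    retirement: date


# Dates copied from the retirement table in this diff.
TIMELINES = {
    "Jamba Instruct": Timeline(date(2025, 2, 1), date(2025, 2, 1), date(2025, 3, 1)),
    "Mistral-large-2407": Timeline(date(2025, 1, 13), date(2025, 2, 13), date(2025, 5, 13)),
    "Mistral-large": Timeline(date(2024, 12, 15), date(2025, 1, 15), date(2025, 4, 15)),
}


def stage_on(model: str, day: date) -> str:
    """Return the assumed lifecycle stage of `model` on `day`.

    Assumption: a stage starts on its listed date, and the latest
    stage reached wins when two dates coincide.
    """
    t = TIMELINES[model]
    if day >= t.retirement:
        return "retired"
    if day >= t.deprecation:
        return "deprecated"
    if day >= t.legacy:
        return "legacy"
    return "live"  # hypothetical label for "before the legacy date"
```

Under these assumptions, Jamba Instruct never reports a separate legacy stage because its legacy and deprecation dates are the same day.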
-The [AI21-Jamba 1.5 Large model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5-large) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-large). AI21 can change or update the terms of use and pricing of this model.
+The [AI21-Jamba 1.5 Large model](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Large/version/1/registry/azureml-ai21) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-large). AI21 can change or update the terms of use and pricing of this model.
 
-To get started with Jamba 1.5 large deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-1.5-large-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-1.5-large-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-1.5-large-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-1.5-large-azure-api-sample).
+To get started with Jamba 1.5 large deployed as a serverless API, explore our integrations with [LangChain](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/langchain.ipynb), [LiteLLM](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/litellm.ipynb), [OpenAI](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/openaisdk.ipynb) and the [Azure API](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/webrequests.ipynb).
 
 
 # [AI21 Jamba 1.5 Mini](#tab/ai21-jamba-1-5)
 
-The [AI21 Jamba 1.5 Mini model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5-mini) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-mini). AI21 can change or update the terms of use and pricing of this model.
+The [AI21 Jamba 1.5 Mini model](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Mini/version/1/registry/azureml-staging) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-mini). AI21 can change or update the terms of use and pricing of this model.
 
-To get started with Jamba 1.5 mini deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-1.5-mini-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-1.5-mini-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-1.5-mini-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-1.5-mini-azure-api-sample).
+To get started with Jamba 1.5 mini deployed as a serverless API, explore our integrations with [LangChain](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/langchain.ipynb), [LiteLLM](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/litellm.ipynb), [OpenAI](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/openaisdk.ipynb) and the [Azure API](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/ai21-labs/jamba-1-5/webrequests.ipynb).
 
 ---
 
 ### Prerequisites
 
 - An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
-- An [Azure AI Foundry project](../how-to/create-projects.md). The serverless API model deployment offering for Jamba family models is only available with projects created in these regions:
-
-* East US
-* East US 2
-* North Central US
-* South Central US
-* West US
-* West US 3
-* Sweden Central
+- An [Azure AI Foundry project](../how-to/create-projects.md). The serverless API model deployment offering for Jamba family models is only available with projects created in specific regions. For a list of these regions, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md#ai21-models).
 
-For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
-
 - Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
 
- On the Azure subscription—to subscribe the Azure AI Foundry project to the Azure Marketplace offering, once for each project, per offering:
@@ -85,15 +75,15 @@ These steps demonstrate the deployment of `AI21 Jamba 1.5 Large` or `AI21 Jamba
-4. Search for and select an AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini`or `AI21 Jamba Instruct`to open its Details page.
+4. Search for and select an AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini` to open its Details page.
 
 1. Select **Deploy** to open a serverless API deployment window for the model.
 
 1. Alternatively, you can initiate a deployment by starting from the **Models + endpoints** page in Azure AI Foundry portal.
 
 1. From the left navigation pane of your project, select **My assets** > **Models + endpoints**.
 1. Select **+ Deploy model** > **Deploy base model**.
-1. Search for and select an AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini`or `AI21 Jamba Instruct`to open the Model's Details page.
+1. Search for and select an AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini` to open the Model's Details page.
 1. Select **Confirm** to open a serverless API deployment window for the model.
 
 1. Your current project is specified for the deployment. To successfully deploy the AI21-Jamba family models, your project must be in one of the regions listed in the [Prerequisites](#prerequisites) section.
@@ -114,7 +104,7 @@ These steps demonstrate the deployment of `AI21 Jamba 1.5 Large` or `AI21 Jamba
 
 1.[!INCLUDE [Find your deployment details](../includes/find-deployments.md)]
 
-To learn about billing for the AI21-Jamba family models deployed as a serverless API with pay-as-you-go token-based billing, see [Cost and quota considerations for Jamba Instruct deployed as a serverless API](#cost-and-quota-considerations-for-jamba-family-models-deployed-as-a-serverless-api).
+To learn about billing for the AI21-Jamba family models deployed as a serverless API with pay-as-you-go token-based billing, see [Cost and quota considerations for Jamba models deployed as a serverless API](#cost-and-quota-considerations-for-jamba-family-models-deployed-as-a-serverless-api).
 
 
 ### Consume Jamba family models as a serverless API
@@ -185,7 +175,7 @@ Payload is a JSON formatted string containing the following parameters:
-|`model`|`string`| Y | Must be `jamba-1.5-large` or `jamba-1.5-mini` or `jamba-instruct`|
+|`model`|`string`| Y | Must be `jamba-1.5-large` or `jamba-1.5-mini`|
 |`messages`|`list[object]`| Y | A list of objects, one per message, from oldest to newest. The oldest message can be role `system`. All later messages must alternate between user and assistant roles. See the message object definition below.|
 |`max_tokens`|`integer`| N <br>`4096`| 0 – 4096 | The maximum number of tokens to allow for each generated response message. Typically the best way to limit output length is by providing a length limit in the system prompt (for example, "limit your answers to three sentences")|
 |`temperature`|`float`| N <br>`1`| 0.0 – 2.0 | How much variation to provide in each answer. Setting this value to 0 guarantees the same response to the same question every time. Setting a higher value encourages more variation. Modifies the distribution from which tokens are sampled. We recommend altering this or `top_p`, but not both. |
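Reviewer note: with `jamba-instruct` removed from the allowed `model` values, a client-side check can catch stale model names before a request is sent. The following is a minimal sketch that covers only the four parameters visible in this hunk and takes their ranges and defaults from the rows above. The `validate_payload` helper and its error strings are hypothetical, not part of any Azure or AI21 SDK, and the service may enforce additional rules.

```python
from typing import Any

# Allowed values after this change; "jamba-instruct" is no longer accepted.
ALLOWED_MODELS = {"jamba-1.5-large", "jamba-1.5-mini"}


def validate_payload(payload: dict[str, Any]) -> list[str]:
    """Return a list of problems found in a chat payload (empty if none)."""
    errors: list[str] = []

    model = payload.get("model")
    if not isinstance(model, str) or model not in ALLOWED_MODELS:
        errors.append("model must be one of: " + ", ".join(sorted(ALLOWED_MODELS)))

    messages = payload.get("messages")
    if not isinstance(messages, list) or not messages:
        errors.append("messages must be a non-empty list")
    else:
        roles = [m.get("role") if isinstance(m, dict) else None for m in messages]
        # Only the oldest message may have the system role.
        later = roles[1:] if roles[0] == "system" else roles
        if any(r not in ("user", "assistant") for r in later):
            errors.append("roles after an optional leading system message must be user or assistant")
        elif any(a == b for a, b in zip(later, later[1:])):
            errors.append("user and assistant messages must alternate")

    max_tokens = payload.get("max_tokens", 4096)  # default from the table
    if isinstance(max_tokens, bool) or not isinstance(max_tokens, int) or not 0 <= max_tokens <= 4096:
        errors.append("max_tokens must be an integer from 0 to 4096")

    temperature = payload.get("temperature", 1.0)  # default from the table
    if (isinstance(temperature, bool) or not isinstance(temperature, (int, float))
            or not 0.0 <= temperature <= 2.0):
        errors.append("temperature must be a number from 0.0 to 2.0")

    return errors
```

A payload that still names `jamba-instruct` yields one error from this check, which is the case this diff makes invalid.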
@@ -275,7 +265,7 @@ __Chat example (fourth request containing third user response)__
 
 ```JSON
 {
-"model": "jamba-instruct",
+"model": "jamba-1.5-large",
 "messages": [
 {"role": "system",
 "content": "You are a helpful genie just released from a bottle. You start the conversation with 'Thank you for freeing me! I grant you one wish.'"},
 | Model |Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
 |---------|---------|---------|---------|
-AI21-Jamba-1.5-Mini | [Microsoft Managed countries/regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | East US 2 <br> South Central US <br> East US <br> West US 3 <br> West US <br> North Central US | Not available |
-AI21-Jamba-1.5-Large | [Microsoft Managed countries/regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | East US 2 <br> South Central US <br> East US <br> West US 3 <br> West US <br> North Central US | Not available |
+AI21-Jamba-1.5-Mini | [Microsoft Managed countries/regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | East US 2 <br> South Central US <br> East US <br> West US 3 <br> West US <br> North Central US <br> Sweden Central | Not available |
+AI21-Jamba-1.5-Large | [Microsoft Managed countries/regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | East US 2 <br> South Central US <br> East US <br> West US 3 <br> West US <br> North Central US <br> Sweden Central | Not available |