You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-studio/how-to/deploy-models-jamba.md
+40-29Lines changed: 40 additions & 29 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -21,30 +21,24 @@ In this article, you learn how to use Azure AI Studio to deploy AI21's Jamba fam
21
21
The Jamba family models are AI21's production-grade Mamba-based large language model (LLM) which leverages AI21's hybrid Mamba-Transformer architecture. It's an instruction-tuned version of AI21's hybrid structured state space model (SSM) transformer Jamba model. The Jamba family models are built for reliable commercial use with respect to quality and performance.
22
22
23
23
> [!TIP]
24
-
> See our announcements of AI21's Jamba family models available now on Azure AI Model Catalog through [AI21's blog](https://aka.ms/ai21-jamba-instruct-blog) and [Microsoft Tech Community Blog](https://aka.ms/ai21-jamba-instruct-announcement).
24
+
> See our announcements of AI21's Jamba family models available now on Azure AI Model Catalog through [AI21's blog](https://aka.ms/ai21-jamba-1.5-large-announcement) and [Microsoft Tech Community Blog](https://aka.ms/ai21-jamba-1.5-large-microsoft-annnouncement).
25
25
26
26
## Deploy the Jamba family models as a serverless API
27
27
28
28
Certain models in the model catalog can be deployed as a serverless API with pay-as-you-go billing, providing a way to consume them as an API without hosting them on your subscription, while keeping the enterprise security and compliance organizations need. This deployment option doesn't require quota from your subscription.
29
29
30
-
# [AI21 Jamba 1.5 Mini](#tab/ai21-jamba-1-5)
31
-
32
-
The [AI21 Jamba 1.5 mini model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5). AI21 can change or update the terms of use and pricing of this model.
33
-
34
-
To get started with Jamba 1.5 mini deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-1.5-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-1.5-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-1.5-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-1.5-azure-api-sample).
The [AI21-Jamba 1.5 large model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5-large) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-large). AI21 can change or update the terms of use and pricing of this model.
32
+
The [AI21-Jamba 1.5 Large model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5-large) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-large). AI21 can change or update the terms of use and pricing of this model.
39
33
40
34
To get started with Jamba 1.5 large deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-1.5-large-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-1.5-large-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-1.5-large-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-1.5-large-azure-api-sample).
41
35
42
36
43
-
# [AI21 Jamba Instruct](#tab/ai21-jamba-instruct)
37
+
# [AI21 Jamba 1.5 Mini](#tab/ai21-jamba-1-5)
44
38
45
-
The [AI21 Jamba Instruct model](https://aka.ms/aistudio/landing/ai21-labs-jamba-instruct) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-instruct). AI21 can change or update the terms of use and pricing of this model.
39
+
The [AI21 Jamba 1.5 Mini model](https://aka.ms/aistudio/landing/ai21-labs-jamba-1.5-mini) deployed as a serverless API with pay-as-you-go billing is [offered by AI21 through Microsoft Azure Marketplace](https://aka.ms/azure-marketplace-offer-ai21-jamba-1.5-mini). AI21 can change or update the terms of use and pricing of this model.
46
40
47
-
To get started with Jamba Instruct deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-instruct-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-instruct-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-instruct-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-instruct-azure-api-sample).
41
+
To get started with Jamba 1.5 mini deployed as a serverless API, explore our integrations with [LangChain](https://aka.ms/ai21-jamba-1.5-mini-langchain-sample), [LiteLLM](https://aka.ms/ai21-jamba-1.5-mini-litellm-sample), [OpenAI](https://aka.ms/ai21-jamba-1.5-mini-openai-sample) and the [Azure API](https://aka.ms/ai21-jamba-1.5-mini-azure-api-sample).
48
42
49
43
---
50
44
@@ -85,16 +79,16 @@ To get started with Jamba Instruct deployed as a serverless API, explore our int
85
79
86
80
### Create a new deployment
87
81
88
-
These steps demonstrate the deployment of AI21-Jamba family models. To create a deployment:
82
+
These steps demonstrate the deployment of `AI21Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini` models. To create a deployment:
89
83
90
84
1. Sign in to [Azure AI Studio](https://ai.azure.com).
91
85
1. Select **Model catalog** from the left sidebar.
92
-
1. Search for and select a AI21 model like `AI21 Jamba 1.5 Mini` or `AI21 Jamba 1.5 Large` or `AI21 Jamba Instruct` to open its Details page.
86
+
1. Search for and select a AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini` or `AI21 Jamba Instruct` to open its Details page.
93
87
1. Select **Deploy** to open a serverless API deployment window for the model.
94
88
1. Alternatively, you can initiate a deployment by starting from your project in AI Studio.
95
89
1. From the left sidebar of your project, select **Components** > **Deployments**.
96
90
1. Select **+ Create deployment**.
97
-
1. Search for and select a AI21 model like `AI21 Jamba 1.5 Mini` or `AI21 Jamba 1.5 Large` or `AI21 Jamba Instruct` to open the Model's Details page.
91
+
1. Search for and select a AI21 model like `AI21 Jamba 1.5 Large` or `AI21 Jamba 1.5 Mini` or `AI21 Jamba Instruct` to open the Model's Details page.
98
92
1. Select **Confirm** to open a serverless API deployment window for the model.
99
93
1. Select the project in which you want to deploy your model. To deploy the AI21-Jamba family models, your project must be in one of the regions listed in the [Prerequisites](#prerequisites) section.
100
94
1. In the deployment wizard, select the link to **Azure Marketplace Terms**, to learn more about the terms of use.
@@ -177,7 +171,7 @@ Payload is a JSON formatted string containing the following parameters:
|`model`|`string`| Y | Must be `jamba-1.5` or `jamba-1.5-large` or `jamba-instruct`|
174
+
|`model`|`string`| Y | Must be `jamba-1.5-large` or `jamba-1.5-mini` or `jamba-instruct`|
181
175
|`messages`|`list[object]`| Y | A list of objects, one per message, from oldest to newest. The oldest message can be role `system`. All later messages must alternate between user and assistant roles. See the message object definition below.|
182
176
|`max_tokens`|`integer`| N <br>`4096`| 0 – 4096 | The maximum number of tokens to allow for each generated response message. Typically the best way to limit output length is by providing a length limit in the system prompt (for example, "limit your answers to three sentences")|
183
177
|`temperature`|`float`| N <br>`1`| 0.0 – 2.0 | How much variation to provide in each answer. Setting this value to 0 guarantees the same response to the same question every time. Setting a higher value encourages more variation. Modifies the distribution from which tokens are sampled. We recommend altering this or `top_p`, but not both. |
@@ -212,11 +206,11 @@ The `document` object has the following fields:
212
206
213
207
#### Request example
214
208
215
-
__Single-turn example Jamba 1.5 mini and Jamba 1.5 large__
209
+
__Single-turn example Jamba 1.5 large and Jamba 1.5 mini__
0 commit comments