Skip to content

Commit 0febef1

Browse files
authored
Merge pull request #279097 from ssalgadodev/patch-119
Update deploy-models-llama.md
2 parents 3b48511 + 417fd08 commit 0febef1

14 files changed

+157
-57
lines changed

articles/ai-studio/how-to/deploy-jais-models.md

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -25,18 +25,24 @@ You can find the JAIS model in the [Model Catalog](model-catalog.md) by filterin
2525
### Prerequisites
2626

2727
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions will not work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
28-
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md).
28+
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for JAIS is only available with hubs created in these regions:
2929

30-
> [!IMPORTANT]
31-
> For JAIS models, the serverless API model deployment offering is only available with hubs created in East US 2 or Sweden Central region.
30+
* East US
31+
* East US 2
32+
* North Central US
33+
* South Central US
34+
* West US
35+
* West US 3
36+
* Sweden Central
3237

38+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
3339
- An [AI Studio project](../how-to/create-projects.md) in Azure AI Studio.
3440
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
3541

3642

3743
### JAIS 30b Chat
3844

39-
JAIS 30b Chat is an auto-regressive bi-lingual LLM for **Arabic** & **English**. The tuned versions use supervised fine-tuning (SFT). The model is finetuned with both Arabic and English prompt-response pairs. The finetuning datasets included a wide range of instructional data across various domains. The model covers a wide range of common tasks including question answering, code generation, and reasoning over textual content. To enhance performance in Arabic, the Core42 team developed an in-house Arabic dataset as well as translating some open-source English instructions into Arabic.
45+
JAIS 30b Chat is an auto-regressive bi-lingual LLM for **Arabic** & **English**. The tuned versions use supervised fine-tuning (SFT). The model is fine-tuned with both Arabic and English prompt-response pairs. The fine-tuning datasets included a wide range of instructional data across various domains. The model covers a wide range of common tasks including question answering, code generation, and reasoning over textual content. To enhance performance in Arabic, the Core42 team developed an in-house Arabic dataset as well as translating some open-source English instructions into Arabic.
4046

4147
*Context length:* JAIS supports a context length of 8K.
4248

@@ -277,3 +283,4 @@ Models deployed as a service with pay-as-you-go billing are protected by [Azure
277283

278284
- [What is Azure AI Studio?](../what-is-ai-studio.md)
279285
- [Azure AI FAQ article](../faq.yml)
286+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-cohere-command.md

Lines changed: 12 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -50,10 +50,17 @@ The previously mentioned Cohere models can be deployed as a service with pay-as-
5050
### Prerequisites
5151

5252
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
53-
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md).
54-
55-
> [!IMPORTANT]
56-
> For Cohere family models, the serverless API model deployment offering is only available with hubs created in **EastUS2** or **Sweden Central** region.
53+
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Cohere Command is only available with hubs created in these regions:
54+
55+
* East US
56+
* East US 2
57+
* North Central US
58+
* South Central US
59+
* West US
60+
* West US 3
61+
* Sweden Central
62+
63+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
5764

5865
- An [AI Studio project](../how-to/create-projects.md) in Azure AI Studio.
5966
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
@@ -684,3 +691,4 @@ Models deployed as a serverless API with pay-as-you-go billing are protected by
684691

685692
- [What is Azure AI Studio?](../what-is-ai-studio.md)
686693
- [Azure AI FAQ article](../faq.yml)
694+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-cohere-embed.md

Lines changed: 12 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -50,10 +50,17 @@ The previously mentioned Cohere models can be deployed as a service with pay-as-
5050
### Prerequisites
5151

5252
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
53-
- An [AI Studio hub](../how-to/create-azure-ai-resource.md).
54-
55-
> [!IMPORTANT]
56-
> For Cohere family models, the serverless API model deployment offering is only available with hubs created in **EastUS2** or **Sweden Central** region.
53+
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Cohere Embed is only available with hubs created in these regions:
54+
55+
* East US
56+
* East US 2
57+
* North Central US
58+
* South Central US
59+
* West US
60+
* West US 3
61+
* Sweden Central
62+
63+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
5764

5865
- An [AI Studio project](../how-to/create-projects.md) in Azure AI Studio.
5966
- Azure role-based access controls are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
@@ -298,3 +305,4 @@ Models deployed as a serverless API are protected by [Azure AI Content Safety](.
298305

299306
- [What is Azure AI Studio?](../what-is-ai-studio.md)
300307
- [Azure AI FAQ article](../faq.yml)
308+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-jamba.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,17 @@ To get started with Jamba Instruct deployed as a serverless API, explore our int
3535
### Prerequisites
3636

3737
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
38-
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Jamba Instruct is only available with hubs created in **East US 2** and **Sweden Central**.
38+
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Jamba Instruct is only available with hubs created in these regions:
39+
40+
* East US
41+
* East US 2
42+
* North Central US
43+
* South Central US
44+
* West US
45+
* West US 3
46+
* Sweden Central
47+
48+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
3949
- An Azure [AI Studio project](../how-to/create-projects.md).
4050
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
4151

@@ -303,3 +313,4 @@ Models deployed as a serverless API are protected by Azure AI content safety. Wi
303313

304314
- [What is Azure AI Studio?](../what-is-ai-studio.md)
305315
- [Azure AI FAQ article](../faq.yml)
316+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-llama.md

Lines changed: 22 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -59,11 +59,17 @@ If you need to deploy a different model, [deploy it to managed compute](#deploy-
5959
# [Meta Llama 3](#tab/llama-three)
6060

6161
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
62-
- An [AI Studio hub](../how-to/create-azure-ai-resource.md).
63-
64-
> [!IMPORTANT]
65-
> For Meta Llama 3 models, the pay-as-you-go model deployment offering is only available with hubs created in **East US 2** and **Sweden Central** regions.
66-
62+
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Meta Llama 3 is only available with hubs created in these regions:
63+
64+
* East US
65+
* East US 2
66+
* North Central US
67+
* South Central US
68+
* West US
69+
* West US 3
70+
* Sweden Central
71+
72+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
6773
- An [AI Studio project](../how-to/create-projects.md) in Azure AI Studio.
6874
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
6975

@@ -87,11 +93,16 @@ If you need to deploy a different model, [deploy it to managed compute](#deploy-
8793
# [Meta Llama 2](#tab/llama-two)
8894

8995
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
90-
- An [AI Studio hub](../how-to/create-azure-ai-resource.md).
91-
92-
> [!IMPORTANT]
93-
> For Meta Llama 2 models, the pay-as-you-go model deployment offering is only available with hubs created in **East US 2** and **West US 3** regions.
94-
96+
- An [AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Meta Llama 2 is only available with hubs created in these regions:
97+
98+
* East US
99+
* East US 2
100+
* North Central US
101+
* South Central US
102+
* West US
103+
* West US 3
104+
105+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
95106
- An [AI Studio project](../how-to/create-projects.md) in Azure AI Studio.
96107
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __owner__ or __contributor__ role for the Azure subscription. Alternatively, your account can be assigned a custom role that has the following permissions:
97108

@@ -562,3 +573,4 @@ Models deployed as a serverless API with pay-as-you-go are protected by Azure AI
562573
- [What is Azure AI Studio?](../what-is-ai-studio.md)
563574
- [Fine-tune a Meta Llama 2 model in Azure AI Studio](fine-tune-model-llama.md)
564575
- [Azure AI FAQ article](../faq.yml)
576+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-phi-3.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -49,11 +49,12 @@ Certain models in the model catalog can be deployed as a serverless API with pay
4949
### Prerequisites
5050

5151
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
52-
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md).
52+
- An [Azure AI Studio hub](../how-to/create-azure-ai-resource.md). The serverless API model deployment offering for Phi-3 is only available with hubs created in these regions:
5353

54-
> [!IMPORTANT]
55-
> For Phi-3 family models, the serverless API model deployment offering is only available with hubs created in **East US 2** and **Sweden Central** regions.
54+
* East US 2
55+
* Sweden Central
5656

57+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md).
5758
- An [Azure AI Studio project](../how-to/create-projects.md).
5859
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../concepts/rbac-ai-studio.md).
5960

@@ -118,5 +119,7 @@ Quota is managed per deployment. Each deployment has a rate limit of 200,000 tok
118119

119120
## Related content
120121

122+
121123
- [What is Azure AI Studio?](../what-is-ai-studio.md)
122124
- [Azure AI FAQ article](../faq.yml)
125+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/ai-studio/how-to/deploy-models-timegen-1.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -250,3 +250,4 @@ Quota is managed per deployment. Each deployment has a rate limit of 200,000 tok
250250

251251
- [What is Azure AI Studio?](../what-is-ai-studio.md)
252252
- [Azure AI FAQ article](../faq.yml)
253+
- [Region availability for models in serverless API endpoints](deploy-models-serverless-availability.md)

articles/machine-learning/deploy-jais-models.md

Lines changed: 18 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: How to deploy JAIS models with Azure Machine Learning Studio
3-
titleSuffix: Azure Machine Learning Studio
4-
description: Learn how to deploy JAIS models with Azure Machine Learning Studio.
2+
title: How to deploy JAIS models with Azure Machine Learning studio
3+
titleSuffix: Azure Machine Learning studio
4+
description: Learn how to deploy JAIS models with Azure Machine Learning studio.
55
manager: scottpolly
66
ms.service: machine-learning
77
ms.topic: how-to
@@ -12,28 +12,35 @@ author: msakande
1212
ms.custom: references_regions, build-2024
1313
---
1414

15-
# How to deploy JAIS with Azure Machine Learning Studio
15+
# How to deploy JAIS with Azure Machine Learning studio
1616

17-
In this article, you learn how to use Azure Machine Learning Studio to deploy the JAIS model as a service with pay-as you go billing.
17+
In this article, you learn how to use Azure Machine Learning studio to deploy the JAIS model as a service with pay-as you go billing.
1818

19-
The JAIS model is available in Azure Machine Learning Studio with pay-as-you-go token based billing with Models as a Service.
19+
The JAIS model is available in Azure Machine Learning studio with pay-as-you-go token based billing with Models as a Service.
2020

2121
You can find the JAIS model in the model catalog by filtering on the JAIS collection.
2222

2323
### Prerequisites
2424

2525
- An Azure subscription with a valid payment method. Free or trial Azure subscriptions won't work. If you don't have an Azure subscription, create a [paid Azure account](https://azure.microsoft.com/pricing/purchase-options/pay-as-you-go) to begin.
26-
- An Azure Machine Learning workspace. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.
26+
- An Azure Machine Learning workspace. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them. The serverless API model deployment offering for JAIS is only available with workspaces created in these regions:
2727

28-
> [!IMPORTANT]
29-
> For JAIS models, the pay-as-you-go model deployment offering is only available with workspaces created in East US 2 or Sweden Central region.
28+
* East US
29+
* East US 2
30+
* North Central US
31+
* South Central US
32+
* West US
33+
* West US 3
34+
* Sweden Central
35+
36+
For a list of regions that are available for each of the models supporting serverless API endpoint deployments, see [Region availability for models in serverless API endpoints](concept-endpoint-serverless-availability.md).
3037

3138
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Studio. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Studio](../ai-studio/concepts/rbac-ai-studio.md).
3239

3340

3441
### JAIS 30b Chat
3542

36-
JAIS 30b Chat is an auto-regressive bi-lingual LLM for **Arabic** & **English**. The tuned versions use supervised fine-tuning (SFT). The model is finetuned with both Arabic and English prompt-response pairs. The finetuning datasets included a wide range of instructional data across various domains. The model covers a wide range of common tasks including question answering, code generation, and reasoning over textual content. To enhance performance in Arabic, the Core42 team developed an in-house Arabic dataset as well as translating some open-source English instructions into Arabic.
43+
JAIS 30b Chat is an auto-regressive bi-lingual LLM for **Arabic** & **English**. The tuned versions use supervised fine-tuning (SFT). The model is fine-tuned with both Arabic and English prompt-response pairs. The fine-tuning datasets included a wide range of instructional data across various domains. The model covers a wide range of common tasks including question answering, code generation, and reasoning over textual content. To enhance performance in Arabic, the Core42 team developed an in-house Arabic dataset as well as translating some open-source English instructions into Arabic.
3744

3845
*Context length:* JAIS 30b Chat supports a context length of 8K.
3946

@@ -278,3 +285,4 @@ Models deployed as a service with pay-as-you-go are protected by [Azure AI Conte
278285

279286
- [What is Azure AI Studio?](../ai-studio/what-is-ai-studio.md)
280287
- [Azure AI FAQ article](../ai-studio/faq.yml)
288+
- [Region availability for models in serverless API endpoints](concept-endpoint-serverless-availability.md)

0 commit comments

Comments
 (0)