
Commit 88706af

Merge pull request #6041 from s-polly/stp-deploy-7-15
How to use foundation models: deployment update
2 parents 329a8e5 + 4f33c5d commit 88706af

3 files changed, 7 additions and 4 deletions

articles/machine-learning/how-to-use-foundation-models.md

Lines changed: 7 additions & 4 deletions
@@ -100,21 +100,24 @@ You can deploy foundation models (both pre-trained models from the model catalog

 ### Deploying using the studio

-You can invoke the Deploy UI form by selecting the **Deploy** button on the model card for any foundation model, and selecting either **standard deployment with Azure AI Content Safety** or **Managed Compute without Azure AI Content Safety**
+You can deploy a model by selecting the **Use this model** button on the model card for any foundation model, and selecting either **Serverless API** or **Managed Compute**. Not all models support serverless API deployment. If the model does not support serverless API, you will only see the **Managed Compute** option.

 :::image type="content" source="./media/how-to-use-foundation-models/deploy-button.png" lightbox="./media/how-to-use-foundation-models/deploy-button.png" alt-text="Screenshot showing the deploy button on the foundation model card.":::

 #### Deployment settings

-Since the scoring script and environment are automatically included with the foundation model, you only need to specify the Virtual machine SKU to use, number of instances and the endpoint name to use for the deployment.
+When you select **Serverless API**, you are taken to the deployment form. The deployment form allows you to specify the deployment name, and whether to enable content filtering and the model version to use for the deployment. You can also provide sample input data to test the model.
+
+
+When you select **Managed Compute**, you are taken to the deployment settings form. The deployment settings form allows you to specify the Virtual machine SKU to use, number of instances, and the endpoint name to use for the deployment.

 :::image type="content" source="./media/how-to-use-foundation-models/deploy-options.png" alt-text="Screenshot showing the deploy options on the foundation model card after user selects the deploy button.":::

 ##### Shared quota

-If you're deploying a Llama-2, Phi, Nemotron, Mistral, Dolly or Deci-DeciLM model from the model catalog but don't have enough quota available for the deployment, Azure Machine Learning allows you to use quota from a shared quota pool for a limited time. For more information on shared quota, see [Azure Machine Learning shared quota](how-to-manage-quotas.md#azure-machine-learning-shared-quota).
+If you're deploying a model from the model catalog but don't have enough quota available for the deployment, Azure Machine Learning allows you to use quota from a shared quota pool for a limited time. For more information on shared quota, see [Azure Machine Learning shared quota](how-to-manage-quotas.md#azure-machine-learning-shared-quota).

-:::image type="content" source="media/how-to-use-foundation-models/deploy-llama-model-with-shared-quota.png" alt-text="Screenshot showing the option to deploy a Llama model temporarily, using shared quota." lightbox="media/how-to-use-foundation-models/deploy-llama-model-with-shared-quota.png":::
+:::image type="content" source="media/how-to-use-foundation-models/deploy-phi-model-with-shared-quota.png" alt-text="Screenshot showing the option to deploy a Llama model temporarily, using shared quota." lightbox="media/how-to-use-foundation-models/deploy-llama-model-with-shared-quota.png":::

 ### Deploying using code based samples

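The updated text describes two rules: which deployment options the studio offers, and which fields the Managed Compute settings form collects. Below is a minimal Python sketch of those two rules. It is illustrative only and does not call any Azure API. The names `available_options` and `ManagedComputeSettings` are hypothetical, the validation checks are assumptions rather than documented limits, and the endpoint name and VM SKU in the usage lines are placeholders.

```python
from dataclasses import dataclass, asdict


def available_options(supports_serverless: bool) -> list[str]:
    """Return the deployment options shown for a model.

    Assumption drawn from the updated text: Managed Compute is always listed,
    and Serverless API appears only when the model supports it.
    """
    options = ["Managed Compute"]
    if supports_serverless:
        options.insert(0, "Serverless API")
    return options


@dataclass(frozen=True)
class ManagedComputeSettings:
    """Hypothetical container for the three Managed Compute form fields."""

    endpoint_name: str
    vm_sku: str
    instance_count: int = 1

    def __post_init__(self) -> None:
        # Basic sanity checks only; these are not Azure's real validation rules.
        if not self.endpoint_name.strip():
            raise ValueError("endpoint_name must not be empty")
        if not self.vm_sku.strip():
            raise ValueError("vm_sku must not be empty")
        if self.instance_count < 1:
            raise ValueError("instance_count must be at least 1")


# Usage with placeholder values.
settings = ManagedComputeSettings("my-endpoint", "Standard_DS3_v2", instance_count=2)
print(available_options(supports_serverless=False))
print(asdict(settings))
```

An actual deployment would pass equivalent values to the Azure Machine Learning SDK or CLI. The sketch only makes the form fields and the option-visibility rule explicit.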
