Skip to content

Commit 51864fc

Browse files
authored
Merge pull request #7490 from s-polly/stp_ml_oct_freshness
ML freshness
2 parents bf28439 + 89cb884 commit 51864fc

File tree

7 files changed

+64
-63
lines changed

7 files changed

+64
-63
lines changed

articles/machine-learning/concept-compute-instance.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.topic: concept-article
99
ms.author: scottpolly
1010
author: s-polly
1111
ms.reviewer: vijetaj
12-
ms.date: 10/01/2024
12+
ms.date: 10/06/2025
1313
monikerRange: 'azureml-api-2 || azureml-api-1'
1414
#Customer intent: As a data scientist, I want to know what a compute instance is and how to use it for Azure Machine Learning.
1515
---

articles/machine-learning/concept-endpoints-online.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ author: s-polly
1010
ms.author: scottpolly
1111
ms.reviewer: sehan
1212
ms.custom: devplatv2, FY25Q1-Linter
13-
ms.date: 09/23/2024
13+
ms.date: 10/06/2025
1414

1515
#Customer intent: As an ML pro, I want to understand what an online endpoint is and why I need it.
1616
---
@@ -48,12 +48,12 @@ Managed online endpoints are the recommended way to use online endpoints in Azur
4848
|Attributes |Managed online endpoints (v2) |Container Instances or AKS (v1) |
4949
|---------|---------|---------|
5050
|Network security/isolation |Easy inbound/outbound control with quick toggle |Virtual network not supported or requires complex manual configuration |
51-
|Managed service | Fully managed compute provisioning/scaling<br> Network configuration for data exfiltration prevention<br> Host OS upgrade, controlled rollout of in-place updates | Scaling is limited<br> User must manage network configuration or upgrade |
51+
|Managed service |* Fully managed compute provisioning/scaling<br>* Network configuration for data exfiltration prevention<br>* Host OS upgrade, controlled rollout of in-place updates |* Scaling is limited<br>* User must manage network configuration or upgrade |
5252
|Endpoint/deployment concept |Distinction between endpoint and deployment enables complex scenarios such as safe rollout of models |No concept of endpoint |
53-
|Diagnostics and Monitoring | Local endpoint debugging possible with Docker and Visual Studio Code<br> Advanced metrics and logs analysis with chart/query to compare between deployments<br> Cost breakdown to deployment level |No easy local debugging |
54-
|Scalability |Elastic, and automatic scaling (not bound by the default cluster size) | Container Instances isn't scalable <br> AKS v1 supports in-cluster scale only and requires scalability configuration |
53+
|Diagnostics and Monitoring |* Local endpoint debugging possible with Docker and Visual Studio Code<br>* Advanced metrics and logs analysis with chart/query to compare between deployments<br>* Cost breakdown to deployment level |No easy local debugging |
54+
|Scalability |Elastic, and automatic scaling (not bound by the default cluster size) |* Container Instances isn't scalable <br>* AKS v1 supports in-cluster scale only and requires scalability configuration |
5555
|Enterprise readiness |Private link, customer managed keys, Microsoft Entra ID, quota management, billing integration, Service Level Agreement (SLA) |Not supported |
56-
|Advanced ML features | Model data collection<br> Model monitoring<br> Champion-challenger model, safe rollout, traffic mirroring<br> Responsible AI extensibility |Not supported |
56+
|Advanced ML features |* Model data collection<br>* Model monitoring<br>* Champion-challenger model, safe rollout, traffic mirroring<br>* Responsible AI extensibility |Not supported |
5757

5858
### Managed online endpoints vs Kubernetes online endpoints
5959

@@ -79,7 +79,7 @@ Managed online endpoints can help streamline your deployment process and provide
7979
> [!NOTE]
8080
> Managed online endpoints are based on Azure Machine Learning compute. When you use a managed online endpoint, you pay for the compute and networking charges. There's no added surcharge. For more information on pricing, see the [Azure pricing calculator](https://azure.microsoft.com/pricing/calculator/).
8181
>
82-
> If you use an Azure Machine Learning virtual network to secure outbound traffic from the managed online endpoint, you're charged for the Azure private link and fully-qualified domain name (FQDN) outbound rules that the managed virtual network uses. For more information, see [Pricing for managed virtual network](how-to-managed-network.md#pricing).
82+
> If you use an Azure Machine Learning virtual network to secure outbound traffic from the managed online endpoint, you're charged for the Azure private link and fully qualified domain name (FQDN) outbound rules that the managed virtual network uses. For more information, see [Pricing for managed virtual network](how-to-managed-network.md#pricing).
8383
8484
The following table highlights the key differences between managed online endpoints and Kubernetes online endpoints.
8585

@@ -150,7 +150,7 @@ The following table describes the key attributes of a deployment:
150150

151151
[!INCLUDE [machine-learning-shared-quota](includes/machine-learning-shared-quota.md)]
152152

153-
To deploy Llama-2, Phi, Nemotron, Mistral, Dolly, and Deci-DeciLM models from the model catalog by using the shared quota, you must have an [Enterprise Agreement subscription](/azure/cost-management-billing/manage/create-enterprise-subscription). For more information on how to use the shared quota for online endpoint deployment, see [How to deploy foundation models using the studio](how-to-use-foundation-models.md#shared-quota).
153+
To deploy Llama, Phi, Mistral, and other models from the model catalog by using the shared quota, you must have an [Enterprise Agreement subscription](/azure/cost-management-billing/manage/create-enterprise-subscription). For more information on how to use the shared quota for online endpoint deployment, see [How to deploy foundation models using the studio](how-to-use-foundation-models.md#shared-quota).
154154

155155
For more information on quotas and limits for resources in Azure Machine Learning, see [Manage and increase quotas and limits for resources with Azure Machine Learning](how-to-manage-quotas.md).
156156

0 commit comments

Comments
 (0)