Merge branch 'main' into release-ignite-azure-ai-language

v-alje · v-alje · commit f7cdf47fd3f4 · 2024-11-06T18:24:53.000Z
diff --git a/articles/ai-studio/includes/create-hub.md b/articles/ai-studio/includes/create-hub.md
@@ -26,7 +26,7 @@ To create a hub in [Azure AI Studio](https://ai.azure.com), follow these steps:
     :::image type="content" source="../media/how-to/hubs/hub-new-connect-services.png" alt-text="Screenshot of the dialog to connect services while creating a new hub." lightbox="../media/how-to/hubs/hub-new-connect-services.png":::
 
     > [!NOTE]
-    > If you don't see (new) before the **Resource group** and **Connect Azure AI Services** entries then an existing resource is being used. For the purposes of this tutorial, create a seperate entity via **Create new resource group** and **Create new AI Services**. This will allow you to prevent any unexpected charges by deleting the entities after the tutorial.
+    > If you don't see (new) before the **Resource group** and **Connect Azure AI Services** entries then an existing resource is being used. For the purposes of this tutorial, create a separate entity via **Create new resource group** and **Create new AI Services**. This will allow you to prevent any unexpected charges by deleting the entities after the tutorial.
 
 1. Review the information and select **Create**.
 
diff --git a/articles/machine-learning/reference-managed-online-endpoints-vm-sku-list.md b/articles/machine-learning/reference-managed-online-endpoints-vm-sku-list.md
@@ -21,22 +21,90 @@ The following table shows the virtual machine (VM) stock keeping units (SKUs) th
 
 * For more information on configuration details such as CPU and RAM, see [Azure Machine Learning Pricing](https://azure.microsoft.com/pricing/details/machine-learning/) and [VM sizes](/azure/virtual-machines/sizes).
 
-| Relative Size | General Purpose | Compute Optimized | Memory Optimized | GPU |
-| ---           | ---             | ---               | ---              | --- |
-| X-Small       | Standard_DS1_v2 <br/> Standard_DS2_v2 <br/> Standard_D2a_v4 <br/> Standard_D2as_v4 | Standard_F2s_v2 | Standard_E2s_v3  | Standard_NC4as_T4_v3 |
-| Small         | Standard_DS3_v2 <br/> Standard_D4a_v4 <br/> Standard_D4as_v4                       | Standard_F4s_v2 <br/> Standard_FX4mds  | Standard_E4s_v3  | Standard_NC6s_v2 <br/> Standard_NC6s_v3 <br/> Standard_NC8as_T4_v3 |
-| Medium        | Standard_DS4_v2 </br> Standard_D8a_v4 </br> Standard_D8as_v4 | Standard_F8s_v2 </br> Standard_FX12mds | Standard_E8s_v3 | Standard_NC12s_v2 <br/> Standard_NC12s_v3 <br/> Standard_NC16as_T4_v3 |
-| Large         | Standard_DS5_v2 </br> Standard_D16a_v4 </br> Standard_D16as_v4 | Standard_F16s_v2 | Standard_E16s_v3 | Standard_NC24s_v2 <br/> Standard_NC24s_v3 <br/> Standard_NC64as_T4_v3 </br> Standard_NC24ads_A100_v4 |
-| X-Large       | Standard_D32a_v4 </br> Standard_D32as_v4 </br> Standard_D48a_v4 </br> Standard_D48as_v4 </br> Standard_D64a_v4 </br> Standard_D64as_v4 </br> Standard_D96a_v4 </br> Standard_D96as_v4 | Standard_F32s_v2 <br/> Standard_F48s_v2 <br/> Standard_F64s_v2 <br/> Standard_F72s_v2 <br/> Standard_FX24mds <br/> Standard_FX36mds <br/> Standard_FX48mds | Standard_E32s_v3 <br/> Standard_E48s_v3 <br/> Standard_E64s_v3 | Standard_NC48ads_A100_v4 </br> Standard_NC96ads_A100_v4 </br> Standard_ND96asr_v4 </br> Standard_ND96amsr_A100_v4 </br> Standard_ND40rs_v2 |
+| Family Name | VM Size Name | Supports Infiniband | Architecture |  numberOfGPUs  |  numberOfCores  | Skip 20% Reservation |
+| --- | --- | --- | --- | --- | --- | --- |
+|  standardDASv4Family  | STANDARD_D2AS_V4 | - | Cpu | 0 | 2 | - |
+|  standardDASv4Family  | STANDARD_D4AS_V4 | - | Cpu | 0 | 4 | - |
+|  standardDASv4Family  | STANDARD_D8AS_V4 | - | Cpu | 0 | 8 | - |
+|  standardDASv4Family  | STANDARD_D16AS_V4 | - | Cpu | 0 | 16 | - |
+|  standardDASv4Family  | STANDARD_D32AS_V4 | - | Cpu | 0 | 32 | - |
+|  standardDASv4Family  | STANDARD_D48AS_V4 | - | Cpu | 0 | 48 | - |
+|  standardDASv4Family  | STANDARD_D64AS_V4 | - | Cpu | 0 | 64 | - |
+|  standardDASv4Family  | STANDARD_D96AS_V4 | - | Cpu | 0 | 96 | - |
+|  standardDAv4Family  | STANDARD_D2A_V4 | - | Cpu | 0 | 2 | - |
+|  standardDAv4Family  | STANDARD_D4A_V4 | - | Cpu | 0 | 4 | - |
+|  standardDAv4Family  | STANDARD_D8A_V4 | - | Cpu | 0 | 8 | - |
+|  standardDAv4Family  | STANDARD_D16A_V4 | - | Cpu | 0 | 16 | - |
+|  standardDAv4Family  | STANDARD_D32A_V4 | - | Cpu | 0 | 32 | - |
+|  standardDAv4Family  | STANDARD_D48A_V4 | - | Cpu | 0 | 48 | - |
+|  standardDAv4Family  | STANDARD_D64A_V4 | - | Cpu | 0 | 64 | - |
+|  standardDAv4Family  | STANDARD_D96A_V4 | - | Cpu | 0 | 96 | - |
+|  standardDSv2Family  | STANDARD_DS1_V2 | - | Cpu | 0 | 1 | - |
+|  standardDSv2Family  | STANDARD_DS2_V2 | - | Cpu | 0 | 2 | - |
+|  standardDSv2Family  | STANDARD_DS3_V2 | - | Cpu | 0 | 4 | - |
+|  standardDSv2Family  | STANDARD_DS4_V2 | - | Cpu | 0 | 8 | - |
+|  standardDSv2Family  | STANDARD_DS5_V2 | - | Cpu | 0 | 16 | - |
+|  standardESv3Family  | STANDARD_E2S_V3 | - | Cpu | 0 | 2 | - |
+|  standardESv3Family  | STANDARD_E4S_V3 | - | Cpu | 0 | 4 | - |
+|  standardESv3Family  | STANDARD_E8S_V3 | - | Cpu | 0 | 8 | - |
+|  standardESv3Family  | STANDARD_E16S_V3 | - | Cpu | 0 | 16 | - |
+|  standardESv3Family  | STANDARD_E32S_V3 | - | Cpu | 0 | 32 | - |
+|  standardESv3Family  | STANDARD_E48S_V3 | - | Cpu | 0 | 48 | - |
+|  standardESv3Family  | STANDARD_E64S_V3 | - | Cpu | 0 | 64 | - |
+|  standardFSv2Family  | STANDARD_F2S_V2 | - | Cpu | 0 | 2 | - |
+|  standardFSv2Family  | STANDARD_F4S_V2 | - | Cpu | 0 | 4 | - |
+|  standardFSv2Family  | STANDARD_F8S_V2 | - | Cpu | 0 | 8 | - |
+|  standardFSv2Family  | STANDARD_F16S_V2 | - | Cpu | 0 | 16 | - |
+|  standardFSv2Family  | STANDARD_F32S_V2 | - | Cpu | 0 | 32 | - |
+|  standardFSv2Family  | STANDARD_F48S_V2 | - | Cpu | 0 | 48 | - |
+|  standardFSv2Family  | STANDARD_F64S_V2 | - | Cpu | 0 | 64 | - |
+|  standardFSv2Family  | STANDARD_F72S_V2 | - | Cpu | 0 | 72 | - |
+|  standardFXMDVSFamily  | STANDARD_FX4MDS | - | Cpu | 0 | 4 | - |
+|  standardFXMDVSFamily  | STANDARD_FX12MDS | - | Cpu | 0 | 12 | - |
+|  standardFXMDVSFamily  | STANDARD_FX24MDS | - | Cpu | 0 | 24 | - |
+|  standardFXMDVSFamily  | STANDARD_FX36MDS | - | Cpu | 0 | 36 | - |
+|  standardFXMDVSFamily  | STANDARD_FX48MDS | - | Cpu | 0 | 48 | - |
+|  standardLASv3Family  | STANDARD_L8AS_V3 | - | Cpu | 0 | 8 | - |
+|  standardLASv3Family  | STANDARD_L16AS_V3 | - | Cpu | 0 | 16 | - |
+|  standardLASv3Family  | STANDARD_L32AS_V3 | - | Cpu | 0 | 32 | - |
+|  standardLASv3Family  | STANDARD_L48AS_V3 | - | Cpu | 0 | 48 | - |
+|  standardLASv3Family  | STANDARD_L64AS_V3 | - | Cpu | 0 | 64 | - |
+|  standardLASv3Family  | STANDARD_L80AS_V3 | - | Cpu | 0 | 80 | - |
+|  standardLSv2Family  | STANDARD_L8S_V2 | - | Cpu | 0 | 8 | - |
+|  standardLSv2Family  | STANDARD_L16S_V2 | - | Cpu | 0 | 16 | - |
+|  standardLSv2Family  | STANDARD_L32S_V2 | - | Cpu | 0 | 32 | - |
+|  standardLSv2Family  | STANDARD_L48S_V2 | - | Cpu | 0 | 48 | - |
+|  standardLSv2Family  | STANDARD_L64S_V2 | - | Cpu | 0 | 64 | - |
+|  standardLSv2Family  | STANDARD_L80S_V2 | - | Cpu | 0 | 80 | - |
+|  standardLSv3Family  | STANDARD_L8S_V3 | - | Cpu | 0 | 8 | - |
+|  standardLSv3Family  | STANDARD_L16S_V3 | - | Cpu | 0 | 16 | - |
+|  standardLSv3Family  | STANDARD_L32S_V3 | - | Cpu | 0 | 32 | - |
+|  standardLSv3Family  | STANDARD_L48S_V3 | - | Cpu | 0 | 48 | - |
+|  standardLSv3Family  | STANDARD_L64S_V3 | - | Cpu | 0 | 64 | - |
+|  standardLSv3Family  | STANDARD_L80S_V3 | - | Cpu | 0 | 80 | - |
+|  standardNCADSA100v4Family  | STANDARD_NC24ADS_A100_V4 | - | NvidiaGpu | 1 | 24 | Yes |
+|  standardNCADSA100v4Family  | STANDARD_NC48ADS_A100_V4 | - | NvidiaGpu | 2 | 48 | Yes |
+|  standardNCADSA100v4Family  | STANDARD_NC96ADS_A100_V4 | - | NvidiaGpu | 4 | 96 | Yes |
+|  Standard NCASv3_T4 Family  | STANDARD_NC4AS_T4_V3 | - | NvidiaGpu | 1 | 4 | - |
+|  Standard NCASv3_T4 Family  | STANDARD_NC8AS_T4_V3 | - | NvidiaGpu | 1 | 8 | - |
+|  Standard NCASv3_T4 Family  | STANDARD_NC16AS_T4_V3 | - | NvidiaGpu | 1 | 16 | - |
+|  Standard NCASv3_T4 Family  | STANDARD_NC64AS_T4_V3 | - | NvidiaGpu | 4 | 64 | - |
+|  standardNCSv2Family  | STANDARD_NC6S_V2 | - | NvidiaGpu | 1 | 6 | - |
+|  standardNCSv2Family  | STANDARD_NC12S_V2 | - | NvidiaGpu | 2 | 12 | - |
+|  standardNCSv2Family  | STANDARD_NC24S_V2 | - | NvidiaGpu | 4 | 24 | - |
+|  standardNCSv3Family  | STANDARD_NC6S_V3 | - | NvidiaGpu | 1 | 6 | - |
+|  standardNCSv3Family  | STANDARD_NC12S_V3 | - | NvidiaGpu | 2 | 12 | - |
+|  standardNCSv3Family  | STANDARD_NC24S_V3 | - | NvidiaGpu | 4 | 24 | - |
+|  standardNCADSH100v5Family  | STANDARD_NC40ADS_H100_V5 | - | NvidiaGpu | 1 | 40 | Yes |
+|  standardNCADSH100v5Family  | STANDARD_NC80ADIS_H100_V5 | - | NvidiaGpu | 2 | 80 | Yes |
+|  standard NDAMSv4_A100Family  | STANDARD_ND96AMSR_A100_V4 | Yes | NvidiaGpu | 8 | 96 | Yes |
+|  Standard NDASv4_A100 Family  | STANDARD_ND96ASR_V4 | Yes | NvidiaGpu | 8 | 96 | Yes |
+|  standardNDSv2Family  | STANDARD_ND40RS_V2 | Yes | NvidiaGpu | 8 | 40 | Yes |
+|  standardNDv5H100Family  | STANDARD_ND96IS_H100_v5 | - | NvidiaGpu | 8 | 96 | Yes |
+|  standardNDv5H100Family  | STANDARD_ND96ISR_H100_v5 | Yes | NvidiaGpu | 8 | 96 | Yes |
 
 > [!CAUTION]
-> `Standard_DS1_v2` and `Standard_F2s_v2` may be too small for bigger models and may lead to container termination due to insufficient memory, not enough space on the disk, or probe failure as it takes too long to initiate the container. If you face [OutOfQuota errors](how-to-troubleshoot-online-endpoints.md?tabs=cli#error-outofquota) or [ReourceNotReady errors](how-to-troubleshoot-online-endpoints.md?tabs=cli#error-resourcenotready), try bigger VM SKUs. If you want to reduce the cost of deploying multiple models with managed online endpoint, see [Deployment for several local models](concept-online-deployment-model-specification.md#deployment-for-several-local-models).
+> Small VM SKUs such as `Standard_DS1_v2` and `Standard_F2s_v2` may be too small for bigger models and may lead to container termination due to insufficient memory, not enough space on the disk, or probe failure as it takes too long to initiate the container. If you face [OutOfQuota errors](how-to-troubleshoot-online-endpoints.md?tabs=cli#error-outofquota) or [ReourceNotReady errors](how-to-troubleshoot-online-endpoints.md?tabs=cli#error-resourcenotready), try bigger VM SKUs. If you want to reduce the cost of deploying multiple models with managed online endpoint, see [Deployment for several local models](concept-online-deployment-model-specification.md#deployment-for-several-local-models).
 
 > [!NOTE]
-> We recommend having more than 3 instances for deployments in production scenarios. In addition, Azure Machine Learning reserves 20% of your compute resources for performing upgrades on some VM SKUs as described in [Virtual machine quota allocation for deployment](how-to-manage-quotas.md#virtual-machine-quota-allocation-for-deployment). VM SKUs that are exempted from this extra quota reservation are listed below:
-> - Standard_NC24ads_A100_v4
-> - Standard_NC48ads_A100_v4
-> - Standard_NC96ads_A100_v4
-> - Standard_ND96asr_v4
-> - Standard_ND96amsr_A100_v4
-> - Standard_ND40rs_v2
+> We recommend having more than 3 instances for deployments in production scenarios. In addition, Azure Machine Learning reserves 20% of your compute resources for performing upgrades on some VM SKUs as described in [Virtual machine quota allocation for deployment](how-to-manage-quotas.md#virtual-machine-quota-allocation-for-deployment). VM SKUs that are exempted from this extra quota reservation are specified in the "Skip 20% Reservation" column.