Skip to content

Commit 03798ee

Browse files
authored
Merge pull request #58003 from cynthn/ndv2
NDv2
2 parents 01aeb41 + 9205c74 commit 03798ee

7 files changed

+43
-25
lines changed

articles/virtual-machines/linux/sizes.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.devlang: na
1414
ms.topic: article
1515
ms.tgt_pltfrm: vm-linux
1616
ms.workload: infrastructure-services
17-
ms.date: 10/01/2018
17+
ms.date: 11/14/2018
1818
ms.author: jonbeck
1919
---
2020

@@ -28,7 +28,7 @@ This article describes the available sizes and options for the Azure virtual mac
2828
| [Compute optimized](sizes-compute.md) | Fsv2, Fs, F | High CPU-to-memory ratio. Good for medium traffic web servers, network appliances, batch processes, and application servers. |
2929
| [Memory optimized](sizes-memory.md) | Esv3, Ev3, M, GS, G, DSv2, Dv2 | High memory-to-CPU ratio. Great for relational database servers, medium to large caches, and in-memory analytics. |
3030
| [Storage optimized](sizes-storage.md) | Ls | High disk throughput and IO. Ideal for Big Data, SQL, and NoSQL databases. |
31-
| [GPU](sizes-gpu.md) | NV, NVv2, NC, NCv2, NCv3, ND | Specialized virtual machines targeted for heavy graphic rendering and video editing, as well as model training and inferencing (ND) with deep learning. Available with single or multiple GPUs. |
31+
| [GPU](sizes-gpu.md) | NV, NVv2, NC, NCv2, NCv3, ND, NDv2 (Preview) | Specialized virtual machines targeted for heavy graphic rendering and video editing, as well as model training and inferencing (ND) with deep learning. Available with single or multiple GPUs. |
3232
| [High performance compute](sizes-hpc.md) | H | Our fastest and most powerful CPU virtual machines with optional high-throughput network interfaces (RDMA). |
3333

3434

articles/virtual-machines/windows/n-series-driver-setup.md

Lines changed: 4 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -2,8 +2,7 @@
22
title: Azure N-series GPU driver setup for Windows | Microsoft Docs
33
description: How to set up NVIDIA GPU drivers for N-series VMs running Windows Server or Windows in Azure
44
services: virtual-machines-windows
5-
documentationcenter: ''
6-
author: dlepow
5+
author: cynthn
76
manager: jeconnoc
87
editor: ''
98
tags: azure-resource-manager
@@ -15,9 +14,8 @@ ms.topic: article
1514
ms.tgt_pltfrm: vm-windows
1615
ms.workload: infrastructure-services
1716
ms.date: 09/24/2018
18-
ms.author: danlep
17+
ms.author: cynthn
1918
ms.custom: H1Hack27Feb2017
20-
2119
---
2220
# Install NVIDIA GPU drivers on N-series VMs running Windows
2321

@@ -47,13 +45,13 @@ To query the GPU device state, run the [nvidia-smi](https://developer.nvidia.com
4745

4846
1. Open a command prompt and change to the **C:\Program Files\NVIDIA Corporation\NVSMI** directory.
4947

50-
2. Run `nvidia-smi`. If the driver is installed you will see output similar to the following. Note that **GPU-Util** shows **0%** unless you are currently running a GPU workload on the VM. Your driver version and GPU details may be different from the ones shown.
48+
2. Run `nvidia-smi`. If the driver is installed, you will see output similar to the following. The **GPU-Util** shows **0%** unless you are currently running a GPU workload on the VM. Your driver version and GPU details may be different from the ones shown.
5149

5250
![NVIDIA device status](./media/n-series-driver-setup/smi.png)
5351

5452
## RDMA network connectivity
5553

56-
RDMA network connectivity can be enabled on RDMA-capable N-series VMs such as NC24r deployed in the same availability set or in a single placement group in a VM scale set. The HpcVmDrivers extension must be added to install Windows network device drivers that enable RDMA connectivity. To add the VM extension to an RDMA-enabled N-series VM, use [Azure PowerShell](/powershell/azure/overview) cmdlets for Azure Resource Manager.
54+
RDMA network connectivity can be enabled on RDMA-capable N-series VMs such as NC24r deployed in the same availability set or in a single placement group in a virtual machine scale set. The HpcVmDrivers extension must be added to install Windows network device drivers that enable RDMA connectivity. To add the VM extension to an RDMA-enabled N-series VM, use [Azure PowerShell](/powershell/azure/overview) cmdlets for Azure Resource Manager.
5755

5856
To install the latest version 1.1 HpcVMDrivers extension on an existing RDMA-capable VM named myVM in the West US region:
5957
```PowerShell

articles/virtual-machines/windows/sizes.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.devlang: na
1414
ms.topic: article
1515
ms.tgt_pltfrm: vm-windows
1616
ms.workload: infrastructure-services
17-
ms.date: 10/01/2018
17+
ms.date: 11/14/2018
1818
ms.author: jonbeck
1919
---
2020

@@ -29,7 +29,7 @@ This article describes the available sizes and options for the Azure virtual mac
2929
| [Compute optimized](sizes-compute.md) | Fsv2, Fs, F | High CPU-to-memory ratio. Good for medium traffic web servers, network appliances, batch processes, and application servers. |
3030
| [Memory optimized](../virtual-machines-windows-sizes-memory.md) | Esv3, Ev3, M, GS, G, DSv2, Dv2 | High memory-to-CPU ratio. Great for relational database servers, medium to large caches, and in-memory analytics. |
3131
| [Storage optimized](../virtual-machines-windows-sizes-storage.md) | Ls | High disk throughput and IO. Ideal for Big Data, SQL, and NoSQL databases. |
32-
| [GPU](sizes-gpu.md) | NV, NVv2, NC, NCv2, NCv3, ND | Specialized virtual machines targeted for heavy graphic rendering and video editing, as well as model training and inferencing (ND) with deep learning. Available with single or multiple GPUs. |
32+
| [GPU](sizes-gpu.md) | NV, NVv2, NC, NCv2, NCv3, ND, Ndv2 (Preview) | Specialized virtual machines targeted for heavy graphic rendering and video editing, as well as model training and inferencing (ND) with deep learning. Available with single or multiple GPUs. |
3333
| [High performance compute](sizes-hpc.md) | H | Our fastest and most powerful CPU virtual machines with optional high-throughput network interfaces (RDMA). |
3434

3535

includes/virtual-machines-common-sizes-gpu.md

Lines changed: 26 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -2,17 +2,17 @@
22
title: include file
33
description: include file
44
services: virtual-machines-windows, virtual-machines-linux
5-
author: dlepow
5+
author: cynthn
66
ms.service: multiple
77
ms.topic: include
8-
ms.date: 10/23/2018
9-
ms.author: danlep;azcspmt;jonbeck
8+
ms.date: 11/14/2018
9+
ms.author: cynthn;azcspmt;jonbeck
1010
ms.custom: include file
1111
---
1212

1313
GPU optimized VM sizes are specialized virtual machines available with single or multiple NVIDIA GPUs. These sizes are designed for compute-intensive, graphics-intensive, and visualization workloads. This article provides information about the number and type of GPUs, vCPUs, data disks, and NICs. Storage throughput and network bandwidth are also included for each size in this grouping.
1414

15-
* **NC, NCv2, NCv3, and ND** sizes are optimized for compute-intensive and network-intensive applications and algorithms. Some examples are CUDA- and OpenCL-based applications and simulations, AI, and Deep Learning. The NCv3-series is focused on high-performance computing workloads featuring NVIDIA’s Tesla V100 GPU. The ND-series is focused on training and inference scenarios for deep learning. It uses the NVIDIA Tesla P40 GPU.
15+
* **NC, NCv2, NCv3, ND, and NDv2** sizes are optimized for compute-intensive and network-intensive applications and algorithms. Some examples are CUDA- and OpenCL-based applications and simulations, AI, and Deep Learning. The NCv3-series is focused on high-performance computing workloads featuring NVIDIA’s Tesla V100 GPU. The ND-series is focused on training and inference scenarios for deep learning. It uses the NVIDIA Tesla P40 GPU.
1616
* **NV and NVv2** sizes are optimized and designed for remote visualization, streaming, gaming, encoding, and VDI scenarios using frameworks such as OpenGL and DirectX. These VMs are backed by the NVIDIA Tesla M60 GPU.
1717

1818

@@ -22,7 +22,7 @@ Premium Storage: Not Supported
2222

2323
Premium Storage Caching: Not Supported
2424

25-
NC-series VMs are powered by the [NVIDIA Tesla K80](http://images.nvidia.com/content/pdf/kepler/Tesla-K80-BoardSpec-07317-001-v05.pdf) card. Users can crunch through data faster by leveraging CUDA for energy exploration applications, crash simulations, ray traced rendering, deep learning and more. The NC24r configuration provides a low latency, high-throughput network interface optimized for tightly coupled parallel computing workloads.
25+
NC-series VMs are powered by the [NVIDIA Tesla K80](http://images.nvidia.com/content/pdf/kepler/Tesla-K80-BoardSpec-07317-001-v05.pdf) card. Users can crunch through data faster by leveraging CUDA for energy exploration applications, crash simulations, ray traced rendering, deep learning, and more. The NC24r configuration provides a low latency, high-throughput network interface optimized for tightly coupled parallel computing workloads.
2626

2727

2828
| Size | vCPU | Memory: GiB | Temp storage (SSD) GiB | GPU | GPU memory: GiB | Max data disks | Max NICs |
@@ -82,13 +82,33 @@ NCv3-series VMs are powered by [NVIDIA Tesla V100](http://www.nvidia.com/content
8282

8383
*RDMA capable
8484

85+
## NDv2-series (Preview)
86+
87+
88+
Premium Storage: Supported
89+
90+
Premium Storage Caching: Supported
91+
92+
Infiniband: Not supported
93+
94+
95+
NDv2-series virtual machine is a new addition to the GPU family designed for the needs of the HPC, AI, and machine learning workloads. It’s powered by 8 NVIDIA Tesla V100 NVLINK interconnected GPUs and 40 Intel Skylake cores and 672 GiB of system memory. NDv2 instance provides excellent FP32 and FP64 performance for HPC and AI workloads utilizing Cuda, TensorFlow, Pytorch, Caffe, and other frameworks.
96+
97+
[Sign-up and get access to these machines during preview](https://aka.ms/ndv2signup).
98+
<br>
99+
100+
101+
| Size | vCPU’s | GPU | Memory | NICs (Max) | Max. disk size | Max. data disks (1023 GB each) | Max network bandwidth |
102+
|-------------------|-------------|-------------------|--------|------------------|---------|------------|--------------------------|--------------------|--------------------------------|-----------------------------------------|-----------------------|------------|
103+
| Standard_ND40s_v2 | 40 | 8 V100 (NVlilnk) | 672 GiB | 8 | Temporary 1344 / 2948XIO | 32 | 24,000 Mbps |
104+
85105
## ND-series
86106

87107
Premium Storage: Supported
88108

89109
Premium Storage Caching: Supported
90110

91-
The ND-series virtual machines are a new addition to the GPU family designed for AI and Deep Learning workloads. They offer excellent performance for training and inference. ND instances are powered by [NVIDIA Tesla P40](http://images.nvidia.com/content/pdf/tesla/184427-Tesla-P40-Datasheet-NV-Final-Letter-Web.pdf) GPUs. These instances provide excellent performance for single-precision floating point operations, for AI workloads utilizing Microsoft Cognitive Toolkit, TensorFlow, Caffe, and other frameworks. The ND-series also offers a much larger GPU memory size (24 GB), enabling to fit much larger neural net models. Like the NC-series, the ND-series offers a configuration with a secondary low-latency, high-throughput network through RDMA, and InfiniBand connectivity so you can run large-scale training jobs spanning many GPUs.
111+
The ND-series virtual machines are a new addition to the GPU family designed for AI, and Deep Learning workloads. They offer excellent performance for training and inference. ND instances are powered by [NVIDIA Tesla P40](http://images.nvidia.com/content/pdf/tesla/184427-Tesla-P40-Datasheet-NV-Final-Letter-Web.pdf) GPUs. These instances provide excellent performance for single-precision floating point operations, for AI workloads utilizing Microsoft Cognitive Toolkit, TensorFlow, Caffe, and other frameworks. The ND-series also offers a much larger GPU memory size (24 GB), enabling to fit much larger neural net models. Like the NC-series, the ND-series offers a configuration with a secondary low-latency, high-throughput network through RDMA, and InfiniBand connectivity so you can run large-scale training jobs spanning many GPUs.
92112

93113
> [!IMPORTANT]
94114
> For this size family, the vCPU (core) quota per region in your subscription is initially set to 0. [Request a vCPU quota increase](../articles/azure-supportability/resource-manager-core-quotas-request.md) for this family in an [available region](https://azure.microsoft.com/regions/services/).

includes/virtual-machines-n-series-considerations.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,11 +16,11 @@
1616

1717
* N-series VMs can only be deployed in the Resource Manager deployment model.
1818

19-
* N-series VMs differ in the type of Azure Storage they support for their disks. NC and NV VMs only support VM disks that are backed by Standard Disk Storage (HDD). NCv2, NCv3, ND, and NVv2 VMs only support VM disks that are backed by Premium Disk Storage (SSD).
19+
* N-series VMs differ in the type of Azure Storage they support for their disks. NC and NV VMs only support VM disks that are backed by Standard Disk Storage (HDD). NCv2, NCv3, ND, NDv2, and NVv2 VMs only support VM disks that are backed by Premium Disk Storage (SSD).
2020

2121
* If you want to deploy more than a few N-series VMs, consider a pay-as-you-go subscription or other purchase options. If you're using an [Azure free account](https://azure.microsoft.com/free/), you can use only a limited number of Azure compute cores.
2222

23-
* You might need to increase the cores quota (per region) in your Azure subscription, and increase the separate quota for NC, NCv2, NCv3, ND, NV, or NVv2 cores. To request a quota increase, [open an online customer support request](../articles/azure-supportability/how-to-create-azure-support-request.md) at no charge. Default limits may vary depending on your subscription category.
23+
* You might need to increase the cores quota (per region) in your Azure subscription, and increase the separate quota for NC, NCv2, NCv3, ND, NDv2, NV, or NVv2 cores. To request a quota increase, [open an online customer support request](../articles/azure-supportability/how-to-create-azure-support-request.md) at no charge. Default limits may vary depending on your subscription category.
2424

2525

2626

includes/virtual-machines-n-series-linux-support.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -2,19 +2,19 @@
22
title: include file
33
description: include file
44
services: virtual-machines-linux
5-
author: dlepow
5+
author: cynthn
66
ms.service: virtual-machines-linux
77
ms.topic: include
88
ms.date: 11/08/2018
9-
ms.author: danlep
9+
ms.author: cynthn
1010
ms.custom: include file
1111
---
1212

1313
## Supported distributions and drivers
1414

1515
### NVIDIA CUDA drivers
1616

17-
NVIDIA CUDA drivers for NC, NCv2, NCv3, and ND-series VMs (optional for NV-series) are supported only on the Linux distributions listed in the following table. CUDA driver information is current at time of publication. For the latest CUDA drivers, visit the [NVIDIA](https://developer.nvidia.com/cuda-zone) website. Ensure that you install or upgrade to the latest CUDA drivers for your distribution.
17+
NVIDIA CUDA drivers for NC, NCv2, NCv3, ND, and NDv2-series VMs (optional for NV-series) are supported only on the Linux distributions listed in the following table. CUDA driver information is current at time of publication. For the latest CUDA drivers, visit the [NVIDIA](https://developer.nvidia.com/cuda-zone) website. Ensure that you install or upgrade to the latest CUDA drivers for your distribution.
1818

1919
> [!TIP]
2020
> As an alternative to manual CUDA driver installation on a Linux VM, you can deploy an Azure [Data Science Virtual Machine](../articles/machine-learning/data-science-virtual-machine/overview.md) image. The DSVM editions for Ubuntu 16.04 LTS or CentOS 7.4 pre-install NVIDIA CUDA drivers, the CUDA Deep Neural Network Library, and other tools.

includes/virtual-machines-n-series-windows-support.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -2,19 +2,19 @@
22
title: include file
33
description: include file
44
services: virtual-machines-windows
5-
author: dlepow
5+
author: cynthn
66
ms.service: virtual-machines-windows
77
ms.topic: include
8-
ms.date: 11/08/2018
9-
ms.author: danlep
8+
ms.date: 11/14/2018
9+
ms.author: cynthn
1010
ms.custom: include file
1111
---
1212

1313
## Supported operating systems and drivers
1414

1515
### NVIDIA Tesla (CUDA) drivers
1616

17-
NVIDIA Tesla (CUDA) drivers for NC, NCv2, NCv3, and ND-series VMs (optional for NV-series) are supported only on the operating systems listed in the following table. Driver download links are current at time of publication. For the latest drivers, visit the [NVIDIA](http://www.nvidia.com/) website.
17+
NVIDIA Tesla (CUDA) drivers for NC, NCv2, NCv3, ND, and NDv2-series VMs (optional for NV-series) are supported only on the operating systems listed in the following table. Driver download links are current at time of publication. For the latest drivers, visit the [NVIDIA](http://www.nvidia.com/) website.
1818

1919
> [!TIP]
2020
> As an alternative to manual CUDA driver installation on a Windows Server VM, you can deploy an Azure [Data Science Virtual Machine](../articles/machine-learning/data-science-virtual-machine/overview.md) image. The DSVM editions for Windows Server 2016 pre-install NVIDIA CUDA drivers, the CUDA Deep Neural Network Library, and other tools.

0 commit comments

Comments
 (0)