Commit b5bdf0e

Merge pull request #273555 from fbsolo-ms1/update-data-science-virtual-machine-files
Update data science virtual machine files
2 parents c161c9d + 9024ac0 commit b5bdf0e

5 files changed (+29, -24 lines changed)

articles/lab-services/class-type-deep-learning-natural-language-processing.md

Lines changed: 1 addition & 1 deletion
@@ -35,7 +35,7 @@ For instructions on how to create a lab, see [Tutorial: Set up a lab](tutorial-s
3535
| Lab settings | Value |
3636
| ------------ | ------------------ |
3737
| Virtual machine (VM) size | **Small GPU (Compute)**. This size is best suited for compute-intensive and network-intensive applications like Artificial Intelligence and Deep Learning. |
38-
| VM image | [Data Science Virtual Machine for Linux (Ubuntu)](https://azuremarketplace.microsoft.com/marketplace/apps?search=Data%20science%20Virtual%20machine&page=1&filters=microsoft%3Blinux). This image provides deep learning frameworks and tools for machine learning and data science. To view the full list of installed tools on this image, see [What's included on the DSVM?](../machine-learning/data-science-virtual-machine/overview.md#whats-included-on-the-dsvm). |
38+
| VM image | [Data Science Virtual Machine for Linux (Ubuntu)](https://azuremarketplace.microsoft.com/marketplace/apps?search=Data%20science%20Virtual%20machine&page=1&filters=microsoft%3Blinux). This image provides deep learning frameworks and tools for machine learning and data science. To view the full list of installed tools on this image, see [What does the DSVM include?](../machine-learning/data-science-virtual-machine/overview.md#what-does-the-dsvm-include). |
3939
| Enable remote desktop connection | Optionally, check **Enable remote desktop connection**. The Data Science image is already configured to use X2Go so that teachers and students can connect using a GUI remote desktop. X2Go *doesn't* require the **Enable remote desktop connection** setting to be enabled. |
4040
| Template Virtual Machine Settings | Optionally, choose **Use a virtual machine image without customization**. If you're using [lab plans](concept-lab-accounts-versus-lab-plans.md) and the DSVM has all the tools that your class requires, you can skip the template customization step. |
4141

articles/lab-services/how-to-setup-lab-gpu-1.md

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ As shown in the preceding image, this option is enabled by default, which ensure
5252
- When you select a *visualization* GPU size, your lab VMs are powered by the [NVIDIA Tesla M60](https://images.nvidia.com/content/tesla/pdf/188417-Tesla-M60-DS-A4-fnl-Web.pdf) GPU and [GRID technology](https://www.nvidia.com/content/dam/en-zz/Solutions/design-visualization/solutions/resources/documents1/NVIDIA_GRID_vPC_Solution_Overview.pdf). In this case, recent GRID drivers are installed, which enables the use of graphics-intensive applications.
5353

5454
> [!IMPORTANT]
55-
> The **Install GPU drivers** option only installs the drivers when they aren't present on your lab's image. For example, the GPU drivers are already installed on the Azure marketplace's [Data Science image](../machine-learning/data-science-virtual-machine/overview.md#whats-included-on-the-dsvm). If you create a lab using the Data Science image and choose to **Install GPU drivers**, the drivers won't be updated to a more recent version. To update the drivers, you will need to manually install them as explained in the next section.
55+
> The **Install GPU drivers** option only installs the drivers when they aren't present on your lab's image. For example, the GPU drivers are already installed on the Azure marketplace's [Data Science image](../machine-learning/data-science-virtual-machine/overview.md#what-does-the-dsvm-include). If you create a lab using the Data Science image and choose to **Install GPU drivers**, the drivers won't be updated to a more recent version. To update the drivers, you will need to manually install them as explained in the next section.
5656
5757
### Install the drivers manually
5858

articles/lab-services/how-to-setup-lab-gpu.md

Lines changed: 1 addition & 1 deletion
@@ -60,7 +60,7 @@ When you select **Install GPU drivers**, it ensures that recently released drive
6060
- When you select the Medium GPU *(Visualization)* size, your lab VMs are powered by the [NVIDIA Tesla M60](https://images.nvidia.com/content/tesla/pdf/188417-Tesla-M60-DS-A4-fnl-Web.pdf) GPU and [GRID technology](https://www.nvidia.com/content/dam/en-zz/Solutions/design-visualization/solutions/resources/documents1/NVIDIA_GRID_vPC_Solution_Overview.pdf). In this case, recent GRID drivers are installed, which enables the use of graphics-intensive applications.
6161

6262
> [!IMPORTANT]
63-
> The **Install GPU drivers** option only installs the drivers when they aren't present on your lab's image. For example, NVIDIA GPU drivers are already installed on the Azure marketplace's [Data Science Virtual Machine image](../machine-learning/data-science-virtual-machine/overview.md#whats-included-on-the-dsvm). If you create a Small GPU (Compute) lab using the Data Science image and choose to **Install GPU drivers**, the drivers won't be updated to a more recent version. To update the drivers, you will need to manually install the drivers.
63+
> The **Install GPU drivers** option only installs the drivers when they aren't present on your lab's image. For example, NVIDIA GPU drivers are already installed on the Azure marketplace's [Data Science Virtual Machine image](../machine-learning/data-science-virtual-machine/overview.md#what-does-the-dsvm-include). If you create a Small GPU (Compute) lab using the Data Science image and choose to **Install GPU drivers**, the drivers won't be updated to a more recent version. To update the drivers, you will need to manually install the drivers.
6464
6565
### Install GPU drivers manually
6666

articles/lab-services/how-to-use-shared-image-gallery.md

Lines changed: 1 addition & 1 deletion
@@ -13,7 +13,7 @@ An image contains the operating system, software applications, files, and settin
1313

1414
You can use two types of images to set up a new lab:
1515

16-
- Azure Marketplace images are prebuilt by Microsoft for use within Azure. These images have either Windows or Linux installed and may also include software applications. For example, the [Data Science Virtual Machine image](../machine-learning/data-science-virtual-machine/overview.md#whats-included-on-the-dsvm) includes deep learning frameworks and tools.
16+
- Azure Marketplace images are prebuilt by Microsoft for use within Azure. These images have either Windows or Linux installed and may also include software applications. For example, the [Data Science Virtual Machine image](../machine-learning/data-science-virtual-machine/overview.md#what-does-the-dsvm-include) includes deep learning frameworks and tools.
1717
- Custom images are created by your institution’s IT department and\or other educators. You can create both Windows and Linux custom images. You have the flexibility to install Microsoft and third-party applications based on your unique needs. You also can add files, change application settings, and more.
1818

1919
> [!IMPORTANT]

articles/machine-learning/data-science-virtual-machine/overview.md

Lines changed: 25 additions & 20 deletions
@@ -8,26 +8,34 @@ ms.service: data-science-vm
88
ms.custom: linux-related-content
99
author: jesscioffi
1010
ms.author: jcioffi
11-
ms.reviewer: mattmcinnes
1211
ms.topic: overview
13-
ms.date: 06/23/2022
12+
ms.reviewer: franksolomon
13+
ms.date: 04/26/2024
1414
---
1515

1616
# What is the Azure Data Science Virtual Machine for Linux and Windows?
1717

18-
The Data Science Virtual Machine (DSVM) is a customized VM image on the Azure cloud platform built specifically for doing data science. It has many popular data science tools preinstalled and preconfigured to jump-start building intelligent applications for advanced analytics.
18+
The Data Science Virtual Machine (DSVM) is a customized VM image available on the Azure cloud platform, and it can handle data science. It has many popular data science tools preinstalled and preconfigured to jump-start building intelligent applications for advanced analytics.
1919

2020
The DSVM is available on:
2121

2222
+ Windows Server 2019
2323
+ Windows Server 2022
2424
+ Ubuntu 20.04 LTS
2525

26-
Additionally, we're excited to offer Azure DSVM for PyTorch, which is an Ubuntu 20.04 image from Azure Marketplace that is optimized for large, distributed deep learning workloads. It comes preinstalled and validated with the latest PyTorch version to reduce setup costs and accelerate time to value. It comes packaged with various optimization functionalities (ONNX Runtime​, DeepSpeed​, MSCCL​, ORTMoE​, Fairscale​, Nvidia Apex​), and an up-to-date stack with the latest compatible versions of Ubuntu, Python, PyTorch, CUDA.
26+
Additionally, we offer Azure DSVM for PyTorch - an Ubuntu 20.04 image from Azure Marketplace optimized for large, distributed deep learning workloads. This preinstalled DSVM comes validated with the latest PyTorch version, to reduce setup costs and accelerate time to value. It comes packaged with various optimization features:
27+
28+
- ONNX Runtime​
29+
- DeepSpeed​
30+
- MSCCL​
31+
- ORTMoE​
32+
- Fairscale​
33+
- Nvidia Apex​
34+
- An up-to-date stack with the latest compatible versions of Ubuntu, Python, PyTorch, and CUDA
2735

2836
## Comparison with Azure Machine Learning
2937

30-
The DSVM is a customized VM image for Data Science but [Azure Machine Learning](../overview-what-is-azure-machine-learning.md) is an end-to-end platform that encompasses:
38+
The DSVM is a customized VM image for Data Science, but [Azure Machine Learning](../overview-what-is-azure-machine-learning.md) is an end-to-end platform that covers:
3139

3240
+ Fully Managed Compute
3341
+ Compute Instances
@@ -43,9 +51,9 @@ The DSVM is a customized VM image for Data Science but [Azure Machine Learning](
4351

4452
### Comparison with Azure Machine Learning Compute Instances
4553

46-
[Azure Machine Learning Compute Instances](../concept-compute-instance.md) are a fully configured and __managed__ VM image whereas the DSVM is an __unmanaged__ VM.
54+
[Azure Machine Learning Compute Instances](../concept-compute-instance.md) are a fully configured and __managed__ VM image, while the DSVM is an __unmanaged__ VM.
4755

48-
Key differences between these:
56+
Key differences between a DSVM and an Azure Machine Learning compute instance:
4957

5058
|Feature |Data Science<br>VM |Azure Machine Learning<br>Compute Instance |
5159
|---------|---------|---------|
@@ -61,38 +69,35 @@ Key differences between these:
6169
|Built-in Collaboration | No | Yes |
6270
|Preinstalled Tools | Jupyter(lab), VS Code,<br> Visual Studio, PyCharm, Juno,<br>Power BI Desktop, SSMS, <br>Microsoft Office 365, Apache Drill | Jupyter(lab) |
6371

64-
## Sample use cases
65-
66-
Here's some common use cases for DSVM customers.
72+
## Sample DSVM customer use cases
6773

6874
### Short-term experimentation and evaluation
6975

70-
You can use the DSVM to evaluate or learn new data science [tools](./tools-included.md), especially by going through some of our published [samples and walkthroughs](./dsvm-samples-and-walkthroughs.md).
76+
The DSVM can evaluate or learn new data science [tools](./tools-included.md). Try some of our published [samples and walkthroughs](./dsvm-samples-and-walkthroughs.md).
7177

7278
### Deep learning with GPUs
7379

74-
In the DSVM, your training models can use deep learning algorithms on hardware that's based on graphics processing units (GPUs). By taking advantage of the VM scaling capabilities of the Azure platform, the DSVM helps you use GPU-based hardware in the cloud according to your needs. You can switch to a GPU-based VM when you're training large models, or when you need high-speed computations while keeping the same OS disk. You can choose any of the N series GPUs enabled virtual machine SKUs with DSVM. Note GPU enabled virtual machine SKUs aren't supported on Azure free accounts.
80+
In the DSVM, your training models can use deep learning algorithms on graphics processing unit (GPU)-based hardware. If you take advantage of the VM scaling capabilities of the Azure platform, the DSVM helps you lever GPU-based hardware in the cloud, according to your needs. You can switch to a GPU-based VM when you train large models, or when you need high-speed computations while you keep the same OS disk. You can choose any of the N series GPU-enabled virtual machine SKUs with DSVM. Azure free accounts don't support GPU-enabled virtual machine SKUs.
7581

76-
The Windows editions of the DSVM come preinstalled with GPU drivers, frameworks, and GPU versions of deep learning frameworks. On the Linux editions, deep learning on GPUs is enabled on the Ubuntu DSVMs.
82+
A Windows-edition DSVM comes preinstalled with GPU drivers, frameworks, and GPU versions of deep learning frameworks. On the Linux editions, deep learning on GPUs is enabled on the Ubuntu DSVMs.
7783

78-
You can also deploy the Ubuntu or Windows editions of the DSVM to an Azure virtual machine that isn't based on GPUs. In this case, all the deep learning frameworks falls back to the CPU mode.
84+
You can also deploy the Ubuntu or Windows DSVM editions to an Azure virtual machine that isn't based on GPUs. In this case, all the deep learning frameworks fall back to the CPU mode.
7985

8086
[Learn more about available deep learning and AI frameworks](dsvm-tools-deep-learning-frameworks.md).
8187

8288
### Data science training and education
8389

84-
Enterprise trainers and educators who teach data science classes usually provide a virtual machine image. The image ensures students have a consistent setup and that the samples work predictably.
85-
86-
The DSVM creates an on-demand environment with a consistent setup that eases the support and incompatibility challenges. Cases where these environments need to be built frequently, especially for shorter training classes, benefit substantially.
90+
Enterprise trainers and educators who teach data science classes usually provide a virtual machine image. The image ensures that students both have a consistent setup and that the samples work predictably.
8791

92+
The DSVM creates an on-demand environment with a consistent setup, to ease the support and incompatibility challenges. Cases where these environments need to be built frequently, especially for shorter training classes, benefit substantially.
8893

89-
## What's included on the DSVM?
94+
## What does the DSVM include?
9095

91-
See a full list of tools on both the Windows and Linux DSVMs [here](tools-included.md).
96+
For more information, see this [full list of tools on both Windows and Linux DSVMs](tools-included.md).
9297

9398
## Next steps
9499

95-
Learn more with these articles:
100+
For more information, visit these resources:
96101

97102
+ Windows:
98103
+ [Set up a Windows DSVM](provision-vm.md)
