Skip to content

Commit 0305cc2

Browse files
authored
Merge pull request #274141 from paulth1/data-science-virtual-machine-articles
[AQ] edit pass: Data science virtual machine articles
2 parents 160528c + a3a4be4 commit 0305cc2

File tree

3 files changed

+84
-89
lines changed

3 files changed

+84
-89
lines changed

articles/machine-learning/data-science-virtual-machine/dsvm-tools-deep-learning-frameworks.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ Deep learning frameworks on the DSVM are listed here:
5252
| Supported versions | 1.9.0 (Linux, Windows 2019) |
5353
| Supported DSVM editions | Windows Server 2019<br>Linux |
5454
| How is it configured and installed on the DSVM? | Installed in Python, conda environments 'py38_default', 'py38_pytorch' |
55-
| How to run it | At the terminal, activate the appropriate environment, and then run Python.<br/>* [JupyterHub](dsvm-ubuntu-intro.md#how-to-access-the-ubuntu-data-science-virtual-machine): Connect, and then open the PyTorch directory for samples. |
55+
| How to run it | At the terminal, activate the appropriate environment, and then run Python.<br/>* [JupyterHub](dsvm-ubuntu-intro.md#access-the-ubuntu-data-science-virtual-machine): Connect, and then open the PyTorch directory for samples. |
5656

5757
## [TensorFlow](https://www.tensorflow.org/)
5858

@@ -61,4 +61,4 @@ Deep learning frameworks on the DSVM are listed here:
6161
| Supported versions | 2.5 |
6262
| Supported DSVM editions | Windows Server 2019<br>Linux |
6363
| How is it configured and installed on the DSVM? | Installed in Python, conda environments 'py38_default', 'py38_tensorflow' |
64-
| How to run it | At the terminal, activate the correct environment, and then run Python. <br/> * Jupyter: Connect to [Jupyter](provision-vm.md) or [JupyterHub](dsvm-ubuntu-intro.md#how-to-access-the-ubuntu-data-science-virtual-machine), and then open the TensorFlow directory for samples. |
64+
| How to run it | At the terminal, activate the correct environment, and then run Python. <br/> * Jupyter: Connect to [Jupyter](provision-vm.md) or [JupyterHub](dsvm-ubuntu-intro.md#access-the-ubuntu-data-science-virtual-machine), and then open the TensorFlow directory for samples. |

articles/machine-learning/data-science-virtual-machine/dsvm-ubuntu-intro.md

Lines changed: 52 additions & 57 deletions
Original file line numberDiff line numberDiff line change
@@ -20,50 +20,45 @@ Get up and running with the Ubuntu 20.04 Data Science Virtual Machine (DSVM) and
2020

2121
You need an Azure subscription to create either an Ubuntu 20.04 Data Science Virtual Machine or an Azure DSVM for PyTorch. [Try Azure for free](https://azure.com/free).
2222

23-
> [!NOTE]
24-
> Azure free accounts don't support GPU enabled virtual machine SKUs.
23+
Azure free accounts don't support GPU-enabled virtual machine (VM) SKUs.
2524

2625
## Create your Data Science Virtual Machine for Linux
2726

28-
These steps describe how to create an instance of either the Ubuntu 20.04 Data Science Virtual Machine (DSVM) or the Azure DSVM for PyTorch:
29-
30-
1. Go to the [Azure portal](https://portal.azure.com). You might get a prompt to sign in to your Azure account if you haven't signed in yet
31-
1. Find the virtual machine listing. First type in "data science virtual machine," and then select "Data Science Virtual Machine- Ubuntu 20.04" or "Azure DSVM for PyTorch"
32-
33-
1. At the next window, select **Create**
27+
To create an instance of either the Ubuntu 20.04 DSVM or the Azure DSVM for PyTorch:
3428

35-
1. You should be redirected to the "Create a virtual machine" blade
29+
1. Go to the [Azure portal](https://portal.azure.com). You might get a prompt to sign in to your Azure account if you haven't signed in yet.
30+
1. Find the VM listing by entering **data science virtual machine**. Then select **Data Science Virtual Machine- Ubuntu 20.04** or **Azure DSVM for PyTorch**.
3631

37-
1. Enter the following information, to configure each step of the wizard:
32+
1. Select **Create**.
3833

39-
1. **Basics**:
34+
1. On the **Create a virtual machine** pane, fill in the **Basics** tab:
4035

41-
* **Subscription**: If you have more than one subscription, select the one on which the machine will be created and billed. You must have resource creation privileges for this subscription.
42-
* **Resource group**: Create a new group, or use an existing one.
43-
* **Virtual machine name**: Enter the name of the virtual machine. This name is used in your Azure portal.
44-
* **Region**: Select the datacenter that's most appropriate. For fastest network access, this is the datacenter that has most of your data, or is closest to your physical location. For more information, visit [Azure Regions](https://azure.microsoft.com/global-infrastructure/regions/).
45-
* **Image**: Don't change the default value.
46-
* **Size**: This option should autopopulate with a size that is appropriate for general workloads. For more information, visit [Linux VM sizes in Azure](../../virtual-machines/sizes.md).
47-
* **Authentication type**: For quicker setup, select "Password."
36+
* **Subscription**: If you have more than one subscription, select the one on which the machine will be created and billed. You must have resource creation privileges for this subscription.
37+
* **Resource group**: Create a new group or use an existing one.
38+
* **Virtual machine name**: Enter the name of the VM. This name is used in your Azure portal.
39+
* **Region**: Select the datacenter that's most appropriate. For fastest network access, the datacenter that hosts most of your data or is located closest to your physical location is the best choice. For more information, refer to [Azure regions](https://azure.microsoft.com/global-infrastructure/regions/).
40+
* **Image**: Don't change the default value.
41+
* **Size**: This option should autopopulate with a size that's appropriate for general workloads. For more information, refer to [Linux VM sizes in Azure](../../virtual-machines/sizes.md).
42+
* **Authentication type**: For quicker setup, select **Password**.
4843

4944
> [!NOTE]
50-
> If you plan to use JupyterHub, make sure to select "Password," because JupyterHub is *not* configured to use SSH public keys.
45+
> If you plan to use JupyterHub, make sure to select **Password** because JupyterHub is *not* configured to use Secure Shell (SSH) Protocol public keys.
5146
52-
* **Username**: Enter the administrator username. You use this username to log into your virtual machine. This username doesn't need to match your Azure username. Do *not* use capitalized letters.
47+
* **Username**: Enter the administrator username. You use this username to sign in to your VM. It doesn't need to match your Azure username. Don't use capital letters.
5348

5449
> [!IMPORTANT]
55-
> If you use capitalized letters in your username, JupyterHub will not work, and you'll encounter a 500 internal server error.
50+
> If you use capital letters in your username, JupyterHub won't work, and you'll encounter a 500 internal server error.
5651
57-
* **Password**: Enter the password you'll use to log into your virtual machine.
52+
* **Password**: Enter the password you plan to use to sign in to your VM.
5853

59-
1. Select **Review + create**.
60-
1. **Review+create**
54+
1. Select **Review + create**.
55+
1. On the **Review + create** pane:
6156
* Verify that all the information you entered is correct.
6257
* Select **Create**.
6358

64-
The provisioning process should take about 5 minutes. The Azure portal displays the status.
59+
The provisioning process takes about 5 minutes. You can view the status of your VM in the Azure portal.
6560

66-
## How to access the Ubuntu Data Science Virtual Machine
61+
## Access the Ubuntu Data Science Virtual Machine
6762

6863
You can access the Ubuntu DSVM in one of four ways:
6964

@@ -74,10 +69,10 @@ You can access the Ubuntu DSVM in one of four ways:
7469

7570
### SSH
7671

77-
If you configured your VM with SSH authentication, you can sign in with the account credentials that you created in the **Basics** section of step 3 for the text shell interface. For more information, visit [Learn more about connecting to a Linux VM](../../virtual-machines/linux-vm-connect.md).
72+
If you configured your VM with SSH authentication, you can sign in with the account credentials that you created in the **Basics** section of step 4 for the text shell interface. For more information, refer to [Learn more about connecting to a Linux VM](../../virtual-machines/linux-vm-connect.md).
7873

7974
### xrdp
80-
xrdp is the standard tool for accessing Linux graphical sessions. While the distro doesn't include this tool by default, [these instructions](../../virtual-machines/linux/use-remote-desktop.md) explain how to install it.
75+
The standard tool for accessing Linux graphical sessions is xrdp. While the distribution doesn't include this tool by default, [these instructions](../../virtual-machines/linux/use-remote-desktop.md) explain how to install it.
8176

8277
### X2Go
8378
> [!NOTE]
@@ -86,56 +81,56 @@ xrdp is the standard tool for accessing Linux graphical sessions. While the dist
8681
The Linux VM is already provisioned with X2Go Server and is ready to accept client connections. To connect to the Linux VM graphical desktop, complete the following procedure on your client:
8782

8883
1. Download and install the X2Go client for your client platform from [X2Go](https://wiki.x2go.org/doku.php/doc:installation:x2goclient).
89-
1. Note the public IP address of the virtual machine. In the Azure portal, open the virtual machine you created to find this information.
84+
1. Note the public IP address of the VM. In the Azure portal, open the VM you created to find this information.
9085

91-
:::image type="content" source="./media/dsvm-ubuntu-intro/ubuntu-ip-address.png" alt-text="Screenshot showing the public IP address of the virtual machine." lightbox= "./media/dsvm-ubuntu-intro/ubuntu-ip-address.png":::
86+
:::image type="content" source="./media/dsvm-ubuntu-intro/ubuntu-ip-address.png" alt-text="Screenshot that shows the public IP address of the VM." lightbox= "./media/dsvm-ubuntu-intro/ubuntu-ip-address.png":::
9287

93-
1. Run the X2Go client. If the "New Session" window doesn't automatically pop up, go to Session -> New Session.
88+
1. Run the X2Go client. If the **New Session** pane doesn't automatically pop up, select **Session** > **New Session**.
9489

95-
1. On the resulting configuration window, enter these configuration parameters:
96-
* **Session tab**:
90+
1. On the resulting configuration pane, enter these configuration parameters:
91+
* **Session**:
9792
* **Host**: Enter the IP address of your VM, which you noted earlier.
9893
* **Login**: Enter the username on the Linux VM.
99-
* **SSH Port**: Leave it at 22. This is the default value.
100-
* **Session Type**: Change the value to **XFCE**. Currently, the Linux VM supports only the XFCE desktop.
101-
* **Media tab**: You can turn off sound support and client printing if you don't need to use them.
102-
* **Shared folders**: Use this tab to add client machine directory that you would like to mount on the VM.
94+
* **SSH port**: Leave it at the default value **22**.
95+
* **Session type**: Change the value to **XFCE**. Currently, the Linux VM supports only the XFCE desktop.
96+
* **Media**: You can turn off sound support and client printing if you don't need to use them.
97+
* **Shared folders**: Use this tab to add the client machine directory that you want to mount on the VM.
10398

104-
:::image type="content" source="./media/dsvm-ubuntu-intro/x2go-ubuntu.png" alt-text="Screenshot showing preferences for a new X2Go session." lightbox= "./media/dsvm-ubuntu-intro/x2go-ubuntu.png":::
99+
:::image type="content" source="./media/dsvm-ubuntu-intro/x2go-ubuntu.png" alt-text="Screenshot that shows preferences for a new X2Go session." lightbox= "./media/dsvm-ubuntu-intro/x2go-ubuntu.png":::
105100
1. Select **OK**.
106-
1. Select on the box in the right pane of the X2Go window to bring up the sign-in screen for your VM.
101+
1. Select the box in the right pane of the X2Go pane to bring up the sign-in pane for your VM.
107102
1. Enter the password for your VM.
108103
1. Select **OK**.
109104
1. You might need to give X2Go permission to bypass your firewall to finish the connection process.
110105
1. You should now see the graphical interface for your Ubuntu DSVM.
111106

112107
### JupyterHub and JupyterLab
113108

114-
The Ubuntu DSVM runs [JupyterHub](https://github.com/jupyterhub/jupyterhub), a multiuser Jupyter server. To connect, take the following steps:
109+
The Ubuntu DSVM runs [JupyterHub](https://github.com/jupyterhub/jupyterhub), which is a multiuser Jupyter server. To connect, follow these steps:
115110

116-
1. Note the public IP address of your VM. To find this value, search for and select your VM in the Azure portal, as shown in this screenshot:
111+
1. Note the public IP address of your VM. To find this value, search for and select your VM in the Azure portal, as shown in this screenshot.
117112

118-
:::image type="content" source="./media/dsvm-ubuntu-intro/ubuntu-ip-address.png" alt-text="Screenshot highlighting the public IP address of your VM." lightbox= "./media/dsvm-ubuntu-intro/ubuntu-ip-address.png":::
113+
:::image type="content" source="./media/dsvm-ubuntu-intro/ubuntu-ip-address.png" alt-text="Screenshot that shows the public IP address of your VM." lightbox= "./media/dsvm-ubuntu-intro/ubuntu-ip-address.png":::
119114

120-
1. From your local machine, open a web browser, and navigate to https:\//**your-vm-ip**:8000, replacing "**your-vm-ip**" with the IP address you noted earlier.
121-
1. Your browser will probably prevent you from opening the page directly. It might tell you that there's a certificate error. The DSVM provides security with a self-signed certificate. Most browsers will allow you to select through after this warning. Many browsers will continue to provide some kind of visual warning about the certificate throughout your Web session.
115+
1. From your local machine, open a web browser and go to `https://your-vm-ip:8000`. Replace **your-vm-ip** with the IP address you noted earlier.
116+
1. Your browser will probably prevent you from opening the pane directly. It might tell you that there's a certificate error. The DSVM provides security with a self-signed certificate. Most browsers will allow you to select through after this warning. Many browsers will continue to provide some kind of visual warning about the certificate throughout your web session.
122117

123-
>[!NOTE]
124-
> If you see the `ERR_EMPTY_RESPONSE` error message in your browser, make sure you access the machine by explicit use of the *HTTPS* protocol. *HTTP* or just the web address don't work for this step. If you type the web address without `https://` in the address line, most browsers will default to `http`, and the error will appear.
118+
If you see the `ERR_EMPTY_RESPONSE` error message in your browser, make sure you access the machine by explicit use of the *HTTPS* protocol. *HTTP* or just the web address don't work for this step. If you enter the web address without `https://` in the address line, most browsers will default to `http` and the error will appear.
125119

126-
1. Enter the username and password that you used to create the VM, and sign in, as shown in this screenshot
120+
1. Enter the username and password that you used to create the VM and sign in, as shown in this screenshot.
127121

128-
:::image type="content" source="./media/dsvm-ubuntu-intro/jupyter-login.png" alt-text="Screenshot the JupyterHub sign-in screen." lightbox= "./media/dsvm-ubuntu-intro/jupyter-login.png":::
122+
:::image type="content" source="./media/dsvm-ubuntu-intro/jupyter-login.png" alt-text="Screenshot that shows the JupyterHub sign-in pane." lightbox= "./media/dsvm-ubuntu-intro/jupyter-login.png":::
129123

130-
>[!NOTE]
131-
> If you receive a 500 Error at this stage, you probably used capitalized letters in your username. This is a known interaction between Jupyter Hub and the PAMAuthenticator it uses.
132-
> If you receive a "Can't reach this page" error, it is likely that your Network Security Group permissions need adjustment. In the Azure portal, find the Network Security Group resource within your Resource Group. To access JupyterHub from the public Internet, you must have port 8000 open. (The image shows that this VM is configured for just-in-time access, which is highly recommended. For more information, visit [Secure your management ports with just-in time access](../../security-center/security-center-just-in-time.md).)
124+
125+
If you receive a 500 error at this stage, you probably used capital letters in your username. This issue is a known interaction between JupyterHub and the PAM authenticator it uses.
126+
127+
If you receive a "Can't reach this page" error, it's likely that your network security group (NSG) permissions need adjustment. In the Azure portal, find the NSG resource within your resource group. To access JupyterHub from the public internet, you must have port 8000 open. (The image shows that this VM is configured for just-in-time access, which we highly recommend. For more information, refer to [Secure your management ports with just-in time access](../../security-center/security-center-just-in-time.md).)
133128
>
134-
> :::image type="content" source="./media/dsvm-ubuntu-intro/nsg-permissions.png" alt-text="Screenshot of Network Security Group configuration values." lightbox= "./media/dsvm-ubuntu-intro/nsg-permissions.png":::
129+
> :::image type="content" source="./media/dsvm-ubuntu-intro/nsg-permissions.png" alt-text="Screenshot that shows NSG configuration values." lightbox= "./media/dsvm-ubuntu-intro/nsg-permissions.png":::
135130
136131
1. Browse the available sample notebooks.
137132

138-
JupyterLab, the next generation of Jupyter notebooks and JupyterHub, is also available. To access it, sign in to JupyterHub, and then browse to the URL https:\//your-vm-ip:8000/user/**your-username**/lab, replacing "**your-username**" with the username you chose when you configured the VM. Again, potential certificate errors might initially block you from accessing the site.
133+
JupyterLab, the next generation of Jupyter notebooks and JupyterHub, is also available. To access it, sign in to JupyterHub. Then browse to the URL `https://your-vm-ip:8000/user/your-username/lab`. Replace **your-username** with the username you chose when you configured the VM. Again, potential certificate errors might initially block you from accessing the site.
139134

140135
To set JupyterLab as the default notebook server, add this line to `/etc/jupyterhub/jupyterhub_config.py`:
141136

@@ -145,8 +140,8 @@ c.Spawner.default_url = '/lab'
145140

146141
## Next steps
147142

148-
* The [Data science on the Data Science Virtual Machine for Linux](linux-dsvm-walkthrough.md) walkthrough shows how to do several common data science tasks with the Linux DSVM provisioned here.
149-
* Try out the tools this article describes to explore the various data science tools on the DSVM. You can also run `dsvm-more-info` on the shell within the virtual machine for a basic introduction and pointers to more information about the tools installed on the VM.
143+
* See the [Data science on the Data Science Virtual Machine for Linux](linux-dsvm-walkthrough.md) walkthrough to learn how to do several common data science tasks with the Linux DSVM provisioned here.
144+
* Try out the tools this article describes to explore the various data science tools on the DSVM. You can also run `dsvm-more-info` on the shell within the VM for a basic introduction and pointers to more information about the tools installed on the VM.
150145
* Learn how to systematically build analytical solutions with the [Team Data Science Process](/azure/architecture/data-science-process/overview).
151-
* Visit the [Azure AI Gallery](https://gallery.azure.ai/) for machine learning and data analytics samples that use the Azure AI services.
152-
* Visit the appropriate [reference documentation](./reference-ubuntu-vm.md) for this virtual machine.
146+
* See the [Azure AI Gallery](https://gallery.azure.ai/) for machine learning and data analytics samples that use the Azure AI services.
147+
* See the appropriate [reference documentation](./reference-ubuntu-vm.md) for this VM.

0 commit comments

Comments
 (0)