Skip to content

Commit 24ca7d0

Browse files
committed
Freshness update for dsvm-ubuntu-intro.md . . .
1 parent 024ec97 commit 24ca7d0

File tree

1 file changed

+46
-47
lines changed

1 file changed

+46
-47
lines changed

articles/machine-learning/data-science-virtual-machine/dsvm-ubuntu-intro.md

Lines changed: 46 additions & 47 deletions
Original file line numberDiff line numberDiff line change
@@ -6,50 +6,51 @@ ms.service: data-science-vm
66
author: jeff-shepherd
77
ms.author: jeffshep
88
ms.topic: quickstart
9-
ms.date: 04/18/2023
9+
ms.reviewer: franksolomon
10+
ms.date: 04/23/2024
1011
ms.custom: mode-other, linux-related-content
1112
ms.reviewer: mattmcinnes
1213
#Customer intent: As a data scientist, I want to learn how to provision the Linux DSVM so that I can move my existing workflow to the cloud.
1314
---
1415

1516
# Quickstart: Set up the Data Science Virtual Machine for Linux (Ubuntu)
1617

17-
Get up and running with the Ubuntu 20.04 Data Science Virtual Machine and Azure DSVM for PyTorch.
18+
Get up and running with the Ubuntu 20.04 Data Science Virtual Machine (DSVM) and the Azure DSVM for PyTorch.
1819

1920
## Prerequisites
2021

21-
To create an Ubuntu 20.04 Data Science Virtual Machine or an Azure DSVM for PyTorch, you must have an Azure subscription. [Try Azure for free](https://azure.com/free).
22+
You need an Azure subscription to create either an Ubuntu 20.04 Data Science Virtual Machine or an Azure DSVM for PyTorch. [Try Azure for free](https://azure.com/free).
2223

23-
>[!NOTE]
24-
>Azure free accounts don't support GPU enabled virtual machine SKUs.
24+
> [!NOTE]
25+
> Azure free accounts don't support GPU enabled virtual machine SKUs.
2526
2627
## Create your Data Science Virtual Machine for Linux
2728

28-
Here are the steps to create an instance of the Ubuntu 20.04 Data Science Virtual Machine or the Azure DSVM for PyTorch:
29+
These steps describe how to create an instance of either the Ubuntu 20.04 Data Science Virtual Machine (DSVM) or the Azure DSVM for PyTorch:
2930

30-
1. Go to the [Azure portal](https://portal.azure.com). You might be prompted to sign in to your Azure account if you're not already signed in.
31-
1. Find the virtual machine listing by typing in "data science virtual machine" and selecting "Data Science Virtual Machine- Ubuntu 20.04" or "Azure DSVM for PyTorch"
31+
1. Go to the [Azure portal](https://portal.azure.com). You might get a prompt to sign in to your Azure account if you haven't signed in yet
32+
1. Find the virtual machine listing. First type in "data science virtual machine," and then select "Data Science Virtual Machine- Ubuntu 20.04" or "Azure DSVM for PyTorch"
3233

33-
1. On the next window, select **Create**.
34+
1. At the next window, select **Create**
3435

35-
1. You should be redirected to the "Create a virtual machine" blade.
36+
1. You should be redirected to the "Create a virtual machine" blade
3637

37-
1. Enter the following information to configure each step of the wizard:
38+
1. Enter the following information, to configure each step of the wizard:
3839

3940
1. **Basics**:
4041

4142
* **Subscription**: If you have more than one subscription, select the one on which the machine will be created and billed. You must have resource creation privileges for this subscription.
42-
* **Resource group**: Create a new group or use an existing one.
43-
* **Virtual machine name**: Enter the name of the virtual machine. This name will be used in your Azure portal.
44-
* **Region**: Select the datacenter that's most appropriate. For fastest network access, it's the datacenter that has most of your data or is closest to your physical location. Learn more about [Azure Regions](https://azure.microsoft.com/global-infrastructure/regions/).
45-
* **Image**: Leave the default value.
46-
* **Size**: This option should autopopulate with a size that is appropriate for general workloads. Read more about [Linux VM sizes in Azure](../../virtual-machines/sizes.md).
43+
* **Resource group**: Create a new group, or use an existing one.
44+
* **Virtual machine name**: Enter the name of the virtual machine. This name is used in your Azure portal.
45+
* **Region**: Select the datacenter that's most appropriate. For fastest network access, this is the datacenter that has most of your data, or is closest to your physical location. For more information, visit [Azure Regions](https://azure.microsoft.com/global-infrastructure/regions/).
46+
* **Image**: Don't change the default value.
47+
* **Size**: This option should autopopulate with a size that is appropriate for general workloads. For more information, visit [Linux VM sizes in Azure](../../virtual-machines/sizes.md).
4748
* **Authentication type**: For quicker setup, select "Password."
4849

4950
> [!NOTE]
50-
> If you intend to use JupyterHub, make sure to select "Password," as JupyterHub is *not* configured to use SSH public keys.
51+
> If you plan to use JupyterHub, make sure to select "Password," because JupyterHub is *not* configured to use SSH public keys.
5152
52-
* **Username**: Enter the administrator username. You'll use this username to log into your virtual machine. This username need not be the same as your Azure username. Do *not* use capitalized letters.
53+
* **Username**: Enter the administrator username. You use this username to log into your virtual machine. This username doesn't need to match your Azure username. Do *not* use capitalized letters.
5354

5455
> [!IMPORTANT]
5556
> If you use capitalized letters in your username, JupyterHub will not work, and you'll encounter a 500 internal server error.
@@ -61,7 +62,7 @@ Here are the steps to create an instance of the Ubuntu 20.04 Data Science Virtua
6162
* Verify that all the information you entered is correct.
6263
* Select **Create**.
6364

64-
The provisioning should take about 5 minutes. The status is displayed in the Azure portal.
65+
The provisioning process should take about 5 minutes. The Azure portal displays the status.
6566

6667
## How to access the Ubuntu Data Science Virtual Machine
6768

@@ -74,80 +75,78 @@ You can access the Ubuntu DSVM in one of four ways:
7475

7576
### SSH
7677

77-
If you configured your VM with SSH authentication, you can log on using the account credentials that you created in the **Basics** section of step 3 for the text shell interface. [Learn more about connecting to a Linux VM](../../virtual-machines/linux-vm-connect.md).
78+
If you configured your VM with SSH authentication, you can sign in with the account credentials that you created in the **Basics** section of step 3 for the text shell interface. For more information, visit [Learn more about connecting to a Linux VM](../../virtual-machines/linux-vm-connect.md).
7879

7980
### xrdp
80-
xrdp is the standard tool for accessing Linux graphical sessions. While this isn't included in the distro by default, you can [install it by following these instructions](../../virtual-machines/linux/use-remote-desktop.md).
81+
xrdp is the standard tool for accessing Linux graphical sessions. While the distro doesn't include this tool by default, [these instructions](../../virtual-machines/linux/use-remote-desktop.md) explain how to install it.
8182

8283
### X2Go
8384
> [!NOTE]
84-
> The X2Go client performed better than X11 forwarding in testing. We recommend using the X2Go client for a graphical desktop interface.
85+
> In testing, the X2Go client performed better than X11 forwarding. We recommend use of the X2Go client for a graphical desktop interface.
8586
86-
The Linux VM is already provisioned with X2Go Server and ready to accept client connections. To connect to the Linux VM graphical desktop, complete the following procedure on your client:
87+
The Linux VM is already provisioned with X2Go Server and is ready to accept client connections. To connect to the Linux VM graphical desktop, complete the following procedure on your client:
8788

8889
1. Download and install the X2Go client for your client platform from [X2Go](https://wiki.x2go.org/doku.php/doc:installation:x2goclient).
89-
1. Make note of the virtual machine's public IP address, which you can find in the Azure portal by opening the virtual machine you created.
90+
1. Note the public IP address of the virtual machine. In the Azure portal, open the virtual machine you created to find this information.
9091

9192
![Ubuntu machine IP address](./media/dsvm-ubuntu-intro/ubuntu-ip-address.png)
9293

93-
1. Run the X2Go client. If the "New Session" window doesn't pop up automatically, go to Session -> New Session.
94+
1. Run the X2Go client. If the "New Session" window doesn't automatically pop up, go to Session -> New Session.
9495

95-
1. On the resulting configuration window, enter the following configuration parameters:
96+
1. On the resulting configuration window, enter these configuration parameters:
9697
* **Session tab**:
97-
* **Host**: Enter the IP address of your VM, which you made note of earlier.
98+
* **Host**: Enter the IP address of your VM, which you noted earlier.
9899
* **Login**: Enter the username on the Linux VM.
99-
* **SSH Port**: Leave it at 22, the default value.
100+
* **SSH Port**: Leave it at 22. This is the default value.
100101
* **Session Type**: Change the value to **XFCE**. Currently, the Linux VM supports only the XFCE desktop.
101102
* **Media tab**: You can turn off sound support and client printing if you don't need to use them.
102103
* **Shared folders**: Use this tab to add client machine directory that you would like to mount on the VM.
103104

104105
![X2go configuration](./media/dsvm-ubuntu-intro/x2go-ubuntu.png)
105106
1. Select **OK**.
106-
1. Click on the box in the right pane of the X2Go window to bring up the log-in screen for your VM.
107+
1. Select on the box in the right pane of the X2Go window to bring up the sign-in screen for your VM.
107108
1. Enter the password for your VM.
108109
1. Select **OK**.
109-
1. You may have to give X2Go permission to bypass your firewall to finish connecting.
110+
1. You might need to give X2Go permission to bypass your firewall to finish the connection process.
110111
1. You should now see the graphical interface for your Ubuntu DSVM.
111112

112-
113113
### JupyterHub and JupyterLab
114114

115115
The Ubuntu DSVM runs [JupyterHub](https://github.com/jupyterhub/jupyterhub), a multiuser Jupyter server. To connect, take the following steps:
116116

117-
1. Make note of the public IP address for your VM, by searching for and selecting your VM in the Azure portal.
117+
1. Note the public IP address for your VM. To find this value, search for and select your VM in the Azure portal.
118118
![Ubuntu machine IP address](./media/dsvm-ubuntu-intro/ubuntu-ip-address.png)
119119

120-
1. From your local machine, open a web browser and navigate to https:\//your-vm-ip:8000, replacing "your-vm-ip" with the IP address you took note of earlier.
121-
1. Your browser will probably prevent you from opening the page directly, telling you that there's a certificate error. The DSVM is providing security via a self-signed certificate. Most browsers will allow you to click through after this warning. Many browsers will continue to provide some kind of visual warning about the certificate throughout your Web session.
120+
1. From your local machine, open a web browser, and navigate to https:\//**your-vm-ip**:8000, replacing "**your-vm-ip**" with the IP address you noted earlier.
121+
1. Your browser will probably prevent you from opening the page directly. It might tell you that there's a certificate error. The DSVM provides security with a self-signed certificate. Most browsers will allow you to select through after this warning. Many browsers will continue to provide some kind of visual warning about the certificate throughout your Web session.
122122

123123
>[!NOTE]
124-
> If you see the `ERR_EMPTY_RESPONSE` error message in your browser, make sure you access the machine by explicitly using the *HTTPS* protocol, and not by using *HTTP* or just the web address. If you type the web address without `https://` in the address line, most browsers will default to `http`, and you will see this error.
124+
> If you see the `ERR_EMPTY_RESPONSE` error message in your browser, make sure you access the machine by explicit use of the *HTTPS* protocol. *HTTP* or just the web address don't work for this step. If you type the web address without `https://` in the address line, most browsers will default to `http`, and the error will appear.
125125
126126
1. Enter the username and password that you used to create the VM, and sign in.
127127

128128
![Enter Jupyter login](./media/dsvm-ubuntu-intro/jupyter-login.png)
129129

130130
>[!NOTE]
131-
> If you receive a 500 Error at this stage, it is likely that you used capitalized letters in your username. This is a known interaction between Jupyter Hub and the PAMAuthenticator it uses.
132-
> If you receive a "Can't reach this page" error, it is likely that your Network Security Group permissions need to be adjusted. In the Azure portal, find the Network Security Group resource within your Resource Group. To access JupyterHub from the public Internet, you must have port 8000 open. (The image shows that this VM is configured for just-in-time access, which is highly recommended. See [Secure your management ports with just-in time access](../../security-center/security-center-just-in-time.md).)
133-
> ![Configuration of Network Security Group](./media/dsvm-ubuntu-intro/nsg-permissions.png)
131+
> If you receive a 500 Error at this stage, you probably used capitalized letters in your username. This is a known interaction between Jupyter Hub and the PAMAuthenticator it uses.
132+
> If you receive a "Can't reach this page" error, it is likely that your Network Security Group permissions need adjustment. In the Azure portal, find the Network Security Group resource within your Resource Group. To access JupyterHub from the public Internet, you must have port 8000 open. (The image shows that this VM is configured for just-in-time access, which is highly recommended. For more information, visit [Secure your management ports with just-in time access](../../security-center/security-center-just-in-time.md).)
133+
>
134+
> :::image type="content" source="./media/dsvm-ubuntu-intro/nsg-permissions.png" alt-text="Screenshot of Network Security Group configuration values." lightbox= "./media/dsvm-ubuntu-intro/nsg-permissions.png":::
134135
135-
1. Browse the many sample notebooks that are available.
136+
1. Browse the available sample notebooks.
136137

137-
JupyterLab, the next generation of Jupyter notebooks and JupyterHub, is also available. To access it, sign in to JupyterHub, and then browse to the URL https:\//your-vm-ip:8000/user/your-username/lab, replacing "your-username" with the username you chose when configuring the VM. Again, you may be initially blocked from accessing the site because of a certificate error.
138+
JupyterLab, the next generation of Jupyter notebooks and JupyterHub, is also available. To access it, sign in to JupyterHub, and then browse to the URL https:\//your-vm-ip:8000/user/**your-username**/lab, replacing "**your-username**" with the username you chose when you configured the VM. Again, potential certificate errors might initially block you from accessing the site.
138139

139-
You can set JupyterLab as the default notebook server by adding this line to `/etc/jupyterhub/jupyterhub_config.py`:
140+
To set JupyterLab as the default notebook server, add this line to `/etc/jupyterhub/jupyterhub_config.py`:
140141

141142
```python
142143
c.Spawner.default_url = '/lab'
143144
```
144145

145146
## Next steps
146147

147-
Here's how you can continue your learning and exploration:
148-
149-
* The [Data science on the Data Science Virtual Machine for Linux](linux-dsvm-walkthrough.md) walkthrough shows you how to do several common data science tasks with the Linux DSVM provisioned here.
150-
* Explore the various data science tools on the DSVM by trying out the tools described in this article. You can also run `dsvm-more-info` on the shell within the virtual machine for a basic introduction and pointers to more information about the tools installed on the VM.
151-
* Learn how to systematically build analytical solutions using the [Team Data Science Process](/azure/architecture/data-science-process/overview).
148+
* The [Data science on the Data Science Virtual Machine for Linux](linux-dsvm-walkthrough.md) walkthrough shows how to do several common data science tasks with the Linux DSVM provisioned here.
149+
* Try out the tools this article describes to explore the various data science tools on the DSVM. You can also run `dsvm-more-info` on the shell within the virtual machine for a basic introduction and pointers to more information about the tools installed on the VM.
150+
* Learn how to systematically build analytical solutions with the [Team Data Science Process](/azure/architecture/data-science-process/overview).
152151
* Visit the [Azure AI Gallery](https://gallery.azure.ai/) for machine learning and data analytics samples that use the Azure AI services.
153-
* Consult the appropriate [reference documentation](./reference-ubuntu-vm.md) for this virtual machine.
152+
* Visit the appropriate [reference documentation](./reference-ubuntu-vm.md) for this virtual machine.

0 commit comments

Comments
 (0)