Skip to content

Commit ac60ccf

Browse files
Merge pull request #593 from sdgilley/sdg-freshness2
freshness - concept-compute-instance
2 parents e679ca6 + 6d49e79 commit ac60ccf

File tree

1 file changed

+17
-16
lines changed

1 file changed

+17
-16
lines changed

articles/machine-learning/concept-compute-instance.md

Lines changed: 17 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -5,18 +5,18 @@ description: Learn about the Azure Machine Learning compute instance, a fully ma
55
services: machine-learning
66
ms.service: azure-machine-learning
77
ms.subservice: compute
8-
ms.topic: conceptual
8+
ms.topic: concept-article
99
ms.author: sgilley
1010
author: sdgilley
1111
ms.reviewer: vijetaj
12-
ms.date: 01/17/2024
12+
ms.date: 10/01/2024
1313
monikerRange: 'azureml-api-2 || azureml-api-1'
1414
#Customer intent: As a data scientist, I want to know what a compute instance is and how to use it for Azure Machine Learning.
1515
---
1616

1717
# What is an Azure Machine Learning compute instance?
1818

19-
An Azure Machine Learning compute instance is a managed cloud-based workstation for data scientists. Each compute instance has only one owner, although you can share files between multiple compute instances.
19+
An Azure Machine Learning compute instance is a managed cloud-based workstation for data scientists. Each compute instance has only one owner, although you can share files between multiple compute instances.
2020

2121
Compute instances make it easy to get started with Azure Machine Learning development and provide management and enterprise readiness capabilities for IT administrators.
2222

@@ -37,7 +37,7 @@ A compute instance is a fully managed cloud-based workstation optimized for your
3737
|----|----|
3838
|Productivity|You can build and deploy models using integrated notebooks and the following tools in Azure Machine Learning studio:<br/>- Jupyter<br/>- JupyterLab<br/>- VS Code (preview)<br/>Compute instance is fully integrated with Azure Machine Learning workspace and studio. You can share notebooks and data with other data scientists in the workspace.<br/>
3939
|Managed & secure|Reduce your security footprint and add compliance with enterprise security requirements. Compute instances provide robust management policies and secure networking configurations such as:<br/><br/>- Autoprovisioning from Resource Manager templates or Azure Machine Learning SDK<br/>- [Azure role-based access control (Azure RBAC)](/azure/role-based-access-control/overview)<br/>- [Virtual network support](./how-to-secure-training-vnet.md)<br/> - Azure policy to disable SSH access<br/> - Azure policy to enforce creation in a virtual network <br/> - Auto-shutdown/auto-start based on schedule <br/>- TLS 1.2 enabled |
40-
|Preconfigured&nbsp;for&nbsp;ML|Save time on setup tasks with pre-configured and up-to-date ML packages, deep learning frameworks, GPU drivers.|
40+
|Preconfigured&nbsp;for&nbsp;ML|Save time on setup tasks with preconfigured and up-to-date ML packages, deep learning frameworks, GPU drivers.|
4141
|Fully customizable|Broad support for Azure VM types including GPUs and persisted low-level customization such as installing packages and drivers makes advanced scenarios a breeze. You can also use setup scripts to automate customization |
4242

4343
* Secure your compute instance with **[No public IP](./how-to-secure-training-vnet.md)**.
@@ -53,7 +53,7 @@ Azure Machine Learning compute instance enables you to author, train, and deploy
5353

5454
You can run notebooks from [your Azure Machine Learning workspace](./how-to-run-jupyter-notebooks.md), [Jupyter](https://jupyter.org/), [JupyterLab](https://jupyterlab.readthedocs.io), or [Visual Studio Code](./how-to-launch-vs-code-remote.md). VS Code Desktop can be configured to access your compute instance. Or use VS Code for the Web, directly from the browser, and without any required installations or dependencies.
5555

56-
We recommend you try VS Code for the Web to take advantage of the easy integration and rich development environment it provides. VS Code for the Web gives you many of the features of VS Code Desktop that you love, including search and syntax highlighting while browsing and editing. For more information about using VS Code Desktop and VS Code for the Web, see [Launch Visual Studio Code integrated with Azure Machine Learning (preview)](how-to-launch-vs-code-remote.md) and [Work in VS Code remotely connected to a compute instance (preview)](how-to-work-in-vs-code-remote.md).
56+
We recommend you try VS Code for the Web to take advantage of the easy integration and rich development environment it provides. VS Code for the Web gives you many of the features of VS Code Desktop that you love, including search and syntax highlighting while browsing and editing. For more information about using VS Code Desktop and VS Code for the Web, see [Launch Visual Studio Code integrated with Azure Machine Learning (preview)](how-to-launch-vs-code-remote.md) and [Work in VS Code remotely connected to a compute instance (preview)](how-to-work-in-vs-code-remote.md).
5757

5858
You can [install packages](how-to-access-terminal.md#install-packages) and [add kernels](how-to-access-terminal.md#add-new-kernels) to your compute instance.
5959

@@ -81,7 +81,7 @@ You can [Add RStudio or Posit Workbench (formerly RStudio Workbench)](how-to-cre
8181
|Anaconda Python||
8282
|Jupyter and extensions||
8383
|Jupyterlab and extensions||
84-
[Azure Machine Learning SDK <br/> for Python](https://aka.ms/sdk-v2-install) from PyPI|Includes azure-ai-ml and many common azure extra packages. To see the full list, <br/> [open a terminal window on your compute instance](how-to-access-terminal.md) and run <br/> `conda list -n azureml_py310_sdkv2 ^azure` |
84+
[Azure Machine Learning SDK <br/> for Python](https://aka.ms/sdk-v2-install) from PyPI|Includes azure-ai-ml and many common Azure extra packages. To see the full list, <br/> [open a terminal window on your compute instance](how-to-access-terminal.md) and run <br/> `conda list -n azureml_py310_sdkv2 ^azure` |
8585
|Other PyPI packages|`jupytext`</br>`tensorboard`</br>`nbconvert`</br>`notebook`</br>`Pillow`|
8686
|Conda packages|`cython`</br>`numpy`</br>`ipykernel`</br>`scikit-learn`</br>`matplotlib`</br>`tqdm`</br>`joblib`</br>`nodejs`|
8787
|Deep learning packages|`PyTorch`</br>`TensorFlow`</br>`Keras`</br>`Horovod`</br>`MLFlow`</br>`pandas-ml`</br>`scrapbook`|
@@ -92,26 +92,27 @@ The compute instance has Ubuntu as the base OS.
9292

9393
## Accessing files
9494

95-
Notebooks and Python scripts are stored in the default storage account of your workspace in Azure file share. These files are located under your "User files" directory. This storage makes it easy to share notebooks between compute instances. The storage account also keeps your notebooks safely preserved when you stop or delete a compute instance.
95+
Notebooks and Python scripts are stored in the default storage account of your workspace in Azure file share. These files are located under your "User files" directory. This storage makes it easy to share notebooks between compute instances. The storage account also keeps your notebooks safely preserved when you stop or delete a compute instance.
9696

9797
The Azure file share account of your workspace is mounted as a drive on the compute instance. This drive is the default working directory for Jupyter, Jupyter Labs, RStudio, and Posit Workbench. This means that the notebooks and other files you create in Jupyter, JupyterLab, VS Code for Web, RStudio, or Posit are automatically stored on the file share and available to use in other compute instances as well.
9898

99-
The files in the file share are accessible from all compute instances in the same workspace. Any changes to these files on the compute instance will be reliably persisted back to the file share.
99+
The files in the file share are accessible from all compute instances in the same workspace. Any changes to these files on the compute instance are reliably persisted back to the file share.
100100

101101
You can also clone the latest Azure Machine Learning samples to your folder under the user files directory in the workspace file share.
102102

103-
Writing small files can be slower on network drives than writing to the compute instance local disk itself. If you're writing many small files, try using a directory directly on the compute instance, such as a `/tmp` directory. Note these files won't be accessible from other compute instances.
103+
Writing small files can be slower on network drives than writing to the compute instance local disk itself. If you're writing many small files, try using a directory directly on the compute instance, such as a `/tmp` directory. Note files on the compute instance aren't accessible from other compute instances.
104104

105-
Don't store training data on the notebooks file share. For information on the various options to store data, see [Access data in a job](how-to-read-write-data-v2.md).
105+
Don't store training data on the notebooks file share. For information on the various options to store data, see [Access data in a job](how-to-read-write-data-v2.md).
106106

107-
You can use the `/tmp` directory on the compute instance for your temporary data. However, don't write large files of data on the OS disk of the compute instance. OS disk on compute instance has 120-GB capacity. You can also store temporary training data on temporary disk mounted on /mnt. Temporary disk size is based on the VM size chosen and can store larger amounts of data if a higher size VM is chosen. Any software packages you install are saved on the OS disk of compute instance. Note customer managed key encryption is currently not supported for OS disk. The OS disk for compute instance is encrypted with Microsoft-managed keys.
107+
You can use the `/tmp` directory on the compute instance for your temporary data. However, don't write large files of data on the OS disk of the compute instance. OS disk on compute instance has 120-GB capacity. You can also store temporary training data on temporary disk mounted on /mnt. Temporary disk size is based on the VM size chosen and can store larger amounts of data if a higher size VM is chosen. Any software packages you install are saved on the OS disk of compute instance. Note customer managed key encryption is currently not supported for OS disk. The OS disk for compute instance is encrypted with Microsoft-managed keys.
108108

109109
:::moniker range="azureml-api-1"
110110
You can also mount [datastores and datasets](v1/concept-azure-machine-learning-architecture.md?view=azureml-api-1&preserve-view=true#datasets-and-datastores).
111111
:::moniker-end
112+
112113
## Create
113114

114-
Follow the steps in [Create resources you need to get started](quickstart-create-resources.md) to create a basic compute instance.
115+
Follow the steps in [Create resources you need to get started](quickstart-create-resources.md) to create a basic compute instance.
115116

116117
For more options, see [create a new compute instance](how-to-create-compute-instance.md?tabs=azure-studio#create).
117118

@@ -125,28 +126,28 @@ Other ways to create a compute instance:
125126
* With [Azure Machine Learning SDK](how-to-create-compute-instance.md?tabs=python#create)
126127
* From the [CLI extension for Azure Machine Learning](how-to-create-compute-instance.md?tabs=azure-cli#create)
127128

128-
The dedicated cores per region per VM family quota and total regional quota, which applies to compute instance creation, is unified and shared with Azure Machine Learning training compute cluster quota. Stopping the compute instance doesn't release quota to ensure you'll be able to restart the compute instance. Don't stop the compute instance through the OS terminal by doing a sudo shutdown.
129+
The dedicated cores per region per VM family quota and total regional quota, which applies to compute instance creation, is unified and shared with Azure Machine Learning training compute cluster quota. Stopping the compute instance doesn't release quota to ensure you are able to restart the compute instance. Don't stop the compute instance through the OS terminal by doing a sudo shutdown.
129130

130131
Compute instance comes with P10 OS disk. Temp disk type depends on the VM size chosen. Currently, it isn't possible to change the OS disk type.
131132

132133

133134
## Compute target
134135

135-
Compute instances can be used as a [training compute target](concept-compute-target.md#training-compute-targets) similar to Azure Machine Learning [compute training clusters](how-to-create-attach-compute-cluster.md). But a compute instance has only a single node, while a compute cluster can have more nodes.
136+
Compute instances can be used as a [training compute target](concept-compute-target.md#training-compute-targets) similar to Azure Machine Learning [compute training clusters](how-to-create-attach-compute-cluster.md). But a compute instance has only a single node, while a compute cluster can have more nodes.
136137

137138
A compute instance:
138139

139140
* Has a job queue.
140141
* Runs jobs securely in a virtual network environment, without requiring enterprises to open up SSH port. The job executes in a containerized environment and packages your model dependencies in a Docker container.
141-
* Can run multiple small jobs in parallel. One job per vCPU can run in parallel while the rest of the jobs are queued.
142+
* Can run multiple small jobs in parallel. One job per vCPU can run in parallel while the rest of the jobs are queued.
142143
* Supports single-node multi-GPU [distributed training](how-to-train-distributed-gpu.md) jobs
143144

144145
You can use compute instance as a local inferencing deployment target for test/debug scenarios.
145146

146147
> [!TIP]
147148
> The compute instance has 120GB OS disk. If you run out of disk space and get into an unusable state, please clear at least 5 GB disk space on OS disk (mounted on /) through the compute instance terminal by removing files/folders and then do `sudo reboot`. Temporary disk will be freed after restart; you do not need to clear space on temp disk manually. To access the terminal go to compute list page or compute instance details page and click on **Terminal** link. You can check available disk space by running `df -h` on the terminal. Clear at least 5 GB space before doing `sudo reboot`. Please do not stop or restart the compute instance through the Studio until 5 GB disk space has been cleared. Auto shutdowns, including scheduled start or stop as well as idle shutdowns, will not work if the CI disk is full.
148149
149-
## Next steps
150+
## Related content
150151

151152
* [Create resources you need to get started](quickstart-create-resources.md).
152153
* [Tutorial: Train your first ML model](tutorial-1st-experiment-sdk-train.md) shows how to use a compute instance with an integrated notebook.

0 commit comments

Comments
 (0)