You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-share-data-across-workspaces-with-registries.md
+11-11Lines changed: 11 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,14 +8,14 @@ ms.subservice: mlops
8
8
ms.author: larryfr
9
9
author: Blackmist
10
10
ms.reviewer: kritifaujdar
11
-
ms.date: 04/09/2024
11
+
ms.date: 04/07/2025
12
12
ms.topic: how-to
13
13
ms.custom: devx-track-azurecli, sdkv2, build-2023
14
14
---
15
15
16
16
# Share data across workspaces with registries (preview)
17
17
18
-
Azure Machine Learning registry enables you to collaborate across workspaces within your organization. Using registries, you can share models, components, environments and data. Sharing data with registries is currently a preview feature. In this article, you learn how to:
18
+
Azure Machine Learning registry enables you to collaborate across workspaces within your organization. Using registries, you can share models, components, environments, and data. Sharing data with registries is currently a preview feature. In this article, you learn how to:
19
19
20
20
* Create a data asset in the registry.
21
21
* Share an existing data asset from workspace to registry
@@ -58,7 +58,7 @@ You can create three data asset types:
58
58
When you create a data asset, you must specify a **path** parameter that points to the data location. Currently, the only supported paths are to locations on your local computer.
59
59
60
60
> [!TIP]
61
-
> "Local" means the local storage for the computer you are using. For example, if you're using a laptop, the local drive. If an Azure Machine Learning compute instance, the "local" drive of the compute instance.
61
+
> "Local" means the local storage for the computer you're using. For example, if you're using a laptop, the local drive. If you use an Azure Machine Learning compute instance, then the "local" drive of the compute instance.
62
62
63
63
64
64
## Prerequisites
@@ -83,7 +83,7 @@ Before following the steps in this article, make sure you have the following pre
83
83
To install the Azure CLI and extension, see [Install, set up, and use the CLI (v2)](how-to-configure-cli.md).
84
84
85
85
> [!IMPORTANT]
86
-
> * The CLI examples in this article assume that you are using the Bash (or compatible) shell. For example, from a Linux system or [Windows Subsystem for Linux](/windows/wsl/about).
86
+
> * The CLI examples in this article assume that you're using the Bash (or compatible) shell. For example, from a Linux system or [Windows Subsystem for Linux](/windows/wsl/about).
87
87
> * The examples also assume that you have configured defaults for the Azure CLI so that you don't have to specify the parameters for your subscription, workspace, resource group, or location. To set default settings, use the following commands. Replace the following parameters with the values for your configuration:
88
88
>
89
89
> * Replace `<subscription>` with your Azure subscription ID.
@@ -203,7 +203,7 @@ az ml data show --name transformed-nyc-taxt-data --version 1 --registry-name <re
203
203
# [Python SDK](#tab/python)
204
204
205
205
> [!TIP]
206
-
> The same `MLClient.environmentsdata.create_or_update()` can be used to create data in either a workspace or a registry depending on the target it has been initialized with. Since you work wth both workspace and registry in this document, you have initialized `ml_client_workspace` and `ml_client_registry` to work with workspace and registry respectively.
206
+
> The same `MLClient.environmentsdata.create_or_update()` can be used to create data in either a workspace or a registry depending on the target it has been initialized with. Since you work with both workspace and registry in this document, you have initialized `ml_client_workspace` and `ml_client_registry` to work with workspace and registry respectively.
207
207
208
208
209
209
The source data directory `data_transformed` is available in`cli/jobs/pipelines-with-components/nyc_taxi_data_regression/`. Initialize the data object and create the data.
@@ -267,7 +267,7 @@ jobs:
267
267
```
268
268
269
269
> [!WARNING]
270
-
> * Before running the pipeline job, confirm that the workspace in which you will run the job is in an Azure region that is supported by the registry in which you created the data.
270
+
> * Before running the pipeline job, confirm that the workspace in which you run the job is in an Azure region that is supported by the registry in which you created the data.
271
271
> * Confirm that the workspace has a compute cluster with the name `cpu-cluster` or edit the `compute` field under `jobs.train_job.compute` with the name of your compute.
272
272
273
273
Run the pipeline job with the `az ml job create` command.
@@ -277,7 +277,7 @@ az ml job create --file single-job-pipeline.yml
277
277
```
278
278
279
279
> [!TIP]
280
-
> If you have not configured the default workspace and resource group as explained in the prerequisites section, you will need to specify the `--workspace-name` and `--resource-group` parameters for the `az ml job create` to work.
280
+
> If you haven't configured the default workspace and resource group as explained in the prerequisites section, you need to specify the `--workspace-name` and `--resource-group` parameters for the `az ml job create` to work.
281
281
282
282
For more information on running jobs, see the following articles:
283
283
@@ -306,7 +306,7 @@ print(pipeline_job)
306
306
```
307
307
308
308
> [!WARNING]
309
-
> * Confirm that the workspace in which you will run this job is in an Azure location that is supported by the registry in which you created the component before you run the pipeline job.
309
+
>* Confirm that the workspace in which you run this job is in an Azure location that is supported by the registry in which you created the component before you run the pipeline job.
310
310
>* Confirm that the workspace has a compute cluster with the name `cpu-cluster` or update it `pipeline_job.settings.default_compute=<compute-cluster-name>`.
311
311
312
312
Run the pipeline job and waitfor it to complete.
@@ -321,7 +321,7 @@ pipeline_job
321
321
```
322
322
323
323
> [!TIP]
324
-
> Notice that you are using `ml_client_workspace` to run the pipeline job whereas you had used `ml_client_registry` to use create environment and component.
324
+
> Notice that you're using `ml_client_workspace` to run the pipeline job whereas you had used `ml_client_registry` to use create environment and component.
325
325
326
326
Since the component used in the training job is shared through a registry, you can submit the job to any workspace that you have access to in your organization, even across different subscriptions. For example, if you have `dev-workspace`, `test-workspace` and `prod-workspace`, you can connect to those workspaces and resubmit the job.
327
327
@@ -356,7 +356,7 @@ az ml data create -f local-folder.yml
356
356
357
357
For more information on creating data assets in a workspace, see [How to create data assets](how-to-create-data-assets.md).
358
358
359
-
The data asset created in the workspace can be shared to a registry. From the registry, it can be used in multiple workspaces. Note that we are passing `--share_with_name` and `--share_with_version` parameter in share function. These parameters are optional and if you do not pass these data will be shared with same name and version as in workspace.
359
+
The data asset created in the workspace can be shared to a registry. From the registry, it can be used in multiple workspaces. We are passing `--share_with_name` and `--share_with_version` parameter in share function. These parameters are optional and if you don't pass these data will be shared with same name and version as in workspace.
360
360
361
361
The following example demonstrates using share command to share a data asset. Replace `<registry-name>` with the name of the registry that the data will be shared to.
362
362
@@ -383,7 +383,7 @@ For more information on creating data assets in a workspace, see [How to create
383
383
384
384
The data asset created in workspace can be shared to a registry and it can be used in multiple workspaces from there. You can also change the name and version when sharing the data from workspace to registry.
385
385
386
-
Note that we are passing `share_with_name` and `share_with_version` parameter in share function. These parameters are optional and if you do not pass these data will be shared with same name and version as in workspace.
386
+
We are passing `share_with_name` and `share_with_version` parameter in share function. These parameters are optional and if you don't pass these data will be shared with same name and version as in workspace.
0 commit comments