Skip to content

Commit 4007583

Browse files
committed
registered datasets
1 parent 2ddbfcb commit 4007583

File tree

1 file changed

+5
-2
lines changed

1 file changed

+5
-2
lines changed

articles/machine-learning/service/how-to-create-register-datasets.md

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -106,7 +106,7 @@ titanic_ds.take(3).to_pandas_dataframe()
106106

107107
#### Workspace landing page
108108

109-
Sign in to the [workspace landing page](https://ml.azure.com) to create a a dataset via the web experience. Currently, the workspace landing page only supports the creation of TabularDatasets.
109+
Sign in to the [workspace landing page](https://ml.azure.com) to create a dataset via the web experience. Currently, the workspace landing page only supports the creation of TabularDatasets.
110110

111111
The following animation shows how to create a dataset in the workspace landing page.
112112

@@ -146,14 +146,17 @@ titanic_ds = titanic_ds.register(workspace = workspace,
146146
description = 'titanic training data')
147147
```
148148

149+
>[!Note]
150+
> Datasets created via the workspace landing page are automatically registered to the workspace.
151+
149152
## Version datasets
150153

151154
You can register a new dataset under the same name by creating a new version. Dataset version is a way to bookmark the state of your data, so you can apply a specific version of the dataset for experimentation or future reproduction. Typical scenarios to consider versioning:
152155
* When new data is available for retraining.
153156
* When you are applying different data preparation or feature engineering approaches.
154157

155158
```Python
156-
# create a TabularDataset from new Titanic training data
159+
# create a TabularDataset from Titanic training data
157160
web_paths = [
158161
'https://dprepdata.blob.core.windows.net/demo/Titanic.csv',
159162
'https://dprepdata.blob.core.windows.net/demo/Titanic2.csv'

0 commit comments

Comments
 (0)